AWSCloudDynamoDBGraphQLJavaScriptKafkaMySQLNode.jsPostgresPythonReactTableauTypeScriptAIMLGenerative AILarge Language ModelsAnalyticsBIPower BIExpressGitHub ActionsServerlessCloudFormationLambdaS3RDSIAMCloudWatchSNSSQSBedrockAPI GatewayRESTfulGitHubCachingCI/CDOWASP
About this role
Role Overview
Design, build and operate serverless APIs and event-driven services on AWS (API Gateway, Lambda, Step Functions, Event Bridge)
Integrate document ingestion, OCR/ML extraction, and generative AI pipelines, normalize extracted data to a common schema
Build tooling and UIs for human-in-the-loop validation (React, TypeScript, back-end APIs)
Ensure security, observability, scalability, and cost efficiency across services (IAM, VPCs, monitoring, tracing, CI/CD)
Requirements
Hands-on experience with AWS Lambda Node.js or Python runtime including cold start optimization, memory tuning, and concurrency limits
API Gateway REST and/or HTTP APIs routing, authorization, request/response transformations, throttling, and caching
Event driven services Event Bridge, SNS, and SQS for asynchronous pipelines, retry semantics, DLQs, and at least once/exactly once considerations
Orchestration AWS Step Functions Standard and Express for coordinating multi-step workflows and long-running jobs
Storage Amazon S3 for object storage, lifecycle policies, versioning, presigned URLs, and understanding of S3 performance and consistency model
Databases DynamoDB single table design, GSIs, transactions, and/or RDS Postgres/MySQL depending on relational needs, familiarity with choosing the right persistence model
IAM security Least privilege IAM policies, resource based policies, KMS for encryption and secure access patterns
Generative AI / ML and document extraction
Experience integrating large language models (LLMs) and generative AI APIs e.g., Open AI, Anthropic, AWS Bedrock or self-hosted models and designing safe cost-efficient prompt strategies, batching and caching
Basic familiarity with document OCR and extraction tools (i.e., AWS Textract, Tesseract) and third-party OCR/IDP Intelligent Document Processing platforms
Knowledge of structured information extraction approaches, prompt engineering for LLMs, retrieval augmented generation (RAG), text embeddings, and vector similarity search
Data normalization, schema mapping, designing canonical schemas, entity extraction, schema matching and validation, and transformation pipelines
API design and eventing (RESTful and/or GraphQL), API design best practices, versioning, pagination, error handling, idempotency, and spec-driven development
Open API, Async API, Idempotent operations, concurrency control, optimistic locking, and sequence guarantees where required
Event modeling, designing domain events, event contracts, event versioning, and event driven architectures with durable delivery (Event Bridge, SNS/SQS, Kafka)
Frontend and human-in-the-loop tools, Frontend frameworks (React preferred), TypeScript, component libraries, and state management for building validation UIs and dashboards
UX for human-in-the-loop designing review/approve/reject workflows, diffing extracted vs. original content, confidence indicators, and bulk validation tools
Micro Frontend
Implement micro frontends to compose UX components as independently deployable versioned modules
Analytics visualization (integrating charts, graphs for accuracy metrics, error rates, model drift indicators) leveraging tools like Power BI or Tableau
Observability, reliability, and security (Monitoring, tracing, CloudWatch metrics, logs, XRay, or Open Telemetry tracing, structured logging, and building dashboards, alerts for SLOs and error budgets)