Lead the development of a real-time AI application leveraging AWS serverless architecture, FastAPI, and LLM-based pipelines.
Requirements
- Proven experience building production-grade GenAI features using LLM APIs and RAG pipelines
- Hands-on experience with serverless app development on AWS
- Solid understanding of FastAPI, microservices architecture, and REST/WebSocket API design
- Comfort with both relational (PostgreSQL) and NoSQL (DynamoDB) databases
- Experience working in a DevOps environment with CI/CD and infrastructure-as-code
- Experience with React
- Experience with TypeScript
Responsibilities
- Build and enhance React-based frontends hosted on CloudFront
- Design scalable, low-latency APIs using FastAPI (Python) integrated with API Gateway (REST + WebSocket)
- Develop AWS Lambda functions for backend services, data handling, and orchestration
- Implement scalable search functionality using OpenSearch
- Manage authentication using SSO and enable secure access flows
- Integrate real-time WebSocket interfaces for LLM streaming and dashboards
- Work closely with data science teams to connect LLM pipelines (LangChain + RAG) and vector search mechanisms
Other
- Hands-on role
- Hybrid - 3 days onsite
- Immediate start
- Excellent problem-solving skills and the ability to troubleshoot complex cloud-native systems