Design, develop, fine-tune, and deploy generative AI models into scalable production environment
Build and maintain APIs and microservices using FastAPI to expose AI capabilities enterprise wide
Collaborate with the AI Infrastructure team to architect robust LLM pipelines, including training workflows and retrieval-augmented generation (RAG) systems
Integrate AI solutions into enterprise applications using secure, cloud-native architectures and best practices
Ensure AI models are explainable, reliable, and compliant with regulatory and internal governance standards
Continuously monitor and optimize model performance using evaluation frameworks, observability tools, and iterative fine-tuning
Requirements
Bachelor's degree in Computer Science, AI, or a related field (or equivalent professional experience)
8+ years of IT industry experience
Minimum 2+ years of hands-on AI development experience
3+ years of experience in Python programming
Proficient in Python with practical experience in LLMs, embeddings, vector databases and RAG architecture.
Demonstrated experience with generative AI models, including multimodal models
Hands-on experience with cloud-native AI infrastructure, including Azure AI Foundry or AWS Bedrock, Open AI Models and AI model governance frameworks
Tech Stack
AWS
Azure
Cloud
Microservices
Python
Benefits
100% paid medical, dental and vision premiums for you and your qualifying dependents
A 50% 401(k) match, up to the IRS maximum
20 days of PTO, plus 10 paid holidays
Family Support programs including 8 week Paid Primary Caregiver Leave, $10,000 fertility, family forming, and hormonal health assistance, and back-up child, adult, and elder care