Build and deploy scalable LLM, RAG, and agent-based systems
Architect LLM inference and deployment pipelines
Optimize models for efficient and cost-effective production
Collaborate with data science, research, and product teams
Ensure clean code, testing, reproducibility, and CI/CD
Mentor junior engineers and drive engineering best practices
Ensure ethical, secure, and responsible AI development
Requirements
Requires a BA/BS degree in Information Technology, Computer Science or related field of study.
Strong problem-solving and analytical skills, with the ability to translate complex AI concepts into scalable engineering solutions.
Ability to rapidly prototype and iterate on GenAI and LLM-based applications, balancing innovation with performance and reliability.
Effective collaboration across cross-functional teams, including data scientists, researchers, and product stakeholders, to deliver impactful AI solutions.
Clear and concise communication skills, capable of presenting technical ideas to both technical and non-technical audiences.
Commitment to engineering excellence, including writing clean, maintainable code, conducting thorough code reviews, and following best practices in software development.
Proactive learning mindset, staying current with emerging trends in AI, GenAI, and agentic systems, and applying them to real-world problems.
Experience in mentoring and knowledge sharing, supporting junior engineers and contributing to team growth and capability building.
Ownership and accountability in delivering high-quality solutions under tight deadlines and evolving requirements.
Focus on reproducibility and reliability, using tools like Git, MLflow, and CI/CD pipelines to ensure consistent experimentation and deployment.
Ethical and responsible AI development, with awareness of safety, fairness, and privacy considerations in model design and deployment.