NTT DATA is a leading business and technology services provider committed to responsible innovation and client success. They are seeking an AI Application Engineer to support the development of next-generation AI-powered applications, focusing on production-grade LLM application engineering and orchestration of AI pipelines.
Responsibilities:
- Design, develop, and optimize production-grade LLM-powered applications
- Own AI quality, RAG accuracy, prompt engineering, and AI safety across multiple applications
- Develop and maintain multi-step LLM orchestration pipelines using LangChain, LlamaIndex, or custom frameworks
- Implement and optimize RAG pipelines including chunking strategies, embedding selection, reranking, and hybrid search
- Design multi-turn conversational AI experiences with context management and session memory
- Integrate NVIDIA technologies including NIM, NeMo, NeMo Guardrails, and Riva into enterprise AI applications
- Build automated evaluation pipelines for model quality, hallucination detection, regression testing, and release gating
- Perform latency profiling and optimization across multi-step LLM call chains
- Implement AI safety guardrails including prompt injection prevention, jailbreak mitigation, and topical control
- Collaborate with globally distributed engineering and product teams to deliver scalable AI solutions
- Support deployment, monitoring, and continuous improvement of AI applications in production environments
Requirements:
- 4+ years of software engineering experience with at least 2 years focused on production LLM application development
- 4+ years of experience with Python for AI/ML application development and async programming
- 3+ years of experience with multi-step LLM orchestration frameworks such as LangChain or LlamaIndex
- 3+ years of experience designing and optimizing RAG pipelines and retrieval systems
- 3+ years of experience with vector databases, similarity search tuning, and reranking techniques
- Must reside in the US