Capital One is an industry leader in using machine learning to create real-time, personalized customer experiences. They are seeking a Senior Lead AI Engineer to partner with cross-functional teams to deliver AI-powered products and support the development of AI software components, contributing to the technical vision and roadmap of foundational AI systems.
Responsibilities:
- Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One
- Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability
- Leverage a broad stack of open source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more
- Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems
- Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One
Requirements:
- Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies
- At least 6 years of experience programming with Python, Go, Scala, or Java
- At least 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
- Experience designing, developing, integrating, delivering, and supporting complex AI systems
- Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders
- Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
- Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
- Passion for staying abreast of the latest AI research and AI systems, and for judiciously applying novel techniques in production
- Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers