NVIDIA is a leader in computer graphics and accelerated computing, now focusing on the potential of AI. The role involves building groundbreaking agentic AI systems for the CUDA ecosystem and defining technical direction across multiple teams to enhance engineering workflows and capabilities.

Responsibilities:

Set strategy and lead execution for agentic AI systems for the CUDA ecosystem, defining roadmaps and measurable success metrics (performance, quality, reliability, developer productivity)
Co-design agentic system solutions with software, hardware and algorithm teams; influence and adopt new capabilities as they become available
Develop reproducible, high-fidelity evaluation frameworks covering performance, quality and developer productivity
Collaborate across the AI stack and help drive architecture and key technical decisions —from hardware through compilers/toolchains, kernels/libraries, frameworks, distributed training, and inference/serving—and with model and research/engineering teams
Scale impact through leadership: mentor and grow senior technical talent

Requirements:

Bachelor's degree in Computer Science, Electrical Engineering, or related field (or equivalent experience)
17+ years industry and/or academia experience with AI systems development; strong exposure to building foundational models, agents or orchestration frameworks; hands-on experience with deep learning frameworks and modern inference stacks
Strong C/C++ and Python programming skills; solid software engineering fundamentals; ability to set engineering standards and review architecture at scale
Experience with GPU programming and performance optimization (CUDA or equivalent)
Proven track record leading large, cross-team efforts from concept through production, including navigating ambiguity, aligning stakeholders, and delivering measurable outcomes
MS or PhD preferred
Track record building/evaluating deep learning models, coding agents and developer tooling, and driving broad adoption across teams or customers
Demonstrated ability to optimize and deploy high-performance models, including on resource-constrained platforms. Deep expertise in GPU performance optimizations, evidenced by benchmark wins or published results
Publications or open-source leadership in deep learning, multi-agent systems, reinforcement learning, or AI systems; contributions to widely used repos or standards
Experience leading projects end-to-end, mentoring small teams; ability to drive concepts to production
Recognized technical leadership (e.g., setting platform direction, creating widely used architectures/APIs, or establishing evaluation/benchmarking standards)

Distinguished Engineer – High Performance AI

Key skills

About this role

Responsibilities:

Requirements: