NVIDIA is a leading technology company specializing in AI systems development. They are seeking a Senior AI Software Engineer to innovate and develop technologies in the inference systems software stack, focusing on building libraries and GPU kernel technologies for efficient AI inference.
Responsibilities:
- Innovating and developing new AI systems technologies for efficient inference
- Designing, implementing, and optimizing kernels for high impact AI workloads
- Designing and implementing extensible abstractions for LLM serving engines
- Building efficient just-in-time domain specific compilers and runtimes
- Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
- Contributing to open source communities like FlashInfer, vLLM, and SGLang