NVIDIA is looking for outstanding software engineers to help us expand our enterprise GPU management and monitoring tools. In this role, you will work closely with the broader NVIDIA team to design and build cloud-native management agents and Kubernetes integrations that enhance GPU system integration and management.
Responsibilities:
- Develop and maintain distributed, robust and scalable Go programs deployed to Kubernetes environments that manage large datacenters
- Develop and maintain user-space applications, containers, Go-bindings, and CLI tools
- Enable GPU management integration with the state-of-the-art open-source ecosystem, including Kubernetes and Docker
- Support internal and external users through bug fixes, documentation, and feature improvements
- Maintain high-quality products through robust test coverage