NVIDIA is a leading technology company known for its innovations in GPU technology and AI computing. The role of Senior Systems Software Engineer involves developing and sustaining Linux-based OS images for NVIDIA platforms, collaborating with software and hardware teams to enhance system infrastructure.
Responsibilities:
- NVIDIA BaseOS is the Linux distribution of the NVIDIA GPU based platforms deployed in AI, Data-Analytics, Autonomous vehicle industries. Your role as a Base OS Engineer is to develop the Ubuntu and RHEL based OS image flavors across platforms
- Your role as a Base OS Engineer will also include sustaining the existing NVIDIA DGX and HGX systems
- Work closely with System/Software development teams to build the common infrastructure for automation and use the latest industry standard tools!
- Work closely with Hardware and Firmware development teams to bring up new NVIDIA platforms, from power on to OS boot
Requirements:
- BS or Masters in computer science, or related field or equivalent experience
- 5+ years of experience
- Excellent C and Python programming skills. Your expertise in coding is essential to your success in this role
- Familiarity with at least one major Linux distro (Ubuntu, RedHat, SLES) is required
- Knowledge of OS concepts such as filesystems, job scheduling, device drivers, and storage, is required
- Experience with complex system-level debugging is invaluable, from X/Wayland to systemd to analyzing crash dumps to install issues
- Excellent communication and planning skills
- Experience creating your own Debian or RPM package and repository is highly desirable
- Knowledge of virtualization concepts such as device passthrough and SR-IOV
- Background in PCIe, USB, and similar low level protocols is recommended. Experience with UEFI and BMC firmware is strongly favored
- Basic knowledge of tools deploying Containers or Kubernetes technologies is preferred
- Having written software in GitHub with exposure to opensource development life cycle is desired