Enhance NVIDIA's GPU Networking offerings for accelerating AI workloads
Design and prototype features and optimizations that accelerate data movement
Identify and evaluate new technologies, innovations and partner relationships
Develop and evaluate innovative features with respect to runtime systems, communication libraries
Requirements
Hold a M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering from a leading university (or equivalent experience)
5+ years of industry experience (or equivalent) in system architecture, AI systems architecture, scaling of AI, Parallelism of AI frameworks, or deep learning training workloads
Experienced in algorithm design, system programming, computer architecture and operating systems
Experienced in virtualization, networking and storage
Deep understanding of performance profiling and optimization techniques, together with defining and using hardware features
Strong programming background in C++, Python, and ideally CUDA or other GPU programming models
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment
Tech Stack
Python
Benefits
NVIDIA is committed to fostering a diverse work environment