NVIDIA is a leader in computer graphics, PC gaming, and accelerated computing, now focusing on AI to redefine computing. The role involves designing, developing, and optimizing software for next-generation system-on-chips (SoCs) to enhance performance and efficiency in datacenter products.
Responsibilities:
- Design, develop, test, and optimize software for our next-generation SoCs, in both pre-silicon and post-silicon phases of execution
- Analyze and identify performance bottlenecks in the SOC
- Using strong communication skills, analysis documents and communications to internal and/or external audiences about our technology
- Competitive analysis comparing uArchitecture & workload performance metrics on NVIDIA's ARM SoCs against emerging processors from other silicon vendors
- Develop tools and use benchmarks to analyze performance
Requirements:
- BS or MS degree in Computer Engineering, Computer Science, or related degree (or equivalent experience)
- 2+ years of relevant computer architecture or SW development experience
- Hands on technical experience and demonstrated excellence in an environment with complex software and hardware designs
- Strong understanding of multicore hardware, operating systems design, concurrency, virtual memory, caching, interrupts, device drivers and real-time programming
- Experience in performance analysis, data analysis and performance optimization
- Expertise in ARM architecture and SW ecosystem
- Proficient in analyzing, debugging and tuning performance of complex system software stacks
- Experience with CPU server system workloads and performance analysis
- Familiarity with CUDA programming and/or GPUs
- Experience with HPC or large-scale computing environments