Home
Jobs
Saved
Resumes
Deep Learning Performance Architect at NVIDIA | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Deep Learning Performance Architect
NVIDIA
Website
LinkedIn
Deep Learning Performance Architect
Shanghai, Shanghai, China
Full Time
1 week ago
Visa Sponsorship
Apply Now
Key skills
Python
C++
C
AI
Deep Learning
Performance Optimization
Agile
About this role
Role Overview
Develop highly optimized deep learning kernels for inference
Do performance optimization, analysis, and tuning
Work with cross-collaborative teams across automotive, image understanding, and speech understanding to develop innovative solutions
Occasionally travel to conferences and customers for technical consultation and training
Requirements
Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI)
SW Agile skills helpful
Excellent C/C++ programming and software design skills
Python experience a plus
Performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU
GPU programming experience (CUDA or OpenCL) desired
5 years of relevant work experience
Tech Stack
Python
Apply Now
Home
Jobs
Saved
Resumes