Lead the development and integration of new features for Red Hat AI & Cloud, utilizing Nvidia’s cutting-edge technologies (GPUs, DPUs, and more).
Write and optimize code that integrates hardware accelerators into the Red Hat portfolio, ensuring high performance for AI/ML workloads.
Take ownership of project modules, ensuring they meet rigorous non-functional requirements such as security, resiliency, and maintainability.
Partner with UX, UI, and QE teams to build seamless user experiences for our partners and customers.
Maintain high code quality through rigorous peer reviews and automated testing (CI), and proactively address security vulnerabilities.
Coach and guide junior and mid-level engineers, sharing technical knowledge to foster a culture of continuous learning.
Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling.
Requirements
4+ years of relevant technical experience in software development
Advanced experience working in a Linux environment with at least one language like Golang, Python, Java, C, or C++
Advanced experience with a container orchestration ecosystem like Kubernetes, or Red Hat OpenShift
Strong experience with microservices architectures and concepts including APIs, versioning, monitoring, etc.
Virtual Networking / Storage / Compute experience
Ability to quickly learn and guide others on using new tools and technologies
Proven ability to innovate and a passion for staying at the forefront of technology
Excellent system understanding and troubleshooting capabilities
Autonomous work ethic, thriving in a dynamic, fast-paced environment
Technical leadership acumen in a global team environment
Proficient written and verbal communication skills in English
Experience with cloud development for public cloud services (AWS, GCE, Azure)
Hands-on experience with Nvidia CUDA/DOCA or performance profiling for GPUs
Background in DevOps or site reliability engineering (SRE)
Experience with hardware accelerators (e.g., GPUs) for AI workloads
Recent hands-on experience with distributed computation, either at the end-user or infrastructure provider level
Experience with performance analysis tools
Tech Stack
AWS
Azure
Cloud
Java
Kubernetes
Linux
Microservices
OpenShift
Python
Go
Benefits
Comprehensive medical, dental, and vision coverage
Flexible Spending Account
healthcare and dependent care
Health Savings Account
high deductible medical plan
Retirement 401(k) with employer match
Paid time off and holidays
Paid parental leave plans for all new parents
Leave benefits including disability, paid family medical leave, and paid military leave
Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!