SambaNova is a leading company in the AI space, providing a full-stack generative AI platform optimized for enterprise and government organizations. The Runtime Engineer will work on the runtime stack to support high-performance machine learning inference and training applications, collaborating with teams to enhance system software and user-facing tools.
Responsibilities:
- Design and implement new and enhanced features of the runtime stack to support high-performance, scalable ML inference and training applications
- Provide system software (driver and kernel) support for next-generation silicon
- Design user-space libraries for high performance and high utilization of HW resources
- Develop user-facing tools (analysis, job and HW management, profiling, debugging, etc.) for DataScale systems
- Collaborate with other teams, including Hardware, ML Application, Compiler, and DevOps
Requirements:
- Bachelor's degree in Computer Science, Computer Engineering, or equivalent, with 3-5 years of industry experience
- Proficiency in C/C++ and Python
- Experience with user-space libraries, operating systems, and kernel drivers
- Experience working with highly concurrent and distributed systems, with a focus on performance and scalability
- Experience with different types of fabrics, such as PCIe, InfiniBand, and RoCE
- Experience with fast networking stacks, such as RDMA
- Good communication skills and enthusiasm for helping colleagues