Baseten powers mission-critical inference for the world's most dynamic AI companies, and they are seeking an Infrastructure Software Engineer to build and maintain components of their ML inference platform. The role involves developing infrastructure components, implementing Kubernetes deployments, and enhancing monitoring systems for model performance metrics.
Responsibilities:
- Develop infrastructure components for our ML inference platform using Python and Go
- Implement and maintain Kubernetes deployments for model serving
- Contribute to our inference orchestration layer for model deployments
- Build and enhance monitoring systems for model performance metrics
- Implement efficient resource management solutions for ML workloads
- Support infrastructure automation to improve ML deployment workflows
- Work closely with team members to implement technical solutions
- Help balance performance optimization with system reliability
- Participate in technical discussions around infrastructure improvements
- Learn and apply infrastructure best practices
Requirements:
- Bachelor's degree or higher in Computer Science or related field
- Proficient coding abilities in one or more popular programming or scripting languages; Go proficiency is a plus
- Working knowledge of Kubernetes and containerization
- Basic understanding of machine learning concepts and model serving
- Familiarity with distributed systems concepts
- Experience with basic monitoring and logging tools
- Interest in ML/AI infrastructure and willingness to learn
- Strong collaboration and communication skills