Baseten powers mission-critical inference for leading AI companies, and they are seeking an Applied AI Inference Engineer to partner directly with customers in building and deploying high-scale production AI applications. This role involves hands-on coding, software development, and collaborating with customers to translate business goals into reliable services.
Responsibilities:
- Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects
- Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey including: sales, implementation, and expansion
- Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers
- Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs
- Own products and customer projects end-to-end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution
- Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity
- Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates
Requirements:
- Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field
- 1+ years of professional work experience in a fast-paced, high-growth environment
- Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python
- Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment
- Strong communication skills, particularly on complex technical topics
- Experience in building or optimizing AI/ML projects is highly valued