Baseten powers mission-critical inference for the world's most dynamic AI companies, and they are seeking a customer-obsessed software engineer to help ship their training products. The role involves owning features from conception to MVP, working closely with research engineers to enhance model development and improve user workflows.
Responsibilities:
- Iterate like crazy
- Design ergonomic APIs and abstractions to model complex resources and lifecycles
- Work throughout the stack (API layer, backend and database implementation, infra layer; frontend is a plus) to implement features
- Fine-tune and deploy models to develop intuition around training workflows
- Partner closely with model developers and world-class research engineers to understand the requirements and pain points of post-training workflows
- Drive long-term improvements to system reliability and development velocity
- Fix bugs and resolve customer issues with urgency
Requirements:
- 5+ years of experience building software applications
- Deep knowledge of the web stack, databases, and distributed systems
- Experience developing developer tooling or infrastructure products for external or internal users
- Good taste in product, particularly developer-oriented tools
- Interest in ML/AI infrastructure and willingness to learn
- Driven by high agency and ownership
- Strong communication skills with the ability to bridge technical depth and business needs
- Experience launching features and products through different release cycles (MVP, Beta, GA, etc.)
- Experience with model development methods and paradigms, such as Supervised Fine-Tuning, Reinforcement Learning, Synthetic Data Generation, LoRA, and Full Finetunes
- Familiarity or experience with the open source training stack and frameworks (NCCL, PyTorch, Megatron, NemoRL, VeRL, Axolotl, HF Trainer) and distributed training techniques (FSDP, DeepSpeed)
- Experience developing AI products, tooling, or agents
- Frontend fluency