Airbnb is a global company that connects hosts and guests for unique stays and experiences. They are seeking a Senior Software Engineer to lead the AI Compute team in overseeing the lifecycle of their Kubernetes-based GPU platform, enhancing the Machine Learning engineering experience and ensuring operational efficiency.
Responsibilities:
- Provide technical leadership on high-impact projects
- Influence and coach a distributed team of engineers
- Drive reliability, cost efficiency and capability enhancements for GPU fleet
- Facilitate cross-team alignment on goals, outcomes, and timelines
- Manage project priorities, deadlines, and deliverables
- Contribute to and execute the multi-year strategy for Airbnb’s AI Compute Platform
- Design, develop, test, deploy, maintain, and enhance the Airbnb AI Compute Platform
Requirements:
- BS, MS or Ph.D. in computer science or related field, or equivalent work experience
- 5+ years of relevant work experience in infrastructure
- 2+ years of expertise with a public cloud provider (AWS, GCP, Azure) and their infrastructure as a service offering (e.g. EC2)
- Experience setting technical direction, planning, and successfully executing on large projects spanning multiple teams
- Kubernetes Experience is required
- Passionate about efficiency, availability, quality and developer experience
- ML Infrastructure (LLM fundamentals, tuning, optimization) Experience is preferred