Modular is on a mission to revolutionize AI infrastructure by rebuilding the AI software stack. They are seeking a Backend Engineer to build a multi-cloud, multi-tenant platform for inference services, focusing on operational excellence and scalability.
Responsibilities:
- Build the multi-cloud, multi-tenant platform powering Modular’s inference services
- Build fault-tolerant, low-toil services that can use resources across a variety of hardware platforms and clouds (Tier 1 cloud providers and neoclouds)
- Push the envelope for operational excellence with request-to-kernel observability, multi-cloud deployments, cold-start optimizations, and more
- Build Helm charts, Kubernetes operators, and more to create simple, effective, maintainable deployments
Requirements:
- 5+ years of experience working in backend engineering
- Experience with Cloud Providers (AWS, GCP, Azure, neoclouds)
- Experience with Kubernetes and operating your own services
- A passion for building and operating high-performance, low-toil, observable systems
- Experience in machine learning technologies and use cases
- Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture
- Strongly identifies with our core company cultural values
- Practical experience implementing & maintaining security in multi-tenancy environments
- Experience working on high-scale ML inference infrastructure (traditional AI or genAI)
- Familiarity with Go (Golang)