About this role

Modular is on a mission to revolutionize AI infrastructure by rebuilding the AI software stack. They are seeking a Backend Engineer to build a multi-cloud, multi-tenant platform for inference services, focusing on operational excellence and scalability.

Responsibilities:

Build the multi-cloud, multi-tenant platform powering Modular’s inference services
Build fault-tolerant, low-toil services that can use resources across a variety of hardware platforms and clouds (Tier 1 cloud providers and neoclouds)
Push the envelope for operational excellence with request-to-kernel observability, multi-cloud deployments, cold-start optimizations, and more
Build Helm charts, Kubernetes operators, and more to create simple, effective, maintainable deployments

Requirements:

5+ years of experience working in backend engineering
Experience with cloud providers (AWS, GCP, Azure, neoclouds)
Experience with Kubernetes and operating your own services
A passion for building and operating high-performance, low-toil, observable systems
Experience in machine learning technologies and use cases
Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture
Strong identification with our core company cultural values
Practical experience implementing & maintaining security in multi-tenancy environments
Experience working on high-scale ML inference infrastructure (traditional AI or genAI)
Familiarity with Go

Backend Engineer, Multi-cloud Inference Platform
