Archetype AI is developing an innovative AI platform aimed at transforming real-world data into valuable insights. They are seeking a backend engineer to build and maintain distributed systems that support high-throughput AI model inference and data services, working closely with researchers and product teams.
Responsibilities:
- Architect, implement, and maintain distributed systems that support high-throughput, low-latency AI model inference and data services
- Partner with ML researchers and product teams to turn experimental models into production-grade services
- Continuously optimize performance across GPU clusters, cloud infrastructure, and backend systems
- Build tooling and observability to monitor system health, identify bottlenecks, and proactively resolve instability
- Introduce new techniques, architectures, and best practices to push the limits of scalability, efficiency, and reliability
- Own problems end-to-end—from design to deployment—with a strong bias toward quality, automation, and continuous improvement
- Balance rapid iteration on early-stage systems with long-term maintainability and architectural soundness
- Contribute to a culture of engineering excellence, mentorship, and team-first collaboration