Archetype AI is developing an AI platform aimed at bringing AI into the real world. The role focuses on building performant, scalable, and resilient distributed systems in collaboration with researchers and product teams to enhance AI capabilities in production.
Responsibilities:
- Architect, implement, and maintain distributed systems that support high-throughput, low-latency AI model inference and data services
- Partner with ML researchers and product teams to turn experimental models into production-grade services
- Continuously optimize performance across GPU clusters, cloud infrastructure, and backend systems
- Build tooling and observability to monitor system health, identify bottlenecks, and proactively resolve instability
- Introduce new techniques, architectures, and best practices to push the limits of scalability, efficiency, and reliability
- Own problems end-to-end—from design to deployment—with a strong bias toward quality, automation, and continuous improvement
- Balance rapid iteration on early-stage systems with long-term maintainability and architectural soundness
- Contribute to a culture of engineering excellence, mentorship, and team-first collaboration