Design, build, and operate core systems that enable autonomous agents to function reliably in production
Build production-grade agentic workflows, retrieval and memory systems, multi-model execution, and tool-calling integrations that interact safely with enterprise systems
Explore new approaches, prototype quickly, and turn what works into durable production systems
Ensure systems must be reliable, secure, observable, debuggable, and maintainable under real-world conditions
Requirements
Have built and operated complex backend or distributed systems in production
Have built LLM-powered or AI-native systems beyond demos, with real users and real constraints
Have strong judgment around reliability, security, observability, and failure modes
Are comfortable operating in ambiguous frontier areas and validating ideas through rapid iteration
Use AI as a core part of your development workflow, not as an occasional convenience
Operate with high ownership and autonomy and take systems end-to-end
TypeScript required, Python strongly preferred
Strong SQL proficiency
Experience with production infrastructure; Docker/Kubernetes experience is a plus
Familiarity with enterprise security patterns is a plus
Domain familiarity with DevOps, SecOps, or infrastructure automation is a plus