Atlan is a pioneering company focused on transforming data chaos into clarity through its active metadata platform. They are seeking a Staff Engineer to architect and scale foundational data systems that support AI applications, driving technical direction and multi-quarter initiatives.
Responsibilities:
- Design and build platform services—APIs, infrastructure components, runtime systems, and ingestion frameworks—at enterprise scale
- Architect the context store that transforms lakehouse infrastructure into AI-ready systems with multimodal capabilities (structured, unstructured, vector, graph)
- Solve complex multi-tenant isolation and scaling problems for enterprise SaaS
- Design data contracts governing ingestion, validation, processing, routing, storage, and serving across heterogeneous systems
- Own critical shared infrastructure including lakehouse (Iceberg/Polaris), vector stores, graph databases, and OLTP systems
- Drive technical standards through RFCs, architecture reviews, and documentation
- Mentor senior engineers and influence architecture decisions across teams
- Write production code using AI-assisted development tools (Claude Code, Cursor)
- Debug distributed systems issues across Kubernetes, workflow orchestration, and microservices
Requirements:
- 8+ years in platform engineering, infrastructure, or backend systems at a SaaS company
- Experience building enterprise-scale distributed systems
- Deep expertise in multi-tenant architectures and tenant isolation strategies
- Strong Kubernetes, containerization, and cloud infrastructure skills (AWS/GCP/Azure)
- Hands-on experience with distributed systems patterns—service mesh, event-driven architecture, orchestration
- Track record of driving multi-quarter technical initiatives from concept through production at scale
- You embrace AI-native development and want to pioneer new engineering workflows
- You have high agency and take ownership of ambiguous problems
- You're a strong async communicator who can influence without authority
- You're comfortable with fast-changing priorities in a scale-up environment
- You act as a force multiplier—elevating the technical bar for those around you
- Experience designing contract-driven or schema-first data platforms
- Familiarity with Temporal or similar workflow orchestration systems
- Experience with data quality frameworks, observability systems, and cost attribution at scale
- Experience supporting enterprise workloads with strict compliance requirements
- CI/CD pipeline design and GitOps practices