Tecton is a company that solves complex data problems in production machine learning. They are seeking a Staff Software Engineer to develop and implement high-performance online infrastructure for AI inference, ensuring low latency and high availability across multiple cloud platforms.
Responsibilities:
- Develop and communicate a clear 18-month technical vision to align the team and guide our development efforts
- Architect and implement solutions to scale our serving platform to handle millions of requests per second with low latency and high availability
- Evolve Tecton’s query execution engine to support complex, multi-stage queries with user-defined Directed Acyclic Graphs (DAGs)
- Build an integrated observability solution that provides an exceptional operational experience with logs, metrics, and traces
- Launch our serving infrastructure across multiple cloud platforms, ensuring compliance with security protocols and data residency requirements
- Assess and prioritize tasks, demonstrating a keen awareness of performance-critical areas
Requirements:
- 7+ years of experience in programming, debugging, and performance tuning distributed and/or highly concurrent software systems
- Degree in Computer Science, Software Engineering, or a related field, or equivalent practical experience, with strong proficiency in building high throughput infrastructure
- Experience with Database Query Engines
- Experience with at least one of AWS, GCP
- Experience with low latency online storage like DynamoDB, Redis, and BigTable
- Experience with Data warehouses like Snowflake, BigQuery, Object Storage like S3