Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. The role focuses on designing and operating Kubernetes-based platforms on on-prem data centers, enabling engineers and customers to run workloads efficiently on Tenstorrent hardware.
Responsibilities:
- Design and build platform services for workload orchestration, ML services, and internal development workflows
- Develop APIs and systems that enable users and services to interact with infrastructure platforms
- Own Kubernetes-based platforms including cluster lifecycle, scaling, and operational maturity
- Integrate platform systems with CI/CD pipelines, GitOps workflows, and internal tooling
- Partner with SRE, infrastructure, and deployment teams to support large-scale internal and external environments
Requirements:
- Experienced backend or infrastructure engineer with a focus on platform development in large-scale environments
- Strong expertise in Kubernetes, including cluster provisioning, operators, and production debugging
- Proficient in Python or Go for building APIs and platform services
- Comfortable working with Linux systems, networking fundamentals, and distributed systems
- Collaborative and adaptable, able to work across engineering, infrastructure, and deployment teams