Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. They are seeking an Infrastructure and Platform Engineer to design and operate Kubernetes-based platforms on on-prem data centers, enabling efficient workload management on Tenstorrent hardware.
Responsibilities:
- Design and build platform services for workload orchestration, ML services, and internal development workflows
- Develop APIs and systems that enable users and services to interact with infrastructure platforms
- Own Kubernetes-based platforms including cluster lifecycle, scaling, and operational maturity
- Integrate platform systems with CI/CD pipelines, GitOps workflows, and internal tooling
- Partner with SRE, infrastructure, and deployment teams to support large-scale internal and external environments
Requirements:
- Experienced backend or infrastructure engineer with a focus on platform development in large-scale environments
- Strong expertise in Kubernetes, including cluster provisioning, operators, and production debugging
- Proficient in Python or Go for building APIs and platform services
- Comfortable working with Linux systems, networking fundamentals, and distributed systems
- Collaborative and adaptable, able to work across engineering, infrastructure, and deployment teams