Sight Machine is a company focused on transforming manufacturing through innovative data solutions. The Infrastructure Engineer role involves owning and evolving cloud infrastructure, implementing CI/CD pipelines, and driving automation to enhance operational efficiency.
Responsibilities:
- Owning and evolving our Kubernetes-based cloud infrastructure across Azure and other providers, including fleet management, networking, and cluster operations at scale
- Designing and implementing CI/CD pipelines that let the engineering team ship faster and with more confidence, including automated testing, progressive delivery, and rollback capability
- Building AI-assisted automation for operational tasks: runbook generation, anomaly triage, alerting logic, and anywhere else we can eliminate repetitive human intervention without sacrificing control
- Driving Infrastructure as Code discipline across the platform (Terraform, Helm, FluxCD) so that every environment is reproducible, auditable, and fast to recover
- Building and maintaining monitoring and observability infrastructure that gives the team real signal across our stack, from container health to database performance to customer-facing SLAs
- Participating in on-call rotation and using every incident as a forcing function to improve the system: better runbooks, better alerting, better automation
- Collaborating closely with Development Engineering to close the gap between what gets built and what gets operated well in production