Hubscale is a well-funded security platform undergoing a major technical evolution, focusing on rebuilding infrastructure for scale and integrating AI into their product. They are seeking a Staff Site Reliability Engineer to set technical direction for reliability and infrastructure, own a Kubernetes environment on GCP, and enhance CI/CD pipelines.
Responsibilities:
- Set technical direction for reliability and infrastructure
- Own and evolve a Kubernetes (GKE) environment on GCP
- Define SLIs/SLOs and improve observability across the platform
- Enhance CI/CD pipelines and streamline Bazel build systems
- Lead incident response and drive automation-first reliability
Requirements:
- Strong background in SRE / Platform / Infrastructure engineering
- Deep hands-on experience with Kubernetes and GCP
- Solid programming skills (Go preferred)
- Experience improving reliability at scale through automation & tooling
- Practical use of AI coding tools (Copilot, Claude, etc.)