Clover Health is transforming healthcare with its innovative primary care tool, Counterpart Assistant. The company is seeking a Senior Manager of Site Reliability Engineering to lead its SRE team, with a focus on making the infrastructure reliable, scalable, and cost-efficient while fostering proactive collaboration with product engineering teams.
Responsibilities:
- Lead and grow our SRE team of ~10 engineers, including hiring, retention, career development, and performance management across multiple time zones (US, HK, NZ)
- Build strategic partnerships with product engineering pillars — shifting SRE from reactive, ticket-based support to proactive co-ownership of reliability outcomes
- Scale our multi-tenant infrastructure to support new customer onboarding and growing patient populations
- Own cloud cost management and FinOps practices, building frameworks that balance cost control with reliability and performance
- Champion developer self-service and platform engineering, building self-service capabilities so product teams can manage routine operations without filing SRE tickets
- Establish SLOs/SLIs for critical services and improve alert quality so every page is meaningful
- Ensure the SRE team uses AI tooling in its workflows at the same level as the rest of engineering, including tools like Claude Code for IaC generation, log analysis, root cause investigation, and automation of repetitive work
Requirements:
- 6+ years managing an SRE team
- 10+ years of hands-on SRE or infrastructure engineering experience
- Deeply comfortable with our core stack: Kubernetes, GCP (GKE, Cloud SQL, Pub/Sub, GCS), Terraform, Helm, ArgoCD, PostgreSQL, and Prometheus/Grafana
- Strong programming skills in Python and/or Go
- Comfortable writing and reviewing infrastructure tooling code, including using AI coding tools
- Experience with CI/CD pipelines (GitHub Actions)
- Track record of building or improving developer tooling and automation
- Sound build vs. buy judgment
- Experience leading teams across multiple time zones
- Track record of developing engineers into strong technical contributors