Own infrastructure reliability and uptime across all environments — development, staging, and production
Design and implement scalable, resilient systems on AWS that support continued customer and device growth
Manage multi-region or highly available infrastructure with clear SLAs, failover patterns, and capacity planning
Identify and remediate security risks in infrastructure and deployment pipelines in partnership with engineering and security teams
Build and maintain CI/CD systems and GitOps workflows that enable product teams to deploy with speed and confidence
Improve developer tooling, deployment pipelines, and environment management to reduce operational burden on product engineering teams
Manage Kubernetes and containerized workloads; drive infrastructure-as-code practices using Terraform, Helm, and related tooling
Build and enhance observability practices — monitoring, logging, tracing, and alerting — that provide clear visibility into system health, performance, and risk
Establish and lead incident response practices including on-call structures, root cause analysis, and post-incident improvement processes
Define and execute the infrastructure and platform roadmap aligned with company growth and product needs
Establish operational processes, runbooks, and documentation that increase reliability, consistency, and knowledge transfer across the team
Drive infrastructure cost management — build visibility into cloud spend, identify waste, and align resource usage with business goals
Support new product initiatives with scalable platform solutions, ensuring DevOps considerations are embedded early in architecture decisions
Lead, mentor, and grow a high-performing DevOps team — foster a culture of ownership, collaboration, and operational excellence
Requirements
7+ years of experience in DevOps, SRE, or infrastructure engineering, with at least 3 years leading teams
Demonstrated track record of building or significantly maturing a DevOps function — not just operating within an established one
Experience operating and scaling systems to millions of users or connected devices
Strong experience with AWS and cloud-based infrastructure at production scale
Experience with Kubernetes and containerized workloads in production environments
Expertise in infrastructure as code (Terraform, Helm) and CI/CD pipeline design (GitOps workflows preferred)
Strong command of observability practices: metrics, logging, tracing, and alerting systems
Experience managing multi-region or highly available production systems with defined SLAs
Experience with infrastructure cost optimization — not just monitoring spend, but actively managing it
Tech Stack
AWS
Cloud
Kubernetes
Terraform
Benefits
Mission-driven company protecting kids and making a real difference for families
Competitive compensation commensurate with experience
Generous insurance coverage (up to 100% of premiums covered, based on tenure)
401(k) with employer match
Unlimited PTO and flexible scheduling where the role allows
Stock options
Pet insurance
An energetic, collaborative culture — and the best coworkers around