Own the day-to-day operation of our AWS and Kubernetes infrastructure across multiple business units
Lead a team that delivers reliably against a roadmap set in partnership with senior technical leadership
Partner closely with the VP of Technical Operations and Automation, who serves as the architecture lead for DevOps
Review a Terraform PR, debug a production issue, and coach your engineers through hard problems
Responsible for the platform meeting its specifications — uptime, security, throughput, access
Requirements
BS / MS in Computer Science or Engineering, or equivalent hands-on experience.
7+ years of infrastructure engineering experience overall, with 3+ years leading or managing a DevOps, SRE, or Cloud Platform team.
A track record of reliably delivering against a roadmap — you're excited by making the trains run on time and making your team more effective, and you're energized by executing well within a defined architectural direction rather than setting that direction yourself.
Experience operating a platform team — where your team provides well-specified infrastructure surfaces and holds the boundary between platform and application concerns.
Deep AWS expertise — VPC, Transit Gateway, EC2, RDS, S3, IAM, EKS, ECR, ELB/NLB, Route 53, Lambda, Transfer Family, CloudWatch, CloudTrail, and multi-account environments.
Strong Kubernetes background — EKS in production, Helm, ArgoCD or another GitOps tool, and the common supporting controllers.
Strong Terraform experience, including module maintenance, Terraform Cloud, and reviewing changes in production environments.
Solid CI/CD and Git experience (GitHub Actions or equivalent), and comfort with Docker and container-based workloads.
Cloud security fundamentals — IAM design, IRSA, secrets management, key and credential rotation, CVE triage, network segmentation, and audit readiness.
Practical FinOps experience — you've had to bring a cloud or observability bill back under control and can describe how.
Experience operating in a regulated environment (SOC 2, HIPAA, or HITRUST) is strongly preferred given our healthcare context.
Experience with secure file transfer at scale (SFTP, SFTPGo, AWS Transfer Family, PGP/GPG) is a plus.
Experience with Datadog (or a comparable observability platform) at serious scale.
Comfortable in Jira, Confluence, and GitHub, and familiar with Agile/Scrum delivery.
AWS Solutions Architect Associate or Professional certification is a plus, not a requirement.