Nomi Health is dedicated to transforming the healthcare system by eliminating inefficiencies and improving the patient experience. The company is seeking a Senior Manager of Cloud and DevOps Engineering to oversee its AWS and Kubernetes infrastructure and lead the team responsible for operational excellence and reliable service delivery.
Responsibilities:
- Lead by example through hands-on technical contributions (80%) while supporting team performance, mentorship, and delivery outcomes (20%)
- Run day-to-day operations of AWS across multiple accounts and environments — VPC, Transit Gateway, EC2, RDS, S3, IAM, EKS, ECR, ELB/NLB, Route 53, Transfer Family, and Lambda
- Operate our Kubernetes platform in production: EKS clusters, GitOps via ArgoCD, Helm, and supporting controllers (NGINX ingress, external-secrets, external-dns, Kyverno, Datadog Operator)
- Maintain and extend our infrastructure-as-code footprint — Terraform modules, Terraform Cloud, pipeline hygiene, and review practices that keep production safe from unintended changes
- Operate our secure file-transfer platform (SFTP / SFTPGo / AWS Transfer Family) to the specifications set by the business — uptime, access, encryption, and key management
- Own observability and FinOps execution — Datadog monitors, dashboards, log ingestion budgets and exclusion filters, Cloud Cost Management, and AWS Cost Anomaly Detection
- Drive release engineering and production deployment practices — go-live runbooks, release coordination, and post-mortem follow-through
- Partner with Security and Compliance to execute against SOC 2 and HITRUST audits, credential rotation, CVE monitoring and remediation, SIEM integration, pentest environment provisioning, and third-party access (VPN, Okta/Entra, Zscaler)
- Provide and operate the infrastructure underneath internal AI and automation tooling (n8n, kagent, agent-gateway, internal AI platform AWS account) so those teams can build on a stable surface
- Execute infrastructure-layer provisioning and teardown for client onboarding and termination — accounts, access, and credentials
- Manage, mentor, and grow a team of cloud and DevOps engineers; own sprint planning, on-call health, and delivery against the roadmap set with the VP of Technical Operations and Automation
Requirements:
- BS / MS in Computer Science or Engineering, or equivalent hands-on experience
- 7+ years of infrastructure engineering experience overall, with 3+ years leading or managing a DevOps, SRE, or Cloud Platform team
- A track record of reliably delivering against a roadmap: you enjoy making the trains run on time and making your team more effective, and you are energized by executing well within a defined architectural direction rather than setting that direction yourself
- Experience operating a platform team — where your team provides well-specified infrastructure surfaces and holds the boundary between platform and application concerns
- Deep AWS expertise — VPC, Transit Gateway, EC2, RDS, S3, IAM, EKS, ECR, ELB/NLB, Route 53, Lambda, Transfer Family, CloudWatch, CloudTrail, and multi-account environments
- Strong Kubernetes background — EKS in production, Helm, ArgoCD or another GitOps tool, and the common supporting controllers
- Strong Terraform experience, including module maintenance, Terraform Cloud, and reviewing changes in production environments
- Solid CI/CD and Git experience (GitHub Actions or equivalent), and comfort with Docker and container-based workloads
- Cloud security fundamentals — IAM design, IRSA, secrets management, key and credential rotation, CVE triage, network segmentation, and audit readiness
- Practical FinOps experience — you've had to bring a cloud or observability bill back under control and can describe how
- Experience operating in a regulated environment (SOC 2, HIPAA, or HITRUST) is strongly preferred given our healthcare context
- Experience with secure file transfer at scale (SFTP, SFTPGo, AWS Transfer Family, PGP/GPG) is a plus
- Experience running Datadog (or a comparable observability platform) at large scale
- Comfortable in Jira, Confluence, and GitHub, and familiar with Agile/Scrum delivery
- AWS Solutions Architect Associate or Professional certification is a plus, not a requirement