Own and evolve AWS infrastructure supporting data platform, ML training, and analytics workloads (Iceberg/Trino, ETL pipelines, Kubeflow/MLflow)
Design, deploy, and maintain EKS-based services and Kubernetes workloads
Build and manage Terraform infrastructure across environments (dev/staging/prod)
Design and maintain CI/CD pipelines for infrastructure and application deployment (GitLab/GitHub)
Operate and improve Kafka/Redpanda clusters
Improve reliability, observability, and performance of prediction services
Support Dagster for data workflow orchestration
Collaborate with Data Science, ML Engineering, and Data Engineering to productionize models and data pipelines
Strengthen AWS IAM, networking, and connectivity between cloud and on-prem systems
Support cyber hardening efforts with our RTX cyber team
Identify and incrementally improve existing infrastructure and deployment patterns
Requirements
Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and minimum 8 years prior relevant experience or an Advanced Degree in a related field and minimum 5 years of experience
Proficiency with Python.
Strong experience with AWS (EKS, IAM, VPCs, networking)
Hands-on Kubernetes experience operating production workloads
Experience managing Kafka (or Redpanda) in production environments
Proficiency with Terraform for infrastructure as code
Experience building and maintaining CI/CD pipelines (GitLab, GitHub Actions, or similar)
Solid understanding of distributed systems, reliability, and scaling
Experience supporting production data pipelines or ML systems
Experience with observability stacks (Prometheus, Grafana, Thanos, etc.)
Tech Stack
AWS
Cloud
Distributed Systems
ETL
Grafana
Kafka
Kubernetes
Prometheus
Python
Terraform
Benefits
Medical, dental, and vision insurance
Three weeks of vacation for newly hired employees
Generous 401(k) plan that includes employer matching funds and separate employer retirement contribution, including a Lifetime Income Strategy option
Tuition reimbursement program
Student Loan Repayment Program
Life insurance and disability coverage
Optional coverages you can buy: pet insurance, home and auto insurance, additional life and accident insurance, critical illness insurance, group legal, ID theft protection
Birth, adoption, parental leave benefits
Ovia Health, fertility, and family planning
Adoption Assistance
Autism Benefit
Employee Assistance Plan, including up to 10 free counseling sessions