Optum is a global leader in health care innovation, developing cutting-edge solutions to improve health systems. The Senior DevOps Engineer will support and operate cloud-based AI and data platforms, focusing on reliability, scalability, and security, while collaborating with engineering teams to troubleshoot issues in production environments.
Responsibilities:
- Design, implement, and support cloud based infrastructure for AI, data, and application platforms
- Provide DevOps and AIOps support for production AI/ML and data applications
- Support MLOps and LLMOps practices, including deployment, monitoring, and operational troubleshooting
- Troubleshoot and resolve cloud application issues and vulnerabilities to ensure platform stability and security
- Collaborate with engineering teams to support application deployments and operational readiness
- Contribute hands on to cloud based development and automation efforts
- Leverage enterprise‑approved AI tools to streamline workflows, automate tasks, and drive continuous improvement
Requirements:
- 8+ years of hands on experience supporting cloud based infrastructure in production environments (AWS, Azure, or GCP)
- 5+ years of experience applying DevOps practices (CI/CD, infrastructure as code, deployment automation) in enterprise systems
- Proven solid understanding of cloud networking: VNETs, Subnets, Load Balancers, Security Groups
- Demonstrated experience supporting AI/ML workloads in production, including exposure to AIOps, MLOps, or LLMOps use cases
- Experience with Infrastructure as Code (IaC) development using Terraform
- Proven ability to troubleshoot and resolve production incidents, including root cause analysis for cloud applications and services
- Experience remediating cloud security issues or vulnerabilities in coordination with engineering or security teams
- Python development experience used for automation, tooling, or operational support (e.g., scripts, pipelines, monitoring utilities)
- Containerization: Experience with Docker and Kubernetes
- Experience supporting AI/ML or data driven platforms
- Experience addressing cloud security and vulnerability remediation
- Demonstrated familiarity with enterprise scale cloud environments
- Reside in Minnesota
- All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy