Optum is a global leader in health care innovation, developing cutting-edge solutions for healthier living. The Senior AI ML Engineer will support AI initiatives by building and productionizing AI systems while ensuring model quality and compliance.
Responsibilities:
- Responsible AI & Compliance
- Building & Productionizing AI Systems (agentic services, APIs/SDKs, RAG/CAG, E2E pipelines)
- Ensure Model Quality, Evaluation, Security & Safety
- Implement Observability, Operations excellence
- Drive Architecture, Governance & Engineering Standards
- Cross Functional Leadership & Collaboration
Requirements:
- 5+ years in Observability & Monitoring
- SLOs for latency/cost/error; drift/skew detection; tracing/telemetry (OpenTelemetry style); alerting (eg, Prometheus/Grafana equivalents)
- 3+ years in Programming & Packaging
- Python (typing, pytest, packaging), SQL; shell for automation
- 3+ years in ML / DL Frameworks
- scikit learn, XGBoost; PyTorch or TensorFlow/Keras for deep learning
- 3+ years in GenAI & Agents
- LLM evaluation methods; RAG pipelines; prompt/route registries; LangChain style orchestration; model hubs (eg, Model Garden / Hugging Face)
- 3+ years in Pipelines & Orchestration
- Kubeflow or Vertex AI Pipelines; Apache Airflow/Composer
- 3+ years in Model Registry, Experiment Tracking & A/B
- Model registry (eg, Vertex AI Model Registry/MLflow); experiment tracking; canary/A B rollout
- 3+ years in Serving & Inference
- Batch/online endpoints, autoscaling; versioning and rollback strategies
- 3+ years in MLOps / DevOps
- CI/CD (GitHub Actions or equivalent); IaC; secrets/certs; containerization (Docker) and orchestration (Kubernetes)
- 3+ years in Responsible AI & Compliance
- Explainability/fairness testing; PHI/PII handling; model cards; AIRB/RAI artifacts and audit ready evidence
- 3+ years in Security
- Access controls, key management, secrets hygiene; secure data paths end to end
- 1+ years in Cloud Platforms
- Public Cloud (Vertex AI, GPUs/TPUs), AWS/Azure AI services