About this role
Role Overview
Build and maintain enterprise feature stores and real-time data lakes (BigQuery, Pub/Sub) to streamline predictive modeling and NLP workflows.
Design and implement resilient, secure multi-cloud architectures (GCP and Azure) using Terraform (IaC) to support high-availability AI operations.
Operationalize ML models using Python, PySpark, and Kafka for both batch and real-time inference.
Leverage advanced orchestration frameworks such as LangChain, LangGraph, CrewAI, and Vertex AI Agent Builder to develop sophisticated agentic AI solutions.
Implement end-to-end CI/CD pipelines for automated model training, deployment, and seamless rollback capabilities.
Standardize LLMOps and MLOps observability practices using platforms such as LangSmith and Arize.
Collaborate with data scientists to understand model objectives and translate them into technical specifications.
Deploy AI models into multiple environments using CI/CD pipelines, ensuring seamless integration with existing systems.
Establish strong observability and data governance using tools such as Grafana, OpenTelemetry, Splunk, or Arize to monitor pipeline health and model behavior.
Implement security measures to protect sensitive data and ensure compliance with industry regulations.
Requirements
Bachelor’s or master’s degree in computer science, engineering, or a related field
Proven experience in cloud infrastructure management; experience with GCP, Vertex AI, MLOps, and Terraform strongly preferred
Strong understanding of Large Language Models and experience in model development and deployment
Proficiency in programming languages such as Python, Java, or similar
Familiarity with CI/CD tools and practices
Excellent problem-solving skills and the ability to work in a fast-paced environment
Strong communication and collaboration skills
Familiarity with AutoML platforms such as Vertex AI AutoML and DataRobot, and with open-source platforms such as Snorkel and H2O.ai
Tech Stack
Azure
BigQuery
Cloud
Google Cloud Platform
Grafana
Java
Kafka
PySpark
Python
Splunk
Terraform
Benefits
Comprehensive benefits: Medical, dental, vision, 401(k) match, paid time off, PTO cash out
Support for you and your family: Family resources, EAP counseling sessions, access to Headspace®, backup child and elder care, maternity/paternity leave, and more
Professional development programs: DaVita offers a variety of programs to help strong performers grow in their careers, as well as on-demand virtual leadership and development courses through DaVita’s online training platform, StarLearning.