Milliman is a global consultancy whose services include healthcare analytics, delivered through its subsidiary, MedInsight. The company is seeking a skilled Data Engineer to design, build, and optimize scalable data solutions that support healthcare insights. The role involves building data pipelines, optimizing data workflows, and collaborating with product, analytics, and engineering teams to enhance data-driven decision-making.
Responsibilities:
- Build & Enhance Pipelines: Design, develop, and maintain scalable data pipelines to ingest, transform, and enrich complex healthcare data using Databricks and Spark
- Optimize Data Workflows: Analyze and improve data intake processes and optimize Spark SQL/Python workloads for performance, scalability, reliability, and cost efficiency
- Design Data Models: Develop and maintain data marts, semantic models, and curated datasets that support analytics products, reporting, and business intelligence initiatives
- Ensure Data Quality & Reliability: Monitor pipeline health, troubleshoot production issues, implement data validation frameworks, and maintain high standards for data quality and governance
- Collaborate Across Teams: Partner with product, analytics, and engineering teams to understand business requirements and deliver scalable data solutions
- Drive Continuous Improvement: Contribute to architecture decisions, engineering standards, automation efforts, and best practices across the data platform
Requirements:
- Bachelor's degree in computer science, engineering, information systems, or a related field; or equivalent practical experience
- 3+ years of experience in data engineering, software engineering, or a related technical field
- Strong proficiency in SQL and Python, with experience developing and maintaining production-grade data pipelines
- Experience working with large-scale data processing frameworks such as Apache Spark
- Strong understanding of relational databases, data modeling, ETL/ELT patterns, and data engineering best practices
- Experience troubleshooting and optimizing data workflows for performance and reliability
- Strong problem-solving skills, ownership mindset, and ability to collaborate effectively across technical teams
- Experience with Databricks in a production environment
- Experience building Power BI Semantic Models and supporting analytics use cases
- Experience working with Azure or AWS cloud platforms
- Familiarity with data orchestration and workflow management tools
- Familiarity with AI-assisted development tools, such as Claude Code
- Experience working with healthcare data domains such as claims, EMR/EHR, provider, or payer data
- Experience mentoring junior engineers or providing technical guidance on projects