Milliman is a respected consultancy that develops data-driven SaaS products for the insurance and health IT sectors. The Data Engineer will design and implement robust Data Platform solutions, ensuring compliance with data privacy standards while collaborating with cross-functional teams to drive data architecture decisions.
Responsibilities:
- Creation of a Databricks Data Warehouse(s) and Lakehouse solutions for a healthcare data focused enterprise
- Configuring and maintaining unity catalog to enable enterprise data lineage and data quality
- Building out Data Security protocols and best practices including the management of identified and de-identified (PHI/PII) solutions
- Building data solutions for clients while upholding the best standards for reliability, quality, and performance
- Building solutions within Delta Live Tables and automation of transformations
- Building out performant enterprise-level medallion architecture(s)
- Building fit-for-purpose near real-time streaming and batch solutions
- Building out performant and efficient enterprise solutions for internal and external users for both structured and unstructured healthcare data
- Building out Infrastructure as Code using Terraform and Asset Bundles
- Working with the business to build cost effective and cost transparent Data solutions
- Help architect, build, and maintain robust and scalable data pipelines, monitoring, and optimizing performance
- Experience working with Migration tools i.e. Fivetran, AWS technologies and custom solutions
- Identify and implement improvements to enhance data processing efficiency
- Experience with building out effective pipeline monitoring solutions
- Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, Delta Live Tables, Python, Scala, and cloud-based ‘big data’ technologies
- Partner internally and externally with key stakeholders to ensure we are providing meaningful, functional, and valuable data
- Effectively work with Data, Development, Analysts, Data Science, and Business team members to gather requirements, propose, and build solutions
- Communicate complex technical concepts to non-technical stakeholders and provide guidance on best practices
- Ensure that technology execution aligns with business strategy and provides efficient, secure solutions and systems
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc
- Build analytics tools that utilize the data pipeline to provide actionable insights into operational efficiency and other key business performance metrics
- Create data tools for clinical, analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
Requirements:
- 3+ years of relevant experience in design, development, and testing of Data Platform solutions, such as Data Warehouses, Data Lakes, and Data Products
- Experience working in Databricks and AWS
- Experience working in both relational and non-relational databases such as SQL Server, PostgreSQL, and MongoDB
- Experience managing and standardizing clinical data from structured and unstructured sources
- Experience building and managing solutions on AWS
- Familiarity with designing and building APIs, ETL and data ingestion processes and utilization of tools to support enterprise solutions
- Experience in performance tuning, query optimization, security, monitoring, and release management
- Experience working with and managing large, disparate, identified and de-identified data sets from multiple data sources
- Familiarity with building and deploying IAC using terraform, asset bundles and github
- Bachelor's degree or master's degree in computer science, data engineering or related field
- Health and Life Insurance business experience
- Associate or Professional level solution architecture certification in Azure and/or AWS
- Experience in Snowflake