Sellers Dorsey is a healthcare impact strategy firm focused on improving care access, quality, and outcomes for vulnerable populations. The Healthcare Data Engineer will build, optimize, and maintain scalable data pipelines to turn healthcare data into actionable intelligence, working closely with architects, analysts, and governance teams.
Responsibilities:
- Design, develop, and maintain robust data pipelines and architectures using modern ETL/ELT frameworks
- Integrate and standardize healthcare data from diverse sources including EHR, claims, lab systems, and patient portals
- Collaborate with Data Architects to implement scalable models that support BI, analytics, and data science initiatives
- Build automated ETL workflows that ensure high performance, reliability, and data integrity
- Monitor data jobs and troubleshoot issues across cloud and on-prem environments
- Document pipelines and technical processes with precision and clarity
- Ensure proper logging and transparency into pipeline performance and data load analytics
- Shape data from raw intake and inputs into usable and meaningful data stages for consumption by BI programs and analytics platforms
- Work with Backend Engineers to understand API data requirements and shape data accordingly
- Properly document transformation steps for ongoing support and maintenance of Data Governance
- Work in multiple existing database structures to understand table schema and build queries to extract necessary data
- Build and / or utilize queries and strategies for forensic discovery of data elements within an existing database architecture
- Contribute to the development and execution of enterprise data strategy, aligning technical work with business goals
- Translate analytics needs into data engineering deliverables and identify opportunities for innovation
- Support client relations with ad-hoc client data queries and record requests
- Ensure data quality, accuracy, and security across engineering processes
- Partner with governance teams to implement metadata management, data lineage, and stewardship practices
- Maintain compliance with HIPAA, HITECH, and internal policies
Requirements:
- Bachelor's degree in Computer Science, Information Systems, Health Informatics, or related field preferred
- 5+ years of experience in data engineering, preferably in a healthcare or regulated industry, required
- Experience with cloud platforms such as Azure, AWS, or GCP
- Knowledge of data governance frameworks and tools (e.g., Collibra, Informatica)
- Exposure to DevOps, CI/CD pipelines, and Agile development practices
- Familiarity with healthcare data formats (HL7, FHIR, CCD), structures, and compliance requirements
- Expertise in ETL development and tools (e.g., Python, Azure Data Factory, etc.)
- Experience in data transformation models (e.g., dbt, T-SQL, etc.)
- Proficiency in SQL, Python, YAML, JSON
- Comfortable with multiple varied data formats and the pros / cons of each (e.g., CSV, Feather, Parquet, etc.)
- Proficiency in multiple database structures (e.g., MS SQL, Postgres, Snowflake, etc.)