AmeriHealth Caritas is a mission-driven organization with over 30 years of experience in healthcare solutions. The Senior Data Engineer will design, develop, and deliver data assets that enable enterprise reporting, analytics, and regulatory use cases, while ensuring data quality and governance across clinical systems.
Responsibilities:
- Develop and maintain internal querying tools, data utilities, and reusable components that improve clinical systems build efficiency, troubleshooting speed, and error prevention; partner with Clinical Informatics stakeholders to adopt and use these tools within operational workflows
- Develop and maintain data structures and governance practices that support clinical informatics, analytics, and population health reporting; ensure models are scalable, understandable, and aligned to governed definitions. Work across business domains to apply data standards consistently and partner with stakeholders to deliver solutions that reflect real workflows and user needs
- Analyze clinical system workflows and how users interact with data in day-to-day work; develop a deep understanding of user experience to ensure data solutions are practical, fit for purpose, and aligned with operational needs. Apply judgment to recommend the simplest effective solution, avoiding unnecessary complexity
- Create and maintain clear documentation that traces how clinical system fields, workflows, and configuration outputs correspond to underlying data structures and analytic outputs; contribute to data dictionaries and lineage artifacts to support accurate and reliable reporting
- Improve the speed and reliability of data processing and reporting by identifying performance issues and making targeted improvements to data pipelines and queries. Implement data quality practices, including automated validation and testing, to detect issues early and prevent downstream impact to operations and reporting
- Integrate data from external systems and interoperability feeds (including standards-based interfaces such as FHIR, as applicable); participate in data design and architecture reviews to ensure solutions are scalable, secure, and maintainable
- Coordinate work across Information Systems, the Enterprise Data Office, Enterprise Analytics, and Population Health Analytics to ensure alignment with shared priorities, dependencies, and data standards; operate effectively in a matrixed environment where decision-making is distributed and requires collaboration, influence, and alignment across teams
- Contribute to shared data engineering standards, documentation, and ways of working; provide guidance to team members to promote consistency, quality, and maintainable solutions
Requirements:
- Bachelor's Degree in Computer Science, Data Engineering, or Health Informatics
- Minimum of 5 years of experience in a report automation environment
- Minimum of 5 years of experience with report automation engineering, batch and real-time processing, and agile methods
- Current or prior experience in the healthcare industry
- Experience working in SQL and Python
- Experience working with source control and familiarity with Azure DevOps
- Ability to work with multiple database platforms, such as SQL Server and Oracle
- Understanding of data lake concepts and familiarity with Azure Databricks
- Familiarity with CI/CD concepts
- Advanced SQL, Python (Pandas, PySpark), dbt, Airflow/workflow orchestration, Snowflake or Databricks
- Cloud data services (AWS Glue/S3/Redshift, Azure Data Factory/ADLS, GCP BigQuery)
- Data Modeling (dimensional, Data Vault), FHIR R4 APIs
- Healthcare data standards (NCPDP, X12, HL7)
- CI/CD for data pipelines
- Data Catalog Tools
- Master's Degree