The US Oncology Network is looking for a Remote Healthcare Senior Data Engineer to join their Precision Medicine Informatics Team. This role is pivotal in designing and maintaining data pipelines that manage large-scale clinical and molecular datasets, enabling data-driven insights for oncology care and research.
Responsibilities:
- Build and optimize scalable ETL/ELT pipelines for ingesting and transforming clinical, molecular, and lab data
- Improve data architectures and ensure efficient, accurate data integration
- Run advanced data queries to support analytics, reporting, and validation activities
- Document data assets, pipelines, and logic for transparency and compliance
- Support regulatory reporting and research workflows, including PHI protection and risk modeling
- Translate clinical and research needs into scalable technical solutions
- Integrate AI and LLM capabilities into data platforms and collaborate with AI teams on implementation
- Stay current with emerging AI technologies to enhance platform functionality
- Ensure data quality and integrity through monitoring, validation, and reconciliation
- Implement alerting systems and resolve issues proactively
- Perform regular cloud infrastructure maintenance and improvements
- Partner with cross-functional teams to deliver robust, end-to-end data solutions
- Manage multiple priorities in a fast-paced environment
- Maintain clear documentation of architectures, processes, and procedures
- Promote best practices in cloud engineering, ETL development, and AI integration
Requirements:
- Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or related field
- Minimum five (5) years of healthcare data engineering experience focused on ETL/ELT development
- Hands-on experience integrating AI/LLM capabilities into data platforms
- Expertise in scalable ETL/ELT pipeline design and maintenance
- Strong SQL and relational database skills (e.g., SQL Server)
- Proficiency in scripting/automation (Python, Perl, PHP, Bash)
- Experience with cloud data platforms (Azure, AWS, GCP) and big‑data tools (Databricks)
- Skilled in data modeling and schema design for clinical/molecular data
- Knowledge of AI technologies and large language models
- Understanding of precision oncology workflows and clinical data formats
- Familiarity with molecular/genomic data (NGS, variants, biomarkers)
- Experience integrating lab, pathology, and molecular testing data
- Knowledge of healthcare standards (HL7, FHIR, ICD‑10, LOINC, SNOMED)
- Experience with EHR systems (iKnowMed, Epic, Orchard Enterprise Labs)
- Strong communication skills with ability to work independently and collaboratively
- Excellent problem‑solving, prioritization, and stakeholder‑management skills
- Relevant certifications such as SQL Certified Associate or Azure Data Fundamentals
- Experience with code repositories (e.g., GitHub) and knowledge‑management tools (e.g., Confluence)
- Strong understanding of data‑warehousing concepts and technologies
- Ability to design, deploy, and manage cloud infrastructure across AWS, Azure, or GCP