Designing and scaling a robust data architecture that handles massive event-driven communication flows across nearly every GP practice in the UK.
Collaborating deeply with product and data teams to transform raw healthcare communication data into high-performance models for analytics and Machine Learning.
Implementing sophisticated ELT practices in a cloud environment where data integrity and security are paramount for patient safety.
Requirements
Proven experience designing and implementing ingestion and ETL/ELT pipelines from scratch using Python and SQL in production environments.
Deep understanding of modern cloud data platforms (e.g. Snowflake, BigQuery, Redshift) and data transformation tools like DBT, with a focus on creating clean, reliable, and reusable datasets.
Experience (or strong interest) in building and operationalising machine learning models, with an eye toward how strong data foundations enable better AI outcomes.
Comfortable making and owning decisions around data architecture to support analytics, experimentation, and ML at scale.
You think beyond pipelines — you care about how data drives real-world outcomes, especially in a healthcare context.
Able to work cross-functionally with product, engineering, and data teams to shape problems and deliver solutions.
Excited by greenfield challenges, undefined problems, and the opportunity to build something meaningful from the ground up.
Tech Stack
Amazon Redshift
BigQuery
Cloud
ETL
Python
SQL
Benefits
Access to Happl
a flexible benefits provider with a given budget to spend on perks of your choice. Options include private health insurance, wellness providers and more
Flexible Working: We are an office first culture and ask you are in at least 3 days a week.
Enhanced parental leave policy
Free healthy breakfasts, snacks and lunches will be provided, with the occasional sweet treat!