BURGEON IT SERVICES is seeking a senior-level GCP Data Engineer / Data Architect with strong healthcare experience to design and build an enterprise-grade BigQuery-based Data Lake & Data Warehouse platform on Google Cloud. The role requires deep hands-on expertise in data pipelines, healthcare data standards, governance, and cloud architecture best practices.
Responsibilities:
- Architect and build a GCP-based data lakehouse using BigQuery, GCS, Dataflow, Dataproc, Pub/Sub, and Cloud Composer
- Design batch, near real-time, and streaming ingestion pipelines
- Develop transformation frameworks using BigQuery SQL, Dataflow, Dataproc, or dbt
- Build bronze/silver/gold data layers and consumption-ready data marts
- Ingest clinical data (EHR/EMR, LIS, RIS/PACS) and non-clinical systems (ERP, HR, finance)
- Implement healthcare data standards such as HL7, FHIR, CCD/C-CDA, and DICOM
- Ensure HIPAA compliance, PHI protection, IAM/RBAC, encryption, and data governance
- Support hybrid on-prem to GCP data migrations
- Provide architectural guidance and mentor data engineers
Requirements:
- Healthcare domain experience is mandatory
- 10+ years in Data Engineering / Data Architecture
- Strong enterprise-level GCP data platform experience
- Proven healthcare domain project experience
- Architect and build a GCP-based data lakehouse using BigQuery, GCS, Dataflow, Dataproc, Pub/Sub, and Cloud Composer
- Design batch, near real-time, and streaming ingestion pipelines
- Develop transformation frameworks using BigQuery SQL, Dataflow, Dataproc, or dbt
- Build bronze/silver/gold data layers and consumption-ready data marts
- Ingest clinical data (EHR/EMR, LIS, RIS/PACS) and non-clinical systems (ERP, HR, finance)
- Implement healthcare data standards such as HL7, FHIR, CCD/C-CDA, and DICOM
- Ensure HIPAA compliance, PHI protection, IAM/RBAC, encryption, and data governance
- Support hybrid on-prem to GCP data migrations
- Provide architectural guidance and mentor data engineers
- GCP Data Stack (Strong Hands-On Required)
- BigQuery
- Dataflow
- Pub/Sub
- Dataproc
- Cloud Composer
- Google Cloud Storage (GCS)
- HL7 v2.x
- FHIR
- CCD / C-CDA
- DICOM
- LIS / RIS / PACS systems
- EHR / EMR data integration
- Strong SQL
- Data lakehouse architecture
- Star/snowflake schema design
- Canonical data modeling
- Bronze / Silver / Gold data layering
- Metadata management and data lineage
- HIPAA compliance
- PHI/PII handling
- IAM, RBAC
- VPC security
- DLP and encryption
- VPC networking
- Hybrid connectivity (VPN/Interconnect)
- On-prem to cloud migration strategies
- Dataplex
- Medical device / IoT data ingestion