Data Engineer – GCP, Databricks at Grupo Med4U | JobVerse
Grupo Med4U
Curitiba, Paraná, Brazil
Full Time
1 hour ago
No Sponsorship
Key skills
Apache
BigQuery
Cloud
Google Cloud Platform
Oracle
Postgres
Python
Spark
SQL
AI
Data Engineering
Data Lake
Analytics
Databricks
Apache Spark
GCP
Google Cloud
IAM
Cloud Storage
PostgreSQL
Performance Optimization
About this role
Role Overview
Design, implement and evolve data architectures using Google Cloud Platform (GCP) and Databricks;
Structure and maintain data ingestion, transformation and delivery pipelines with a focus on scalability, traceability and data quality;
Integrate data from electronic medical records, hospital systems, relational databases and other clinical and administrative sources;
Organize structured, semi-structured and unstructured data for use in advanced analytics and AI applications;
Define standards for modeling, documentation, governance, data quality and data lineage;
Build datasets, analytical tables and data products that support predictive models, AI solutions, dashboards and clinical studies;
Administer and evolve the Databricks environment, including permission management, catalogs, monitoring and performance optimization;
Implement security, privacy and data governance controls in compliance with the LGPD (Brazilian General Data Protection Law);
Work in partnership with Data Science, AI Engineering, Product, IT and clinical specialist teams;
Monitor data pipelines and infrastructure, identifying opportunities for continuous improvement;
Keep technical documentation up to date and contribute to innovation, research and development initiatives.
Requirements
Degree in Computer Science, Computer Engineering, Information Systems, Software Engineering, Data Engineering or related fields;
Experience building and supporting production data pipelines;
Proficiency in SQL and experience with Python for data engineering;
Experience with Apache Spark and distributed processing;
Hands-on experience with Databricks, including Delta Lake, notebooks and workflows;
Experience with Google Cloud Platform (GCP), especially BigQuery, Cloud Storage and IAM;
Knowledge of data modeling, Data Lake, Data Warehouse and/or Lakehouse architectures;
Experience with relational databases, preferably Oracle or PostgreSQL;
Knowledge of data governance, quality, security and observability;
Experience with pipeline orchestration tools.