Responsible for supporting the development and maintenance of SQL queries for data extraction, transformation, and manipulation.
Contribute to the preparation, cleaning, and organization of data for analytical use.
Support the construction and maintenance of data pipelines using PySpark.
Assist in organizing and structuring data in Data Warehouse environments.
Apply basic data quality validations.
Collaborate with the development team and the Product Owner (PO) to understand business rules.
Requirements
Desired knowledge: Python (pandas and data pipeline development)
Data warehouse concepts: dimensional modeling, data partitioning, relational model.
Medallion Architecture
Data quality: tools and/or validation techniques (desired)
HQL (desired)
Basic understanding of LGPD (Brazilian General Data Protection Law)
Data streaming, e.g., Kafka
Tech Stack
Kafka
Pandas
PySpark
Python
SQL
Benefits
Company-subsidized health insurance for the employee.
Option to include dependents in the health plan with payroll deduction.
Dental assistance (optional).
Option to include dependents in the dental plan with payroll deduction.
Meal allowance or food allowance.
Transportation voucher (optional).
Impact & Care
Personal guidance program offering confidential emotional support and counseling (psychological, legal, financial, social, and pet-related) at no cost for the employee and legal dependents.
Gympass
Wellhub (Access to over 700 gyms across Brazil with plans starting at R$ 29.90, deducted from payroll).
Option to include dependents in Gympass
Wellhub (up to 3 dependents
paid via credit card).
Access to Udemy through our intranet.
Partnerships with major consumer brands.
SESC benefits for the employee and dependents.
Discount agreements with educational institutions (undergraduate and postgraduate) and language schools/certification providers.