Design, build, and maintain robust ETL/ELT pipelines in particular with the centralized data lakehouse and also supporting the DWH, using DBT and Argo Workflow, ensuring data is reliable, well-tested, and documented
Evolve our AWS data lakehouse and DWH architecture and infrastructure (S3, Athena, Redshift), ensuring it meets modern and scalable standards
Develop and optimize analytical data models that serve as the foundation for dashboards and analytics use cases for the entire company
Ensure data quality, integrity, and performance across the data platforms
Act as a technical lead on data initiatives: reviewing code, setting standards, and guiding junior colleagues
Collaborate closely with data analysts and scientists to understand their needs and translate them into robust data infrastructure and models, supporting the analysis tasks. Critically assess incoming data requirements, challenge assumptions, identify the most meaningful metrics, and help stakeholders define what they actually need to measure
Requirements
5+ years of experience in a data engineering or similar data-focused development role
Strong, hands-on experience with DBT (models, tests, documentation, incremental strategies)
Strong experience with AWS S3 and the broader AWS data ecosystem (Athena, Redshift, Glue or similar)
Proficiency in Python for data processing and pipeline development
Solid command of SQL, including complex transformations and performance tuning
Excellent analytical thinking, questioning requirements, understanding the business context, and helping in translating business requirements
Strong data intuition: capable of independently assessing whether a data model or metric makes sense, and proactively flagging issues