Develop the DWH core model that transforms raw Kafka topics into documented, easy-to-use data products that bring value to data consumers
Collaborate closely with data engineers to develop our streaming ETL framework and improve its reliability and user experience
Help the team deliver new features and maintain the existing ones in the OpenMetadata data catalog
Actively participate in data quality and data governance tools and processes implementations
Partner with data consumers across the business, supporting their daily data needs and turning requirements into reliable, well-documented data products
Tackle unique challenges that come with building a Data Platform from scratch
Requirements
You've worked as an analyst or analytics engineer on technical projects involving data platform implementation
You write excellent SQL for analytical tasks
You have experience with Python and PySpark
You're proficient with Git, ETL tools, or dbt
You've built data products or DWH models on a modern data platform (Databricks, Spark, or dbt)
You understand how streaming pipelines work
You're confident gathering requirements from business stakeholders and turning them into technical solutions
You have a high level of autonomy and self-direction