PySparkPythonSQLAIData EngineeringAnalyticsBIPower BIDatabricksGitGitHubSource Control
About this role
Role Overview
Own the reliability and usability of Redgate’s data, designing and delivering data pipelines and products that support reporting, analytics, and AI across the business.
Build end-to-end data products on Databricks, using SQL, Python, and PySpark, from ingestion through to curated models used in Power BI and AI use cases.
Work across a complex, real-world data landscape, integrating multiple source systems with varying levels of quality and maturity.
Apply strong engineering judgement, balancing data modelling, performance, cost, and structure to build durable solutions that stand up to real usage.
Shape data engineering standards and practices, improving reliability, testing, observability, and cost efficiency, and helping enable future AI use cases.
Requirements
Proven experience as a Data Engineer
Experience working with Databricks
Strong SQL and data querying skills
Strong practical experience working with source control solutions such as GitHub, using methodologies such as Git Flow
Experience using Python and PySpark to build and maintain data pipelines