Coltech is a nationally recognised consumer retail organisation undertaking a large-scale enterprise modernisation programme. They are seeking an experienced Data Engineer to build a next-generation data platform and contribute to the design and implementation of a modern lakehouse architecture.
Responsibilities:
- Contribute to the design and implementation of a modern lakehouse architecture
- Establish core data infrastructure to support enterprise-wide analytics
- Work within a transformation programme focused on long-term digital capability
- Build and optimise scalable batch and streaming data pipelines
- Design ingestion frameworks for structured and semi-structured data
- Implement transformation logic and ensure seamless system integrations
- Improve storage architecture for performance and cost efficiency
- Maintain high standards for data quality, lineage, and observability
- Monitor system performance and proactively address bottlenecks
- Support governance frameworks to ensure consistency and trust in data
- Deliver infrastructure that supports BI, reporting, and self-service analytics
- Partner with analytics stakeholders to operationalise data assets
- Lay groundwork for future machine learning and predictive initiatives
- Identify optimisation opportunities across architecture and workflows
- Contribute to evolving best practices for data engineering standards
- Operate effectively in an environment where systems are being built from scratch
Requirements:
- Degree in Computer Science, Engineering, or related technical discipline (or equivalent practical experience)
- Proven track record delivering data pipelines and data platform solutions
- Strong coding capability (Python, Java, or Scala preferred)
- Advanced SQL knowledge with experience across relational and NoSQL systems
- Hands-on exposure to at least one major cloud platform (AWS, Azure, or GCP) and associated data services
- Experience using orchestration or workflow tooling (Airflow, Dagster, Prefect, Fivetran, or similar)
- Understanding of data modelling principles and scalable architecture design
- Strong problem-solving ability and comfort operating autonomously
- Exposure to analytics, machine learning, or statistical workflows is advantageous