You’ll be the voice of our customers, using data to tell their stories and put them at the heart of all decision-making
You’ll drive customer value by understanding complex business problems and requirements to correctly apply the most appropriate and reusable tool to gather and build data solutions
Your responsibilities will also include: Building advanced automation of data engineering pipelines through removal of manual stages
Embedding new data techniques into our business through role modelling, training, and experiment design oversight
Delivering a clear understanding of data platform costs to meet your departments cost saving and income targets
Sourcing new data using the most appropriate tooling for the situation
Developing solutions for streaming data ingestion and transformations in line with our streaming strategy
Requirements
A strong background in building and optimizing pipelines for LLMs, traditional ML models, and data processing pipelines using OSS frameworks or within the AWS/GCP ecosystems
Strong experience in supervised or unsupervised learning, time-series forecasting, optimization, feature engineering, model evaluation, and retraining strategies along with RAG, embedding pipelines, LLM orchestration, prompt engineering, and API integration
Strong proficiency with Python
Hands on engineering excellence in defining and enforcing standards for ML Ops, CI/CD, experiment tracking, model deployment, observability, security, and performance tuning
The ability to evaluate and integrate capabilities across OpenAI/Anthropic models, fine-tuning, GPU compute, AutoML, and feature stores
A background of working with code repositories, bug tracking tools and wikis