Design, build and maintain CI/CD pipelines to support automated testing and deployment.
Develop and manage Infrastructure as Code (IaC) using tools such as Terraform, Bicep or CloudFormation.
Manage and optimise cloud-based environments, delivering solutions on AWS and Azure that support secure, scalable and resilient services.
Administer and support Red Hat / Linux platforms, including patching, configuration, and secure baselining.
Implement monitoring, logging and alerting using tooling such as Prometheus, Grafana or similar to ensure platform reliability and performance. This includes defining and tracking SLIs/SLOs and supporting incident response and post-incident reviews.
Work with development teams to improve build, release and deployment processes.
Manage containerised workloads using Docker and Kubernetes, including deployment tooling, ingress and scaling.
Maintain secure infrastructure and ensure compliance with security and governance standards, including IAM/least privilege, secrets management, secure CI/CD practices and policy controls.
Identify opportunities to improve cost, performance and reliability (FinOps-aware), and help clients understand key cost drivers and trade-offs.
Troubleshoot issues across environments including development, staging and production.
Contribute to client-ready documentation including architecture decision records (ADRs), operational runbooks, as-built documentation and handover packs.
Promote DevOps culture including collaboration, automation and continuous improvement.
Requirements
Experience working in a client-facing delivery environment, with the ability to communicate technical concepts clearly and manage stakeholders.
Experience working in a DevOps, Platform Engineering or Site Reliability role.
Hands-on experience delivering on AWS and Azure, including core services for networking, compute, storage, identity and logging.
Experience using Infrastructure as Code tools (e.g. Terraform, ARM templates, Bicep, CloudFormation).
Experience administering and supporting Red Hat Enterprise Linux or similar.
Knowledge of containerisation technologies such as Docker and Kubernetes.
Experience with scripting or programming (e.g. Python, Bash, PowerShell).
Familiarity with version control systems such as Git.
Experience supporting production systems, including incident response, problem management and reliability improvements using SLIs/SLOs.
Experience implementing observability solutions (logging, metrics and ideally tracing) and turning telemetry into actionable alerting and dashboards.
Tech Stack
AWS
Azure
Cloud
Docker
Grafana
Jenkins
Kubernetes
Linux
Prometheus
Python
Terraform
Benefits
Private healthcare/medical cover and group life insurance
Annual bonus scheme (dependent on personal and company performance)
25 days holiday plus bank holidays (increasing by 1 day each calendar year after your 3rd anniversary with the company, rising to a maximum of 30 days plus bank holidays)
Enhanced Reservist Leave – up to 10 days paid.
Annual leave purchase scheme (up to 5 days per year)
5% company pension contribution
£250.00 annual donation towards a charity or grassroots organisation of your choice
Personal wellness benefit of £120.00 per month, access to unlimited 1-1 counselling support and a wealth of wellbeing and support resources
Enhanced parental leave
Electric car leasing salary sacrifice scheme
Cycle to work scheme (save 25-39% on a bike and accessories)
Paid qualifications for employees at all levels
Internal employee networks, regular social events throughout the year and charity fundraising activities to get involved with if you wish