Design and Automate Infrastructure: Build and maintain cloud infrastructure on AWS, GCP, or Azure, utilizing tools like Terraform, Ansible, or CloudFormation to automate provisioning.
Optimize CI/CD Pipelines: Develop and manage continuous integration and deployment pipelines using Jenkins, GitLab, or ArgoCD to streamline software delivery.
Enhance Observability: Implement monitoring and logging systems (Prometheus, Grafana, Datadog) to define alerts, dashboards, and log-based metrics that improve application availability.
Lead Incident Management: Respond to real-time outages, perform root cause analysis, and participate in on-call rotations to ensure rapid service restoration.
Ensure Application Sustainment: Oversee the full lifecycle of critical applications, including upgrades, patching, and scaling services to meet global demand while maintaining strict SLOs.
Drive Reliability and Security: Implement self-healing systems, disaster recovery strategies, and security best practices, including regular vulnerability patching and audits.
Requirements
Must have UK passport, be UK based for more than five consecutive years and able to obtain SC security clearance
Proven experience as DevOps Engineer, with focus on SRE Engineer.
Strong technical expertise in managed Kubernetes services and cloud networking concepts (VPC, DNS, Load Balancers, and TCP/IP).
Proven ability in complex troubleshooting using debugging tools like tcpdump or strace and log aggregation tools like the ELK stack or Splunk.
Software Development skills in Python, Java, or .Net, along with experience developing scalable microservices and REST APIs.
A Bachelor's Degree in Computer Science, Engineering, or a related field.
Preferred Certifications: Scrum Master, PMP, or Agile SAFe certification.
Tech Stack
Ansible
AWS
Azure
Cloud
DNS
Google Cloud Platform
Grafana
Java
Jenkins
Kubernetes
Microservices
PMP
Prometheus
Python
Splunk
TCP/IP
Terraform
Benefits
Competitive Compensation: Salary and benefits aligned with your professional experience.
Work-Life Balance: Flexible work options and alternative arrangements to support your personal needs.
Health & Wellness: Comprehensive health, wellness, and retirement plans.
Growth Opportunities: Access to continuous learning and professional development to accelerate your career.