Implementing best practices for monitoring, alerting, and incident response using DataDog and other tools.
Designing, building, and maintaining cost-effective, reliable, and scalable AWS infrastructure.
Collaborating with cross-functional teams to identify and address performance bottlenecks and reliability issues.
Conducting post-incident reviews to analyse root causes and implement preventive measures.
Automating routine tasks and processes to improve efficiency and reduce manual intervention.
Participating in an on-call rotation to respond to system outages and emergencies.
Requirements
Monitoring and observability best practice including using tools like Datadog, Prometheus, Grafana
Expertise in setting up and managing alerts, dashboards, and logging
Understanding of networking concepts, security best practices, and performance optimization in AWS.
Proficiency in AWS services: EKS, EC2, ECS, S3, RDS, VPC, IAM, Route 53, etc.
Experience with containerization and orchestration tools like Docker and Kubernetes.
Strong knowledge of Infrastructure as Code (IaC) tools such as Terraform, CDK or CloudFormation
Knowledge of scripting and automation using languages like Python, Bash, or PowerShell.
Experience with CI/CD pipelines for deploying and testing applications in AWS.
Bonus points if you have any additional experience that includes working with AWS ECS, exposure to .NET CDK for infrastructure provisioning, or a good understanding of cloud cost optimisation and FinOps practices.
Tech Stack
AWS
Cloud
Docker
EC2
Grafana
Kubernetes
Prometheus
Python
Terraform
.NET
Benefits
25 days’ holiday + bank holidays
Option to buy or sell 5 extra annual leave days per year
Vitality Health Insurance, including private healthcare, virtual GP access, mental‑health support and wellbeing perks (50% off gym memberships -Virgin Active, Nuffield, PureGym)
Pension with 5% matched contribution
Regular team‑wide and company‑wide events
2 volunteering days per year to give back
Remote‑first working environment with offices in London and Nottingham