Lead the operational management of the organisation’s cloud platform, ensuring high availability, performance, and reliability of platform services.
Manage platform incidents, problems, and service requests.
Establish and drive Site Reliability Engineering (SRE) practices.
Lead and mentor a squad of platform engineers responsible for operating and improving the cloud platform.
Collaborate with Platform Engineering and Platform Design teams to ensure new capabilities are designed with operability, scalability, and supportability in mind.
Requirements
10+ years of managing IT teams
5+ years managing cloud infrastructure, platform engineering, or DevOps operations
Strong hands-on experience with Infrastructure as Code (Terraform, Bicep, CloudFormation, Ansible, etc.)
Experience operating platforms in AWS (preferred) and/or Azure environments