About this role
Role Overview
Lead three teams (~15 people) across internal system teams, shared services, and release functions
Ensure operational readiness across environments, including monitoring, support, and reliability
Establish and evolve operating models, runbooks, and support frameworks leveraging AI tools
Drive clear ownership and accountability across operational domains with key leaders
Own release management and execution across platforms and supported products
Ensure predictable, high-quality releases through defined processes and governance
Coordinate release planning, scheduling, and cross-team dependencies
Drive automation of release processes to improve speed, reliability, and repeatability
Ensure effective release support, validation, and rollback mechanisms
Own operational delivery outcomes across internal and external teams
Support Program Increment (PI) planning, prioritisation, and execution
Define and track operational metrics (availability, Mean Time to Recovery (MTTR), deployment success, cost)
Ensure effective environment provisioning, release support, and platform availability
Drive resolution of operational issues, risks, and bottlenecks
Define and execute cost-reduction objectives and targets across infrastructure and operations
Establish cost baselines and track ongoing cost performance against targets
Drive efficiency through automation, standardisation, and reuse of platform capabilities
Reduce reliance on manual processes and bespoke solutions
Promote repeatable platform patterns and reference architectures
Drive adoption of automation-first operational practices
Embed Infrastructure as Code (IaC) and GitOps-based delivery models
Leverage Artificial Intelligence (AI) tools and workflows to improve operational productivity
Continuously improve platform services using data, insights, and emerging technologies
Ensure alignment to platform architecture and engineering standards
Partner with Architecture to deliver scalable, supportable, and consistent solutions
Drive operational quality, reliability, and discipline across teams
Establish guardrails for cloud and Kubernetes platform usage
Manage ~15 engineers, including platform engineers, across three teams
Own hiring, performance management, and capability development
Build a high-performance, accountable team culture
Develop leadership and technical capability within operations teams
Requirements
Proven experience leading operations, platform, or engineering services teams
Experience managing multiple teams (10–20 people) in a complex environment
Strong track record of owning or managing release processes and delivery operations
Strong track record of defining and delivering cost-reduction initiatives
Experience managing development and release infrastructure (on-premise and cloud)
Experience with cloud platforms (Amazon Web Services (AWS), Microsoft Azure) and Kubernetes
Hands-on experience with Infrastructure as Code (IaC) (e.g. Terraform)
Experience implementing automation and DevOps / GitOps practices
Exposure to Artificial Intelligence (AI) tools and workflows in engineering environments
Experience working in Agile / Scaled Agile Framework (SAFe) environments
Tech Stack
AWS
Azure
Cloud
Kubernetes
Terraform
Benefits
If you would like to be considered for employment opportunities with CSG and need special assistance due to a disability or accommodation for a disability throughout any aspect of the application process, please call us at +1 (402) 431-7440 or email us at accommodations@csgi.com. CSG provides accommodations for persons with disabilities in employment, including during the hiring process and any interview and/or testing processes.