Leads, mentors, and develops a team of DevOps engineers, SREs, and infrastructure engineers supporting Azure and on-premises environments.
Establishes performance expectations, career development plans, and technical standards for the infrastructure organization.
Fosters a culture of collaboration, accountability, innovation, and continuous improvement.
Partners with engineering, security, operations, and IT teams to align priorities and drive successful execution.
Oversees the architecture, deployment, and operational management of Microsoft Azure cloud environments.
Ensures high availability, scalability, security, and cost optimization across cloud infrastructure resources.
Defines and enforces best practices for Infrastructure as Code (IaC), CI/CD pipelines, and automation strategies.
Manages cloud networking, identity management, access controls, and compliance frameworks.
Leads management of infrastructure supporting on-premises data center systems in partnership with corporate IT and shared services teams.
Oversees compute, storage, virtualization, networking, and backup/recovery solutions.
Ensures seamless integration between cloud and on-premises systems within hybrid architecture environments.
Drives infrastructure modernization initiatives, including migration strategies and cloud adoption efforts.
Defines and implements enterprise DevOps strategies, tools, and operational frameworks.
Drives adoption of CI/CD pipelines, containerization technologies such as Docker, and orchestration platforms such as Kubernetes.
Promotes automation across provisioning, configuration management, monitoring, and incident response processes.
Leads adoption of AI tools and automation technologies to improve operational efficiency.
Improves system reliability through Site Reliability Engineering (SRE) practices including observability, SLIs/SLOs, and incident management.
Ensures robust monitoring, alerting, change management, and incident response processes, including participation in 24/7 on-call support for mission-critical systems.
Leads root cause analysis (RCA) activities and continuous improvement initiatives.
Establishes SLAs, SLIs, and SLOs aligned with business objectives and operational goals.
Ensures infrastructure security, compliance, patch management, and disaster recovery readiness.
Collaborates closely with engineering, product, security, operations, and IT leadership teams to achieve business objectives.
Communicates technical strategies, risks, and project updates to executive stakeholders.
Manages vendor relationships and evaluates tools, technologies, and infrastructure solutions.
Requirements
Bachelor’s Degree in Computer Science, Engineering, or related field required, or equivalent experience.
Ten (10) plus years of experience in DevOps, Site Reliability Engineering (SRE), or infrastructure engineering.
Three (3) plus years of hands-on architecture, engineering, or operational experience within Microsoft Azure environments.
Five (5) plus years of leadership or management experience, including hiring, coaching, performance management, technical direction, and team development.
Proven experience managing on-premises data center infrastructure environments.
Expertise with Infrastructure as Code (IaC) tools such as Pulumi or Terraform.
Experience with CI/CD platforms such as Azure DevOps or Jenkins.
Strong understanding of networking, security, and system architecture across hybrid cloud environments.
Strong troubleshooting, analytical, problem-solving, and decision-making skills.
Experience utilizing AI tools such as Codex, Copilot, or similar technologies.
Experience with hybrid cloud architectures and cloud migration strategies preferred.