Own uptime, availability, scalability, and performance of all production systems.
Define and manage SLOs, SLAs, error budgets, and incident response practices.
Lead post-incident reviews and drive systemic reliability improvements.
Implement observability standards (logging, metrics, tracing).
Own cloud infrastructure strategy (AWS, Azure, hybrid).
Lead infrastructure-as-code (Terraform, CloudFormation, ARM, etc.).
Ensure disaster recovery, backup, and business continuity plans are tested and compliant.
Monitor and optimize cloud spend through cost governance and FinOps practices.
Own CI/CD pipelines, deployment automation, and release strategies.
Enable safe, frequent releases (blue/green, canary, feature flags).
Standardize DevOps tooling and platform capabilities across teams.
Partner with Engineering to remove friction and increase delivery velocity.
Set plan and manage execution of dashboards, availability management and reporting.
Align with Product Engineering teams to define NFRs related to definition, instrumentation and logging.
Embed security into DevOps practices (DevSecOps).
Partner with Security, Legal, and Compliance on audits and certifications (SOC 2, HIPAA, HITRUST, PCI, etc.).
Ensure secrets management, access controls, and vulnerability remediation.
Build and lead DevOps, SRE, and Cloud Engineering teams.
Define the DevOps operating model (centralized, embedded, hybrid).

10+ years of hands-on experience in DevOps, SRE, cloud engineering, and infrastructure.
5+ years as a Director in leadership/people management role (leading managers and/or large teams)
Deep expertise in modern tools and practices: Cloud platforms (AWS, Azure).
CI/CD (GitHub Actions, GitLab CI, Jenkins, ArgoCD).
Containers & orchestration (Kubernetes, Docker, Helm).
Infrastructure as Code (Terraform, Pulumi, Crossplane).
Monitoring/Observability (DataDog, Sumo, Grafana, ELK, Datadog, New Relic).
Scripting/automation (Python, Go, Bash).
Strong understanding of Agile/Scrum/SAFe methodologies.
Proven track record of building high-performance teams and driving cultural change.
Excellent communication, strategic thinking, and cross-functional collaboration skills.
Experience with large-scale, high-availability environments.

Director – DevOps & Cloud Infrastructure

Key skills