Direct the design, implementation, and maintenance of CI/CD pipelines, automated provisioning, and monitoring processes across cloud and hybrid environments.
Lead efforts to standardize and optimize platform engineering practices, adopting Infrastructure-as-Code (IaC) and microservices deployment models.
Develop and enforce SRE principles, including release management, system reliability, observability, incident management, SLAs/SLOs, and fault tolerance.
Integrate security throughout the SDLC, ensuring robust code review, vulnerability scanning, and threat modeling within automated pipelines.
Mentor and lead a multidisciplinary team, promoting an agile, collaborative, and innovative work environment.
Champion creation and curation of reusable infrastructure patterns, automation scripts, cloud orchestration templates, and developer self-service platforms.
Oversee operational readiness, incident response, root cause analysis, and continuous improvement initiatives to ensure high availability and rapid recovery from service disruptions.
Drive a culture of innovation by assessing and implementing advancements in DevOps, platform engineering, and SRE practices.
Prepare and present operational dashboards, incident reports, risk assessments, and status updates to program leadership and customers.
Requirements
Bachelor’s degree and 10+ years of progressive experience in software development, DevOps, or platform engineering.
3+ years of technical team leadership or management experience.
Demonstrated expertise in advanced DevOps practices, including CI/CD, configuration management, automation, and cloud-native operations (AWS, Azure, or similar).
Hands-on experience with SRE frameworks, monitoring, logging, alerting, and reliability engineering techniques.
Proven background in securing applications and systems, including integrating security into pipelines and coordinating with security/compliance teams.
Strong technical knowledge of container orchestration (Kubernetes, Docker), IaC (Terraform, CloudFormation), and end-to-end application/platform lifecycle management.
Excellent interpersonal, written, and verbal communication skills.
Strong problem-solving skills and ability to thrive in a fast-paced, dynamic environment.
U.S. citizenship and ability to obtain and maintain required government security clearance.