Design, maintain, and troubleshoot CI/CD release pipelines for OpenShift Virtualization component builds and promotions across multiple release streams, using pipeline-as-code and GitOps-driven promotion patterns
Build and operate container image build systems, managing Dockerfiles, midstream repositories, and operator catalog assembly for a large-scale multi-component product
Develop and maintain automation tooling in Python and Bash for release workflows -
issue tracker integration, commit tracking, cross-team task coordination, and automated source synchronization between upstream and product repositories
Contribute to the team's release observability platform (Node.js/TypeScript) -
adding backend services, integrating data sources, and improving release visibility for stakeholders
Execute and verify production releases: manage smoke testing, gated verification flows, and candidate-release promotions through staged quality gates
Triage and resolve build and release incidents, perform root cause analysis, and drive post-incident improvements to prevent recurrence
Coordinate cross-team release activities such as base image rebases, dependency bumps, and version branching across 50+ component repositories
Document release processes, runbooks, and architectural decisions; support onboarding of new team members.
Requirements
3-5+ years of experience in release engineering or DevOps, or CI/CD infrastructure roles
Hands-on experience with container technologies: building container images, managing registries, and operating in Kubernetes/OpenShift environments
Proficiency with at least one CI/CD pipeline framework (Tekton, Jenkins, GitLab CI, or GitHub Actions)
Familiarity with GitOps tooling (ArgoCD, Kargo, Flux) is a strong plus
Solid scripting skills in Python and Bash for automation and tooling
Demonstrated knowledge with Git, including multi-repo workflows, branching strategies, and merge conflict resolution at scale
Familiarity with operator lifecycle management (OLM) and the Red Hat operator ecosystem
Strong debugging and incident response skills, ability to trace failures across distributed pipeline stages, container builds, and cluster environments
Effective asynchronous communication skills, experience collaborating across time zones with distributed engineering teams