Home
Jobs
Saved
Resumes
Lead Cloud Engineering, Production Operations Engineer at qode.world | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Lead Cloud Engineering, Production Operations Engineer
qode.world
Website
LinkedIn
Lead Cloud Engineering, Production Operations Engineer
California, United States of America
Full Time
1 hour ago
No Visa Sponsorship
Apply Now
Key skills
Ansible
AWS
Azure
Chef
Cloud
Docker
Grafana
Jenkins
Kubernetes
Prometheus
Puppet
Python
Terraform
Bash
PowerShell
ArgoCD
CloudFormation
IAM
Datadog
GitLab
CI/CD
Mentoring
Communication
Collaboration
About this role
Role Overview
Design, deploy, and manage hybrid and cloud infrastructures (OCI, AWS, Azure, on-prem) to support production and enterprise systems
Implement infrastructure-as-code (IaC) using Terraform or CloudFormation to ensure repeatable, secure, and automated deployments
Develop and maintain CI/CD-ready environments that support rapid build, test, and release cycles for engineering teams
Partner with network and security teams to implement resilient, compliant architectures
Serve as technical lead for production systems, ensuring stability, performance, and scalability
Establish monitoring, logging, and alerting frameworks to improve visibility and reduce mean time to detection (MTTD) and resolution (MTTR)
Participate in incident response, root cause analysis, and reliability improvement efforts
Collaborate with Engineering and SRE teams to define SLIs, SLOs, and performance metrics for critical services
Develop and enhance deployment pipelines (e.g., Jenkins, GitLab, ArgoCD) to automate software delivery and environment provisioning
Embed security, compliance, and testing gates into CI/CD workflows
Implement configuration management and orchestration tools such as Ansible, Chef, or Puppet to manage infrastructure at scale
Drive efficiency through self-healing systems, auto-scaling, and infrastructure automation
Lead day-to-day production operations activities, mentoring junior engineers on cloud and reliability best practices
Act as a technical bridge between Infrastructure, Security, and Application Engineering teams
Contribute to capacity planning, cost optimization, and production readiness reviews
Maintain documentation, runbooks, and standard operating procedures for production systems
Requirements
Bachelor’s degree in Computer Science, Information Systems, or equivalent experience
7+ years of experience in cloud and infrastructure engineering, with at least 2–3 years in a lead or senior engineer capacity
Deep expertise in OCI (preferred) AWS or Azure (networking, compute, storage, IAM, and monitoring)
Proven experience with production-scale operations and hybrid cloud deployments
Proficiency in:
Infrastructure-as-code (Terraform, CloudFormation)
CI/CD and DevOps pipelines (Jenkins, GitLab, ArgoCD)
Containers and orchestration (Kubernetes, Docker)
Observability tools (Datadog, Prometheus, Grafana, ELK)
Scripting languages (Python, Bash, PowerShell)
Strong troubleshooting skills and the ability to lead through high-impact incidents
Excellent communication and collaboration skills across cross-functional teams
Tech Stack
Ansible
AWS
Azure
Chef
Cloud
Docker
Grafana
Jenkins
Kubernetes
Prometheus
Puppet
Python
Terraform
Apply Now
Home
Jobs
Saved
Resumes