AnsibleAWSAzureChefCloudDistributed SystemsDNSDockerGoogle Cloud PlatformJenkinsKubernetesLinuxTerraformAIGCPGoogle CloudGitHub ActionsCloudFormationGitHubCDNCloudflareAgileCI/CDLeadershipCommunicationRemote Work
About this role
Role Overview
Design, build, and maintain scalable, secure, and reliable cloud infrastructure to support high-traffic, business-critical digital applications.
Proactively improve system reliability, availability, and performance through automation, observability, and continuous optimization.
Collaborate cross-functionally with Web Development, QA, and Product teams to align infrastructure decisions with business and delivery goals.
Lead incident response, root cause analysis, and post-incident reviews, ensuring learnings are translated into systemic improvements.
Build and maintain CI/CD pipelines that reduce friction and increase deployment confidence.
Establish and evolve infrastructure-as-code standards to ensure consistency, scalability, and long-term maintainability.
Identify opportunities to incorporate AI-assisted tooling into DevOps workflows and partner with leadership to measure its impact on reliability, delivery velocity, and cost efficiency.
Drive security, resiliency, and cost-awareness across environments.
Own medium-to-large infrastructure initiatives from design through production, including documentation and long-term support.
Contribute to sprint planning and backlog refinement within an Agile environment, ensuring work is prioritized and delivered effectively.
Contribute to roadmap planning by identifying infrastructure investments that improve reliability, scalability, and developer velocity.
Evaluate technical trade-offs and clearly communicate risks, impact, and recommendations to stakeholders.
Develop and maintain runbooks, architectural diagrams, and system documentation to support operational excellence.
Mentor engineers by sharing best practices in DevOps and cloud architecture
Requirements
5+ years of experience in DevOps, platform engineering, or infrastructure-focused software engineering roles.
Significant experience operating production systems in at least one major public cloud environment (AWS, Azure, or GCP).
Practical experience with infrastructure as code and configuration management (e.g., Terraform, CloudFormation, Ansible, Chef).
Hands-on experience building and maintaining CI/CD pipelines using tools such as Jenkins, GitHub Actions, or similar platforms.
Experience working with CDN, DNS, and edge security platforms (e.g., Cloudflare).
Solid understanding of Linux systems, networking, and distributed systems.
Hands-on experience with containers and orchestration tools (e.g., Docker, Kubernetes).
Ability to troubleshoot complex system issues and drive them to resolution.
Ability to prioritize competing initiatives and manage work with minimal oversight.
Strong written and verbal communication skills, with the ability to explain complex technical concepts clearly.