Home
Jobs
Saved
Resumes
Senior Site Reliability Engineer at ZigZag Offshoring | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Senior Site Reliability Engineer
ZigZag Offshoring
Remote
Website
LinkedIn
Senior Site Reliability Engineer
Philippines
Full Time
3 hours ago
$100,000 - $120,000 USD
No Sponsorship
Apply Now
Key skills
AWS
Cloud
Distributed Systems
Microservices
Python
SDLC
Terraform
Bash
CloudFormation
Agile
CI/CD
Communication
Remote Work
About this role
Role Overview
Design, implement, and continuously improve highly available, scalable, secure, and resilient cloud infrastructure and platform services
Define and evolve Service Level Indicators (SLIs), Service Level Objectives (SLOs), and operational metrics to drive measurable reliability outcomes
Lead incident response activities, major incident management, root cause analysis, and post-incident reviews focused on systemic improvement
Drive reduction of operational toil through automation, standardisation, and self-healing platform capabilities
Develop and maintain disaster recovery, backup, failover, and resilience strategies to meet defined RTO and RPO objectives
Conduct capacity planning, performance analysis, and proactive optimisation of infrastructure and application environments
Architect, build, and maintain scalable cloud-native infrastructure primarily within AWS environments
Develop and maintain infrastructure-as-code using tools such as Terraform and CloudFormation
Build reusable platform components and shared services that improve developer productivity and operational consistency
Design and maintain comprehensive observability solutions covering metrics, logging, tracing, alerting, and dashboarding
Collaborate with engineering teams to embed reliability, scalability, performance, and security considerations into the SDLC.
Requirements
5+ years of experience in Site Reliability Engineering, DevOps Engineering, Platform Engineering, or related infrastructure roles
Strong hands-on experience operating production workloads within AWS cloud environments
Deep experience with infrastructure-as-code tools such as Terraform and/or CloudFormation
Strong experience designing and supporting CI/CD pipelines and modern software delivery practices
Strong understanding of distributed systems, microservices architecture, networking, and cloud-native technologies
Experience implementing observability and monitoring solutions across complex environments
Strong scripting and automation experience using Python, Bash, or similar languages
Experience managing production incidents and conducting structured root cause analysis
Strong understanding of system reliability, scalability, security, and operational best practices
Excellent analytical, troubleshooting, and problem-solving capabilities
Strong communication and stakeholder engagement skills
Ability to work effectively in fast-paced, agile, and collaborative engineering environments.
Tech Stack
AWS
Cloud
Distributed Systems
Microservices
Python
SDLC
Terraform
Benefits
Paid time off
Remote work options
Professional development opportunities
Apply Now
Home
Jobs
Saved
Resumes