Site Reliability Engineer at Verisk | JobVerse
Verisk
Site Reliability Engineer
India
Full Time
3 hours ago
Visa Sponsorship
Key skills
AWS
Cloud
Distributed Systems
Docker
Kubernetes
Splunk
Amazon Web Services
EKS
IAM
Dynatrace
About this role
Role Overview
Design and operate multi-region architectures (active/active or active/passive)
Implement and improve automated failover and traffic routing
Identify and eliminate single points of failure
Ensure regional isolation and graceful degradation when dependencies fail
Define realistic availability goals and failure scenarios
Help the team understand RTO/RPO trade-offs
Build and maintain clear, actionable observability (metrics, logs, traces)
Create alerts that detect real problems without noise
Participate in on-call and help improve incident response
Reduce manual operational work through automation
Monitor performance and saturation across regions
Requirements
Experience operating production systems with real availability requirements
Hands-on experience with cloud infrastructure and distributed systems
Strong understanding of high availability patterns
Comfortable being hands-on: debugging, automating, and improving systems
Pragmatic mindset: you know when simple is better than perfect
Clear communicator who works well in a small, collaborative team
Strong expertise in Amazon Web Services (multi-region architecture)
Experience designing Active-Active / Active-Passive deployments
Advanced knowledge of VPC networking, IAM, Route 53, Load Balancers, and EKS
Advanced experience with Kubernetes (EKS preferred)
Strong knowledge of Docker
Advanced knowledge of Splunk
Strong expertise in Dynatrace
Tech Stack
AWS
Cloud
Distributed Systems
Docker
Kubernetes
Splunk
Benefits
Health insurance
Flexible work arrangements
Paid time off
Professional development