Home
Jobs
Saved
Resumes
Site Reliability Engineer at StartHub | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Site Reliability Engineer
StartHub
Remote
Website
LinkedIn
Site Reliability Engineer
United States
Full Time
5 hours ago
H1B Sponsor
Apply Now
Key skills
AWS
Cloud
Distributed Systems
Google Cloud Platform
Kubernetes
Terraform
GCP
Google Cloud
About this role
Role Overview
Design, build, and maintain cloud infrastructure for our distributed build acceleration platform
Automate everything: from deployment pipelines to monitoring and recovery
Manage scalability and reliability for high-throughput, low-latency systems
Implement and maintain observability: logging, metrics, tracing, and alerting
Work closely with product and engineering teams to embed reliability into every feature
Diagnose and resolve production incidents quickly, and feed learnings back into systems design
Optimize cost, performance, and resilience across multi-cloud environments
Requirements
4+ years in SRE, DevOps, or Production Engineering roles
Experience managing Kubernetes in production
Strong background in cloud infrastructure (GCP or AWS) and IaC (Terraform preferred)
Solid knowledge of networking, security, and distributed systems
Track record of improving system availability and developer productivity
A knack for debugging complex, cross-system issues under pressure
Tech Stack
AWS
Cloud
Distributed Systems
Google Cloud Platform
Kubernetes
Terraform
Benefits
comprehensive medical, dental, vision benefits
401k/pension
parental leave
generous vacation
Apply Now
Home
Jobs
Saved
Resumes