Hopper is a leading travel platform on a mission to enhance global travel experiences. They are seeking a Senior Site Reliability Engineer to join their Cloud FinOps team, focusing on optimizing infrastructure and cost efficiency in Google Cloud while ensuring reliability and scalability for their services.

Responsibilities:

Work on projects that will drive a higher cost efficiency, such as: Reduce our network egress costs by removing unnecessary headers
Ensure that our warehouse data is in use and select the most efficient storage for it. E.g., cold storage for buckets with infrequent retrieval
Ensure that autoscaling for both databases and compute is well optimized
Work on improving the current cost attribution to ensure all teams have clear visibility into their costs
Participate in providing support to incidents and be part of on-call rotation for platform incidents, as each engineering team has their own on-call rotation
Contribute to solving doubts and problems engineers might face with our infrastructure and approving PRs that require Platform supervision

Requirements:

Strong background in SRE, DevOps, Software Engineering or Systems engineering
Troubleshooting skills
System design with good analytical capabilities
Good communication skills
Knowledge of major cloud providers, preferably Google Cloud
SQL knowledge
Containers, Kubernetes, and related tooling like Kustomize and Helm
Service Mesh, preferably with Istio
Networking knowledge. DNS, TLS, certificates, ingresses, etc
Observability with log collection, metrics, APM, etc. preferably Datadog
Security knowledge, IAM, RBAC, network security, etc
Knowledge on authentication and authorization technologies
CI/CD
Database technologies
Competent in scripting with Bash and Python or other scripting languages

Senior Site Reliability Engineer, Platform & Cloud FinOps (100% Remote - USA Central & EST)

Key skills

About this role

Responsibilities:

Requirements: