Own and operate our application stack and AWS infrastructure to orchestrate and manage our hosted customer instances of Metabase
Debug runtime issues across the different levels of our application stack and hosting stack.
Develop and build our internal tooling and automation to manage the lifecycle of a hosted Metabase installation, from purchase to deployment, zero-downtime upgrades, and general operational health
Continuously improve our automated deployments and testing
Requirements
Is thoughtful and careful
Compulsively automates everything and documents it
Is able to make solid technical judgements and back them up articulately
Has at least 5 years of experience building and operating production infrastructure, ideally on public cloud
Strong Kubernetes and AWS experience
Strong experience with IaC and Terraform
Can write high quality and readable code in a modern language (e.g. Python, Go, etc.)
Experience with modern monitoring stacks (e.g Prometheus/Grafana/Datadog)
Tech Stack
AWS
Cloud
Grafana
Kubernetes
Prometheus
Python
Terraform
Go
Benefits
flexibility (define your own schedule and work from wherever you want)
autonomy
environment that fosters growth, learning, and development