Build and own the automation that provisions CSP resources, including networking, Kubernetes clusters, and other CSP resources required by application teams.
Manage infrastructure for teams building vital Platform tools like Grafana, Mimir, Loki, Tempo, and Pyroscope.
Participate in on-call rotations to ensure system health.
Requirements
Kubernetes cluster provisioning and lifecycle management.
Management of cluster networking components: load balancing, NAT, DNS, CNIs, network policies, private connectivity for customers, cross-cluster communication.
Management of scheduling and autoscaling.
Maintaining Crossplane compositions and Terraform modules for CSP resources common to our users.
Work with our users (Grafana Cloud application teams) to understand their needs and ensure we’re investing in the right capabilities.
Participation in the Platform department Infrastructure wing on-call rotation.
Tech Stack
Cloud
DNS
Grafana
Kubernetes
Terraform
Benefits
Restricted Stock Units (RSUs) awarded for every role
Bonus (if applicable)
30 days annual leave, including 3 Grafana Shutdown Days