Grafana Labs, the company behind the open observability cloud, is seeking a Staff Software Engineer for their Platform SysEng squad. This role focuses on improving the performance and scalability of Grafana Cloud's observability platform while collaborating with engineering teams to enhance infrastructure and reliability.
Responsibilities:
- We are hiring for the Platform SysEng squad
- SysEng is currently working across engineering to shorten the time it takes to build out new regions, so that we can meet customer demand
- We’re part of a Platform Engineering group that manages infrastructure for the teams building some of the most cherished tools: Grafana, Mimir, Loki, Tempo, and Pyroscope, to name a few
Requirements:
- Proven delivery of large distributed systems. Experience shipping and operating complex systems that span multiple teams, with clear evidence of technical leadership and impact
- Demonstrable experience in system design. Deep understanding of tradeoffs around latency, consistency, availability, scaling and cost
- Hands-on cloud and platform experience. Solid experience with cloud-native architectures (microservices, containers/Kubernetes, IaC) and the operational practices that keep them healthy
- Reliability and performance ownership. Comfortable defining SLOs/SLIs, doing capacity planning, tuning performance, and driving reliability work end-to-end
- Excellent coding and design skills. You write clear, maintainable, well-tested code and can lead technical designs. We use Go, but experience with Python, C, C++, Rust, or similar languages translates well
- Comfort with AI-assisted development. We embrace AI and agentic development, so we expect you to be curious about and comfortable with AI-powered developer tools, and ideally to have practical experience folding them into a team's workflow
- Influence without authority. Ability to align cross-functional stakeholders, set priorities and drive outcomes in a remote-first environment
- Strong communicator. Clear written and verbal communication that works across engineers and non-technical stakeholders
- You've previously worked in or on open source or other community-based projects. At Grafana Labs, 'OSS is in our DNA'
- Familiarity with Kubernetes scheduling and projects like Karpenter
- Terraform and/or Crossplane experience. We use a mix of both, since each has its strengths
- Experience with Tanka and/or Jsonnet