Act as a technical team leader and SRE subject matter expert across the organisation.
Influence engineering culture to promote reliability and operational excellence.
Steer the SRE roadmap, technical strategy and reliability standards in collaboration with leadership and our principal engineers.
Shape the organisation’s observability standards, platform choices, and best practices.
Identify strategic opportunities to improve system reliability and champion them to completion.
Level up the SRE team through mentoring, setting a high bar and leading by example.

8+ years of experience designing, building and scaling complex, highly available systems.
Deep SRE expertise: We strongly align with the principles outlined in the Google SRE Book.
Proven technical leadership: A strong track record of delivery, technical leadership, and cross-team collaboration.
AI Readiness: A willingness to work with AI-assisted development tools as part of your daily workflow. You don't need to be an expert today, but you should be curious, open to learning, and able to critically evaluate AI-generated code before it reaches production.
Stack familiarity: Experience with Terraform, Kubernetes, AWS and the LGTM stack (Loki, Grafana, Tempo, Mimir) would be advantageous.
Coding skills: You are comfortable writing production-quality code, ideally in Go or Python.

Staff Site Reliability Engineer

Key skills