Implement and improve monitoring, alerting, and incident response systems and processes to ensure high reliability for our customers and meet defined SLOs
Design, build, and maintain resilient, scalable infrastructure utilizing SRE principles and best practices
Attend post-incident reviews, detect patterns and contribute to continuous improvement efforts
Execute performance testing, analyze system bottlenecks, and formulate strategies for capacity planning to ensure our systems meet current and future demands effectively
Build systems where CI/CD test failures serve as immediate, real-time context for agents, enabling them to analyze logs, trace dependencies, and suggest or apply instant code fixes
Requirements
6+ years in SRE, DevOps, or Platform Engineering
Strong understanding and practical application of Site Reliability Engineering (SRE) principles, methodologies, and best practices
Proficiency in programming/scripting languages such as Python, Go, or TypeScript
Practical understanding of integrating LLMs into automated workflows. You know how to feed live system state (like a fresh CI test failure) into an agent as actionable context.
Prior experience in incident management, post-incident reviews, and implementing improvements to prevent future incidents
Ability to troubleshoot complex technical issues systematically and effectively
Hands-on experience with a public cloud provider, ideally Google Cloud Platform (GCP), and a solid understanding of its observability services
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
Excellent communication skills to convey technical concepts and collaborate effectively with diverse teams
Very good knowledge of spoken and written English; German is a plus
Residency in Germany
Tech Stack
Cloud
Google Cloud Platform
Python
TypeScript
Benefits
You are part of an international, dynamic, and highly motivated team of people who have proven they can make things happen
With your work, you accelerate the "energy transition" and hence have a direct impact on our climate
Work with and learn from other super-smart colleagues
You will enjoy direct contact with core decision-makers
You will have an excellent chance of joining one of Europe’s most thriving scaleups full-time
You work remotely (Germany-wide), with offices in Hamburg, Berlin, or Munich
Create a healthy balance alongside your work and enjoy all the benefits of the EGYM Wellpass
Benefits and discounts are yours with Futurebens
Whether city bike or e-bike: be flexible with our job bike leasing and do something good for the environment at the same time