AnsibleAWSAzureChefCloudGoogle Cloud PlatformGrafanaKotlinKubernetesPrometheusPythonTerraformGoGCPGoogle CloudPulumiCI/CDLeadershipCommunicationCollaborationRemote Work
About this role
Role Overview
Further expand and optimize our cloud infrastructure on Azure and our Kubernetes clusters
designed for high throughput and highest availability
to support Flip's rapid growth across the globe.
Design and implement zero-downtime deployments, rollback mechanisms and disaster-recovery strategies that keep our platform available around the clock.
Evolve our LGTM stack (Loki, Grafana, Tempo, Mimir) to give every team the visibility they need
and use it to define and optimize our SLOs.
Design, develop and optimize infrastructure as code with Pulumi in Go, eliminating toil and making our platform self-service for engineering teams.
Promote CI/CD best practices, incident management, post-mortems and developer experience across the entire engineering organization.
Collaborate with your squad and engineering leadership to define the platform's direction
from scalable, high-throughput systems and cost optimization to security posture and compliance.
Requirements
You have 1–3 years of hands-on experience as a Site Reliability Engineer (SRE), Platform Engineer, DevOps Engineer, Infrastructure Engineer, Cloud Engineer, or Backend Engineer with a strong infrastructure focus.
Experience operating and scaling cloud infrastructures (Azure, GCP, AWS).
Deep knowledge of Kubernetes and container orchestration in production environments.
Hands-on experience with modern observability stacks (e.g. Prometheus, Mimir, Loki, ELK) and comfortable defining and operating SLOs and error budgets.
Solid software development skills in Go (preferred, since our IaC runs on Pulumi in Go), Python or Kotlin.
Hands-on experience with infrastructure as code (e.g. Pulumi, OpenTofu, Terraform) and configuration tooling (e.g. Ansible, Chef).
A collaborative mindset, strong communication skills and business-fluent English.
Willingness to participate in on-call rotations to ensure the reliability of our platform.
Tech Stack
Ansible
AWS
Azure
Chef
Cloud
Google Cloud Platform
Grafana
Kotlin
Kubernetes
Prometheus
Python
Terraform
Go
Benefits
Work mode: We’re remote-first, giving you flexibility to work from home. At the same time, we deeply value the power of in-person collaboration. Depending on the role, you’ll join occasional team events, workshops, or meetings in our Berlin or Stuttgart offices
always with plenty of notice. The exact balance will be discussed during your interview.
Work-Life-Balance: We don't want you to grow roots to your desk chair. That's why we cover the costs of your E-Gym-Wellpass membership and offer job bike leasing.
Celebrating success: Expect highly motivated and committed people in a relaxed working atmosphere.
Be part of something bigger: You actively shape Flip in your role. Along the way, you are an enabler of the rapid growth process of a young tech company and grow towards your goals, fun is guaranteed.
Happy to be a Flipster: Stay tuned for regular team events and culture days that bring us together as Flipsters.
Working abroad: At Flip you can also work abroad in the European Union. Let's talk about remote work in the interview.