Design, integrate, and deploy Rancher-managed RKE2 clusters across cloud, on-premises, and edge environments.
Integrate and operate certified CNIs/CSIs for RKE2 such as Calico, Cilium, Canal, Multus, and Longhorn.
Develop and maintain automation for cluster lifecycle using Terraform, Ansible, Helm, and GitOps.
Implement and manage networking, storage, upgrades, security hardening, and HA configurations for large-scale RKE2 environments.
Troubleshoot advanced issues across Kubernetes, Rancher Manager, RKE2, Linux, and cloud infrastructure layers.
Maintain and troubleshoot the OpenNebula Kubernetes Cluster API Provider (CAPONE) and storage provider (OpenNebula CSI driver).
Monitor and optimize performance, reliability, and scalability of clusters in cloud-edge environments.
Support engineering teams and customers with solution design, issue escalation, demos, and cloud-edge integration.
Produce and maintain technical documentation, architecture diagrams, runbooks, and procedural guides.
Ensure compliance with the RKE2 Support Matrix, certified components, and industry best practices.
Requirements
Bachelor’s or Master’s degree in Computer Science, Software Engineering, Telecommunications, or a related discipline.
Certifications such as CKA, CKAD, SUSE Rancher, or equivalent are a strong plus.
3+ years of hands-on experience running Kubernetes in production environments.
Experience designing or operating distributed, multi-site, cloud-edge, or telecom-grade architectures.
Proven track record deploying and managing RKE2 or other hardened Kubernetes distributions.
Deep expertise with RKE2, Rancher Manager, and the broader Rancher ecosystem.
Strong understanding of the RKE2 Support Matrix and certified CNI/CSI components (Calico, Cilium, Canal, Multus, Longhorn).
Strong Linux engineering background (debugging, networking, kernel-level troubleshooting, security hardening).
Hands-on experience with IaC and automation tools: Terraform, Ansible, Helm, GitOps (Flux/ArgoCD).
Knowledge of HA patterns, distributed systems, and observability stacks (Prometheus, Grafana, Loki, etc.).
Experience with languages such as Go, Ruby, and Bash.
Experience developing and maintaining Kubernetes controllers and solutions based on Kubernetes Cluster API.
English fluency at a professional or native-equivalent level, with excellent clarity and expression in both writing and speech.
Strong customer service mindset, with a focus on responsiveness and user satisfaction.
Clear communication and documentation with strong written and verbal English, async collaboration, and visibility of work.
Self-management and accountability, with the ability to work independently, manage time, and take ownership of tasks and deadlines.
Technical autonomy and tool proficiency with confidence in using Git, CI/CD, remote collaboration tools (Slack, Zoom, GitHub, etc.), and solving problems without direct supervision.
Tech Stack
Ansible
Cloud
Distributed Systems
Flux
Grafana
Kubernetes
Linux
Prometheus
Ruby
Terraform
Go
Benefits
Competitive compensation package and Flexible Remuneration Options: Meals, Transport, Nursery/Childcare
Company-provided workstation
Private Health Insurance
6-hour workday on Fridays and every day during August
PTO: Holidays, Personal Time, Sick Time, Parental leave.
All-remote company with a bright, centrally located HQ in Madrid and offices in Boston (USA) and Brno (Czech Republic)
Healthy Work-Life Balance: We encourage the right to digital disconnection and promote harmony between employees' personal and professional lives
Flexible hiring options: Full Time/Part Time, Employee (Spain/USA) / Contractor (other locations)