Xebia is a global tech company with a focus on cloud and software solutions. They are seeking a Senior Site Reliability Engineer to improve and scale CI/CD processes, modernize infrastructure provisioning, and enhance automation and reliability using cloud-native and AI-driven tools.

Responsibilities:

Building and supporting tools, processes, and infrastructure that enable faster and higher-quality software delivery and scaling
Ensuring the availability, reliability, and scalability of application infrastructure
Building and supporting continuous integration, delivery, and release pipelines
Ensuring the right metrics are collected, monitored, and actionable

Requirements:

smart and tech-savvy engineer with 5+ years of experience in DevOps practices and Continuous Delivery
practical knowledge of AWS services, infrastructure, and networking
solid experience with Kubernetes (ideally EKS on AWS) and container orchestration
Python knowledge
experience working with AI Agents
familiarity with Claude Code
experience with FastMCP or other MCP libraries
hands-on experience with GitOps practices, preferably with ArgoCD
strong skills in Terraform and Helm
proficiency in Bash scripting (PowerShell is a plus)
experience with CI/CD pipelines and tooling (GitLab CI/CD, GitHub Actions, or similar)
experience with monitoring, observability, and logging tools (e.g., Prometheus, Grafana, AppDynamics, OpenSearch)
strong security awareness (OWASP, encryption, secrets management)
highly communicative and collaborative, with a strong sense of ownership
upper-intermediate / advanced English (B2/C1)
AWS certifications (e.g., Solutions Architect, Platform Engineer)
familiarity with FluxCD
experience with Rancher
knowledge of Keycloak

Senior Site Reliability Engineer - AWS & AI | EU

Key skills

About this role

Responsibilities:

Requirements: