Xebia is a global tech company with a focus on cloud and software solutions. They are seeking a Senior Site Reliability Engineer to improve and scale CI/CD processes, modernize infrastructure provisioning, and enhance automation and reliability using cloud-native and AI-driven tools.
Responsibilities:
- Building and supporting tools, processes, and infrastructure that enable faster and higher-quality software delivery and scaling
- Ensuring the availability, reliability, and scalability of application infrastructure
- Building and supporting continuous integration, delivery, and release pipelines
- Ensuring the right metrics are collected, monitored, and actionable
Requirements:
- smart and tech-savvy engineer with 5+ years of experience in DevOps practices and Continuous Delivery
- practical knowledge of AWS services, infrastructure, and networking
- solid experience with Kubernetes (ideally EKS on AWS) and container orchestration
- Python knowledge
- experience working with AI Agents
- familiarity with Claude Code
- experience with FastMCP or other MCP libraries
- hands-on experience with GitOps practices, preferably with ArgoCD
- strong skills in Terraform and Helm
- proficiency in Bash scripting (PowerShell is a plus)
- experience with CI/CD pipelines and tooling (GitLab CI/CD, GitHub Actions, or similar)
- experience with monitoring, observability, and logging tools (e.g., Prometheus, Grafana, AppDynamics, OpenSearch)
- strong security awareness (OWASP, encryption, secrets management)
- highly communicative and collaborative, with a strong sense of ownership
- upper-intermediate / advanced English (B2/C1)
- AWS certifications (e.g., Solutions Architect, Platform Engineer)
- familiarity with FluxCD
- experience with Rancher
- knowledge of Keycloak