Team Leadership: Lead, mentor, and develop a team of DevOps and Systems Engineers, balancing high-level cloud automation with deep-dive systems performance and workload priorities.
Hybrid Infrastructure Ownership: Design, deploy, and operate complex runtime infrastructure across AWS and physical on-premises data centers, managing the full stack from bare metal to the application layer.
Low-Level Systems Engineering: Drive deep OS-level tuning (Linux kernel, I/O scheduling, memory management) and advanced networking (BGP, Layer 2/3 switching, SDN) to optimize workload performance and density.
Database & Storage Performance: Oversee the architecture and performance tuning of distributed databases and storage systems, ensuring high-throughput, low-latency data persistence across hybrid environments.
Kubernetes & Orchestration: Lead the evolution of Kubernetes and container orchestration, focusing on bare-metal clusters, custom CNI plugins, and service mesh performance.
Modern Workflows: Drive the adoption of GitOps workflows using Argo CD and Crossplane, extending automation to manage physical assets and cloud resources via a unified control plane.
Security Posture: Continuously improve infrastructure security, from hardened OS kernels and physical access controls to cloud-native security best practices.
Requirements
Deep Linux Internals: Expert-level knowledge of the Linux Kernel (process management, signals, namespaces, cgroups) and the ability to perform performance profiling using tools like perf, strace, bpftrace, or ftrace.
Advanced Networking (L2–L4): Proven experience with low-level networking, including BGP/OSPF routing, VLAN tagging, Load Balancing algorithms, and tuning TCP/IP stack parameters (e.g., congestion control, buffer sizes).
Physical Infrastructure & Virtualization: You must understand (virtual) hardware life-cycle management (firmware, BIOS/UEFI, RAID).
Database Systems & Tuning: Experience managing and tuning database performance (PostgreSQL, MySQL, or NoSQL) at the OS and disk I/O level, understanding WAL, vacuuming, and memory-mapped files.
Systems-Level Scripting: High proficiency in Python or Go, plus advanced Bash scripting for automating low-level system tasks and interacting with hardware APIs.
Infrastructure as Code (IaC): Deep experience with Ansible (for configuration/state management) and Terraform (for hybrid resource orchestration).
Tech Stack
Ansible
AWS
Cloud
Kubernetes
Linux
MySQL
NoSQL
Postgres
Python
Switching
TCP/IP
Terraform
Go
Benefits
CLT contract — enjoy job stability and reliable benefits that value your work
Quarterly performance bonuses to reward your hard work and achievements
Meal allowance paid into Caju card — R$100 per working day
Health insurance with SulAmérica — 50% covered by the company
Extended parental leave
180 days of maternity leave and 20 days of paternity leave.
Daycare allowance provided to mothers with children under 3 years old — R$500 per month per child
Kickstart your journey with an exciting welcome onboarding trip to one of our European offices (Lisbon or Berlin)!