Strong knowledge of monitoring and logging procedures, including standard protocols, tools and patterns such as Open Telemetry, Grafana, Loki, Thanos/Prometheus, and/or others.
Experience with CI/CD pipelines, and tools such as Jenkins, GitHub Actions, Gitlab Pipelines, ArgoCD or Flux
Experience with certificate issuing and management, including PKI and ACME
Strong focus on reliability (SRE), with experience in maintaining RTO and RPO strategies, auto-scaling policies and performance monitoring
Good understanding of the full compute stack – software, storage, networking, OS, virtualization, configuration management, provisioning
Competent with version control systems, web service APIs and open-source configuration management systems
Working knowledge of fundamental Internet protocols (e.g, TCP/IP, HTTP, DNS)
Motivated to work as part of a team, where trust and collaboration is essential
Excellent interpersonal and communication skills in Portuguese and in English.