Design and implement scalable, highly available, and resilient infrastructure on AWS.
Lead the implementation of advanced observability frameworks.
Define SLIs/SLOs and act as a technical lead during critical incidents.
Collaborate with development teams to optimize application performance, providing guidance on best practices and complex architectural decisions.
Requirements
Advanced AWS Expertise: Deep knowledge of major AWS services, including EC2, VPC, ELB, IAM, Lambda, API Gateway, CloudWatch, and data storage services.
Orchestration and Containers: Practical knowledge of Kubernetes (EKS/OpenShift) and Docker.
Production Support: Extensive experience supporting production environments, including deployments, troubleshooting, and incident management best practices.
Advanced Infrastructure as Code: Strong proficiency in Terraform and CloudFormation.
Distributed Systems: Broad understanding of complex distributed architectures and microservices.
Advanced Scripting: Proficiency in Python, Bash, or PowerShell for complex automation.