Own and operate the company’s AWS infrastructure, container platforms, and deployment architecture
Lead and execute the migration of DevOps tooling and infrastructure as we modernize the platform from a Django monolith to Node.js microservices and event-driven systems
Design, maintain, and improve CI/CD pipelines and release processes
Write, review, and maintain Infrastructure as Code (Terraform) and automation
Establish and monitor SLOs, SLAs, and operational KPIs
Lead incident response, post-incident reviews, and reliability improvements
Build, mentor, and manage a high-performing DevOps / Platform Engineering team
Partner with Engineering, Product, Security, and QA on platform priorities
Own cloud security, access controls, and compliance practices
Drive cloud cost optimization and vendor management
Requirements
8+ years of experience in DevOps, SRE, Platform Engineering, or a related field
2+ years of experience leading technical teams
Experience operating large-scale AWS environments
Hands-on experience with containerized workloads and orchestration platforms
Proven experience supporting microservices and distributed systems
Experience designing and operating CI/CD pipelines
Strong background in incident response and reliability engineering
Deep knowledge of AWS infrastructure and networking
Strong experience with Docker and ECS (Kubernetes experience is a plus)
Experience with PostgreSQL, Redis, and message queues
Strong proficiency in Terraform for Infrastructure as Code (required)
Scripting and automation skills (Python, Bash, Node.js, or similar)
Understanding of event-driven and asynchronous architectures