Manage, troubleshoot, and optimize containerized applications and infrastructure deployed on Kubernetes, RedHat OpenShift, and OpenStack platforms.
Serve as the Subject Matter Expert (SME) for core cloud infrastructure technologies.
Prepare and conduct rigorous Root Cause Analysis (RCA) for critical incidents.
Develop, test, and maintain robust automation scripts to streamline daily operational tasks.
Provide end-to-end Escalation, Monitoring, and Emergency (EME) support.
Liaise directly with customer teams and internal teams to understand requirements and deliver tailored solutions.
A rotational on-call schedule is a mandatory part of this position.
Requirements
6+ years of knowledge and proven hands-on experience with Linux administration.
6+ years of knowledge of core networking principles (TCP/IP, routing, load balancing, firewalls) in a cloud environment.
6+ years of knowledge of Kubernetes orchestration, OpenStack platforms, and Docker/Containerization.
Solid Python scripting skills for task automation and system management.
Excellent communication skills (written and verbal in English) and the ability to articulate complex issues clearly.
One or more certifications from the list below will be considered an added advantage: Red Hat Certified Specialist in Cloud Infrastructure (EX210 ), Red Hat Certified Engineer (RHCE) in Red Hat OpenStack (EX310), RHCSA, RHCE, CKA, EX280 (RedHat Certified Specialist in OpenShift Administration), EX380 (RedHat Certified Specialist in OpenShift Automation and API Management).