AnsibleGrafanaKubernetesLinuxPrometheusPythonTerraformGoBashGitVersion Control
About this role
Role Overview
Ensure Hostinger infrastructure remains healthy and operational.
Diagnose and resolve critical or recurring issues, implementing long-term solutions to prevent future disruptions while reducing operational workload for the shift team.
Plan, execute, and optimize regular infrastructure maintenance, ensuring minimal downtime.
Enhance infrastructure observability by improving monitoring, logging, and alerting systems, enabling faster problem detection and response.
Document infrastructure operational processes and best practices while sharing knowledge with teammates to strengthen team expertise.
Automate repetitive tasks to improve operational efficiency and reduce manual interventions.
Work closely with site reliability engineers to align infrastructure improvements with overall system reliability goals and business objectives.
Requirements
Strong incident management experience, with the ability to quickly diagnose and resolve critical issues while implementing long-term fixes to prevent recurrence.
Deep understanding of Linux systems, including administration, performance tuning, and troubleshooting.
Proficiency in hardware diagnostics and troubleshooting, with hands-on experience in physical server maintenance (particularly Dell and/or HPE), including but not limited to RAID configurations and storage management (setup, monitoring, and recovery strategies).
Hands-on experience with data center network infrastructure.
Hands-on experience in monitoring and alerting strategies, utilizing Prometheus, Alertmanager, and Grafana.
Hands-on experience with Kubernetes, Ansible, and Terraform for infrastructure automation and orchestration.
Proficiency in Bash, Python, and/or Go for automation and tooling.
Proficiency in Git for version control and collaborative development workflow.
Fluency in English, with the ability to communicate complex technical concepts clearly in both written and spoken form.
Team player mindset with go-for-it attitude and the ability to collaborate and communicate asynchronously.
An insatiable appetite for learning and becoming innovative.
Tech Stack
Ansible
Grafana
Kubernetes
Linux
Prometheus
Python
Terraform
Go
Benefits
š 360 Growth: We provide limitless learning opportunities: access to platforms like Reforge and CoachHub, global conferences, physical and digital libraries, and feedback culture. Advance your career with internal mobility and grow with a team eager to share knowledge and support your success.
šÆ Freedom & responsibility: Work on your terms: from a modern office in Yogyakarta or your home office. Enjoy flexibility in managing your schedule and bring your ideas to life in a fast-paced, dynamic environment.
šŖ Wellness simplified: Your health comes first with insurance from Day 1, gym memberships, Headspace subscriptions, recharge leave, and regular health checks. Join sports, arts, and hobby clubs or simply enjoy the balance of a lifestyle that prioritizes wellness.
š **Work hard
play hard:** Celebrate your achievements with company events like Town Hall, Meet the Client initiatives, team-buildings, and workations. Celebrate lifeās big moments with milestone gifts for weddings, new parenthood, and graduations.