Supporting a Solaris Utility Compute infrastructure fleet, including multiple Oracle VM (OVM) farms running on Oracle T7 and T8 server hardware.
Managing platform capacity to ensure optimal performance, scalability, and high availability of the infrastructure.
Responding to platform incidents and being available to assist with incidents impacting hosted workloads, ensuring timely and effective resolution.
Maintaining high support standards with minimal impact to users and the business, consistently meeting defined SLA timelines.
Driving operational efficiency by identifying opportunities for automation and process improvement, and implementing changes that enhance platform stability and reduce manual effort.
Requirements
Strong hands-on experience and technical expertise in OVM, LDoms, Puppet, Red Hat Satellite, RHEL, and Solaris.
Excellent problem-solving and analytical skills to diagnose and resolve complex infrastructure issues.
Clear and effective verbal and written communication skills for collaboration across teams.
Proven automation and scripting capability, with experience in one or more of Python, Ruby, JavaScript, PHP, YAML, or REST APIs.
Experience with enterprise-scale environments, including disaster recovery planning and large server estates (1,000+ servers).