Develop and maintain automation solutions to manage and operate large numbers of production servers at scale
Implement consistent configuration, patching, and operational standards across environments
Build and enhance server automation using Bash and PowerShell, with configuration management tools such as Ansible
Support and improve enterprise monitoring and observability using tools like Dynatrace, Splunk, Prometheus, Grafana, Datadog, or ExtraHop
Collaborate with cloud, platform, and operations teams to support workloads running on Kubernetes and major cloud platforms
Participate in enterprise change management processes, ensuring stability and compliance in production environments
Troubleshoot complex operational issues and continuously improve system reliability and performance
Requirements
5+ years of experience in Premier Core, DevOps or IT Operations roles
3+ years of hands-on experience with enterprise monitoring/observability tools (e.g., Dynatrace, Splunk, ExtraHop, Prometheus, Grafana, Datadog)
3+ years of experience with automation tools such as Bash and PowerShell for production server automation, combined with hands-on experience with configuration management solutions such as Ansible
Demonstrated experience managing a large number of production servers, applying consistent configuration, patching and operational standards at scale
Experience with public cloud platforms (AWS, Azure, GCP) and Kubernetes; working familiarity with ITIL practices and enterprise change management
Bachelor’s degree in computer science, engineering, or a related field, or an equivalent combination of education, work, and military experience