Responsible for the overall health, performance, and capacity of gaming platform services
Monitor and manage the core gaming platform to ensure SLAs are met
Build and manage systems, infrastructure services and applications through automation
Develop strategy, processes, and shape our existing infrastructure and support procedures
Regularly check code into our CI/CD pipelines
Requirements
You will serve as a primary point responsible for the overall health, performance, and capacity of gaming platform services. This could potentially entail troubleshooting issues across the entire stack: hardware, software, application and network, and other days, identify and drive opportunities to improve automation for the company.
Gain deep application-level knowledge of the systems as well as contributing to their overall design and drive standardization efforts across multiple disciplines and services
Manage timely resolution of all critical and/or complex problems meeting SLA requirements
Ability to effectively communicate with all levels of management and stakeholders
Develop, configure and optimize service and application monitoring and telemetry
Assist in the rollouts and deployment of new product features and installations
Develop tools to improve our ability to rapidly deploy and effectively monitor applications and services in a large-scale environment
Work closely with development teams to ensure that platforms are designed with "operability" in mind.