Execute the Capacity Governance suite of control activities, ensuring compliance with enterprise governance risk, and control requirements
Manage the capacity breach lifecycle, including identification, escalation, tracking, and remediation of capacity-related breaches
Drive continuous improvement by identifying and enabling automation opportunities within Capacity processes, improving efficiency, strengthening control execution consistency and reducing operational risk
Lead onboarding of infrastructure products into Capacity Governance Frameworks, coordinating across Product Engineering and platform teams
Serve as the Technical Lead across Product Engineering, Development and Operations, translating requirements into actionable execution, driving alignment across teams
Drive capacity reporting governance, ensuring accuracy, completeness and consistency of metrics used for monitoring and decision-making
Support and lead targeted reviews, audits and risk related engagement activities: Internal/External Audit, IRM, Targeted Reviews. Including walkthroughs and delivery of complete and accurate evidence
Build and maintain strong stakeholder relationships and influence across all levels in and outside of immediate department.
Drive remediation activities to address control gaps, implement corrective actions and ensure sustainable risk and control outcomes
Resolve moderately complex issues and lead a team to meet existing client needs or potential new clients' needs while leveraging solid understanding of the function, policies, procedures, or compliance requirements
Lead projects and act as an escalation point, provide guidance and direction to less experienced staff.
Requirements
5+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
2+ years of Risk Management or Business Controls experience
5+ years of experience in IT Capacity Management, Capacity Planning, Capacity Design Optimization, Capacity Forecasting, Capacity Reporting
5+ year of IT Service Delivery Interface including IT System Monitoring, Incident Management, Problem Management, Change Management, Release Management, Configuration Management, Service Level Management and Availability Management experience
5+ years of technical experience with IT Infrastructure Operations including Database, Middleware, Network, Operating Systems, Servers and Storage Devices
2+ years of experience with Enterprise monitoring and visualization tools like Prometheus/Grafana, AppDynamics, Dynatrace, Google logging, Elasticsearch, Splunk, GitHub, Big Panda and Service Now.
Tech Stack
ElasticSearch
Grafana
Prometheus
Splunk
Benefits
Ability to work on-site at approved location / hybrid
Relocation assistance is not available for this position
This position is not eligible for visa sponsorship