Nebius is leading a new era in cloud computing to serve the global AI economy. As a Senior Technical Program Manager - Data Center Operations, you will drive operational performance across data center IT and infrastructure teams through data-driven insights and structured tracking, ensuring transparency and optimizing execution at scale.
Responsibilities:
- Define, implement, and continuously improve KPIs, metrics, and dashboards for data center operations
- Establish tracking frameworks to monitor performance, incidents, and operational efficiency across teams
- Lead weekly and monthly operational reviews to identify bottlenecks, inefficiencies, and improvement opportunities
- Ensure visibility and accountability through structured reporting and data-driven insights
- Standardize processes across sites, improving operational consistency, scalability, and efficiency
- Collaborate with stakeholders to align operational performance with business goals and customer SLAs
Requirements:
- 5+ years of experience in technical project/program management, preferably in data center, cloud, or infrastructure environments
- Experience with project management methodologies and tools
- Strong experience in building metrics, dashboards, and reporting systems for operations
- Strong technical skills: SQL and/or a programming language such as Python or Go
- Strong analytical skills with a good foundation in math and statistics
- Proven ability to work across multiple teams and drive alignment and execution without direct authority
- Excellent communication and stakeholder management skills
- A structured, proactive, and results-driven mindset
- Familiarity with ITIL/ITSM processes
- Experience working with GPU clusters, HPC, or cloud infrastructure
- Understanding of data center network traffic patterns (east-west and north-south)