Fluidstack is focused on delivering large-scale compute infrastructure for AI. The Data Center Deployment Project Manager will drive the execution of bringing data halls and AI/GPU clusters online, coordinating across various teams to ensure timely deployments.
Responsibilities:
- Own deployment execution for assigned data halls and AI/GPU clusters, from commissioning readiness (L2/L3 Cx) through operational turnover
- Track and drive deployment milestones, critical-path tasks, and go/no-go criteria, keeping schedules and runbooks current and accurate
- Coordinate on-site activities spanning construction handover, infrastructure, ICT/low-voltage, CDU/DLC, hardware installation, and network bring-up
- Manage and hold accountable OEM/ODM vendors, integration contractors, and third-party service providers throughout active deployment
- Confirm commissioning and hardware-validation steps are completed and documented before handoff — burn-in, network validation, and GPU/TPU health checks
- Own a clean operational handoff: acceptance criteria, as-built documentation, runbooks, and turnover to the operations team
- Surface risks, blockers, and schedule slips early, and escalate with proposed options rather than problems alone
- Maintain clear program artifacts: schedules, task trackers, risk and issue logs, punch lists, and handoff checklists
- Provide regular, clear status updates on deployment health, risks, and milestones to program leads and stakeholders
- Collaborate with SMEs across networking, ICT, systems engineering, procurement, and operations — validating plans before executing, not in isolation
- Contribute to the SOPs, runbooks, and process improvements that make each subsequent deployment faster and more repeatable
Requirements:
- 3–5 years delivering data center, infrastructure, or large-scale hardware deployment projects — construction, commissioning, MEP, or IT/network deployment backgrounds are all welcome
- Working understanding of the data center delivery lifecycle and how construction handover, commissioning, and hardware/network deployment fit together
- Familiarity with commissioning levels (L2–L5) and what it takes to move a data hall from systems testing to deployment- and operations-ready
- Exposure to AI/GPU or high-density compute infrastructure — or a solid foundation in data center power, cooling, and networking with the drive to come up to speed quickly
- Demonstrated ability to own a project or workstream end to end: tracking milestones, managing dependencies, and driving tasks to completion
- Experience coordinating vendors, contractors, or cross-functional teams on active project or construction sites
- Comfortable operating with ambiguity and evolving requirements in a fast-moving environment
- Strong organizational and communication skills — able to keep schedules, trackers, and stakeholders aligned
- Ownership mindset: you take initiative, fill gaps, and follow through to the outcome rather than waiting to be told
- Willing and able to travel to data center locations, sometimes on short notice (up to 50–60% travel during active deployments)
- Experience on hyperscale or AI/ML cluster deployments (B100/B200/H100-class or GPU clusters)
- Hands-on exposure to commissioning — CxA support, integrated systems testing (IST), or functional performance testing
- Familiarity with direct liquid cooling (DLC), CDU/sidecar, or immersion cooling
- Experience with high-performance networking (InfiniBand, RoCE) or structured cabling / ICT
- Proficiency with project tracking and workflow tools (Smartsheet, JIRA, MS Project, or similar)
- Familiarity with DCIM, BMS, or CMDB tools (NetBox, Sunbird, Nlyte, or similar)
- Exposure to long-lead procurement, logistics, or material coordination during deployment (receiving, cycle counting, site-to-site transfers)
- Experience contributing to SOPs, runbooks, or process documentation
- Relevant certifications, or active progress toward them: PMP or CAPM (Project Management Institute), DCD (Data Center Dynamics) — any track, CDCP (Certified Data Centre Professional) — EPI / GAQM, CDCPM / CDCMP (Uptime Institute / CNet Training), BICSI RCDD or DCDC (Data Center Design Consultant), ITIL Foundation — relevant to operational handoff and service management, Lean or Six Sigma — relevant to SOP development and process improvement