Oracle Cloud Infrastructure’s architecture development engineering team is seeking a highly skilled and self-driven Principal Technical Program Manager to manage GPU Platform and Infrastructure projects. The role involves collaborating with cross-functional stakeholders, driving execution of complex products, and managing GPU infrastructure and data center programs.
Responsibilities:
- Manage GPU Platform and Infrastructure projects by aligning priorities from various teams, collaborating with cross-functional stakeholders, and driving end-to-end execution of complex products
- Work on GPU infrastructure, data center enablement, data center optimization, and other areas of distributed computing, highly available cloud services, and virtualized infrastructure
- Collaborate with a multi-functional team, including vendors and partners, to provision, deploy, configure, and maintain GPU servers in data centers so that customers can run AI/ML workloads and cloud-scale applications
- Define project scopes, plan and direct schedules while focusing on regular and timely delivery of value; organize and lead project status and working meetings; prepare and distribute progress reports; manage risks and issues; correct deviations from plans; and perform delivery planning for assigned projects
- Act as a central on-site leader to manage data center programs
- Represent engineering perspectives to partner organizations, product teams, and executive leadership
- Track and manage issues and resolve blockers in a timely manner
Requirements:
- Demonstrated experience in GPU infrastructure and data center-related design and operations such as power, cooling, hardware installation/removal, cabling, and networking
- Excellent oral and written communication skills and experience interacting with both business and engineering staff at all levels, including the executive level
- Experience with technical design discussions and ability to summarize complex trade-offs and options in presentations and technical documentation
- Ability to serve as the central point of contact for all data center activities as floor manager
- Ability to work with cross-functional teams, including data center operations, to track and manage issues and resolve blockers in a timely manner
- Ability to effectively represent engineering perspectives to partner organizations, product teams, and executive leadership
- Aptitude to work with and engage individuals and teams located across multiple geographies and/or cultures
- Thrive and succeed in an innovative and fast-paced environment and not be hindered by ambiguity or conflicting priorities
- Bachelor's degree in computer science, software engineering, technology management, business management, or similar
- 3+ years of experience in datacenter/infrastructure development
- 7+ years of experience as a hands-on technical program manager, preferably in a related industry
- Advanced knowledge of the full life cycle of product development and experience launching and operating customer-facing cloud services
- Experience efficiently and effectively communicating findings and progress to cross-functional teams, senior leadership, and the broader organization, including both technical and non-technical stakeholders
- Self-driven problem solver; able to adapt and thrive in a dynamic, ambiguous, and customer-focused environment
- 10+ years of program/project management, product design or related experience
- Bachelor's degree in Computer Science or relevant technical field or equivalent work experience
- Exposure to large-scale data centers and data center activities
- Knowledge of provisioning and deploying GPU servers, including rack installation, cabling, firmware updates, and OS/hypervisor configuration
- Strong understanding of GPU infrastructure services and thorough knowledge of cloud services such as compute, storage, identity, and networking
- Demonstrated knowledge of OCI, AWS, Azure, or Google IaaS, SaaS, and PaaS services