Runpod is pioneering the future of AI and machine learning, offering cutting-edge cloud infrastructure for full-stack AI applications. We’re hiring an Engineering Manager to lead a high-impact product engineering team building customer-facing features across Runpod’s console, APIs, and developer workflows.
Responsibilities:
- Own Feature Delivery for a Product Area: Lead a team of engineers responsible for shipping new functionality and iterating existing product surfaces. Deliver predictable, high-velocity outcomes that improve customer experience and business value
- Plan and Execute Roadmaps: Translate product strategy into clear technical plans, milestones, and success metrics. Identify tradeoffs early and manage scope, risk, and timelines
- Technical Leadership & Architecture: Drive pragmatic system design for product-layer services and workflows, ensuring scalability, performance, and consistency with Runpod’s platform
- Build and Grow a Strong Team: Hire, mentor, and develop engineers; set expectations, provide feedback, and create a culture of ownership, speed, and craft in a remote-first environment
- Quality & Reliability for Product Surfaces: Improve automated testing, release safety, observability for customer-facing systems, and regression prevention. Ensure on-call health for your team’s services
- Cross-Functional Collaboration: Partner tightly with Product, Program management, Support, GTM, and Infrastructure counterparts. Coordinate dependencies cleanly while maintaining clear ownership
- Operational Excellence: Establish team rituals and delivery systems (sprints/kanban, retros, design reviews). Use data to improve cycle time, throughput, and feature adoption
- Customer-Driven Execution: Stay close to user needs by incorporating feedback, analytics, and support signals into prioritization and iteration
Requirements:
- 2+ years managing a team of high performance software engineers, including ownership of roadmap delivery, hiring/firing, performance, and team culture
- 6+ years as a software engineer building and shipping products used by millions of users, with clear evidence of personal impact
- Strong experience with Linux systems internals and/or cloud systems engineering - ideally building platform capabilities (orchestration, control planes, distributed services, networking or storage layers)
- Comfortable reviewing and contributing to code in modern stacks; Go, Python, and/or TypeScript experience preferred
- Solid understanding of microservices, APIs, eventing, data stores, and service-to-service communication patterns
- Experience running execution processes and using metrics like cycle time, throughput, adoption, and escape defects to improve team flow
- Proven ability to communicate clearly, align stakeholders asynchronously, and sustain momentum across time zones
- Successful completion of a background check
- Experience building AI/ML + developer platforms, workflow-heavy products, or infrastructure-adjacent product layers (e.g., job orchestration UX, deployment flows, usage/billing, observability surfaces)
- Familiarity with Kubernetes-based platforms, container runtimes, scheduling concepts, or GPU/accelerator workflows
- Track record of scaling teams in high-growth environments while keeping quality high
- Open-source contributions in cloud-native, systems, or ML-adjacent projects