Netflix is a leading entertainment company pushing the boundaries of storytelling and technology. They are seeking an Engineering Manager to lead the CI/CD pipelines and lab infrastructure supporting their Linux and FreeBSD-based edge platforms, ensuring security and performance as they expand their services.
Responsibilities:
- Lead and grow a team that combines a highly technical CI/CD and validation group with an operations-focused lab group, blending deep systems expertise with developing talent to support our Linux/FreeBSD-based edge platforms
- Own and evolve scalable CI/CD pipelines and automated qualification with clear release/rollback gates, comprehensive testing, and strong observability
- Evolve and operate test and load infrastructure that runs production-like Live, VOD, Ads, and Cloud Gaming workloads and surfaces platform contention
- Manage lab operations and capacity, including hardware provisioning, inventory and ticket workflows, contractor coordination, remote access, and efficient use of lab resources
- Streamline the lab’s security architecture so that access controls, isolation mechanisms, and workflows are tightly aligned with platform security standards and compliance requirements
- Define processes and tooling so lab operations can reliably support CI/CD and validation needs, bringing order and prioritization to a high-volume, often ambiguous request stream
- Partner closely with OS, hardware, application, and security teams to align validation and lab support with their roadmaps, integrate functional and security guardrails into CI/CD, and debug complex, large-scale regressions
- Lead the use of AI to develop tools and analysis systems that improve developer productivity, operational triage, and the efficiency of CI/CD, validation, and lab workflows
- Partner on the bring-up and evaluation of new devices from CPU, GPU, and other hardware vendors in the lab, enabling analysis, validation, and integration into our edge platforms
- Drive engineering excellence by setting technical standards, defining metrics and SLOs, and using data and postmortems to continually improve systems, practices, and lab operations
Requirements:
- Experience as an Engineering Manager and technical leader for highly technical teams in areas such as infrastructure, platform, systems, CI/CD, and release engineering
- Excellent communication skills and the ability to translate between deep technical detail and clear business impact
- Deep hands-on background in systems software, with meaningful experience in low-level OS development or administration on Linux and/or FreeBSD
- Strong experience with CI/CD and test automation, including designing and operating pipelines for complex multi-component systems and using tools such as GitHub Actions, Jenkins, or similar CI systems
- Proven ability to design, build, and operate services that support CI/CD and test automation, delivering reliable, observable, scalable systems and efficient use of compute and hardware resources
- Demonstrated success leading through influence, working cross-functionally with OS, hardware, and security teams, and driving alignment and roadmaps across multiple groups
- Experience with:
  - Hardware lab automation, PXE boot, imaging, remote power control, serial access, and rack automation
  - Validating BIOS and firmware behavior, managing firmware rollouts
  - Boot, networking, and storage internals for Linux and/or FreeBSD
- Familiarity with:
  - Performance tooling such as perf, flamegraphs, bpftrace/eBPF, DTrace, fio, and network benchmarking tools
  - Using AI tools for operational triage (log clustering, anomaly detection) with clear guardrails and fallbacks
  - Incident response practices, including postmortems and preventative engineering actions
  - Contributions to or collaboration with open-source communities