Support the CDN delivery and day-to-day live-streaming operations for Netflix
Participate in the preparation, validation, and execution of live streaming focused initiatives in collaboration with related production and engineering teams
Impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days
Lead innovation initiatives, implementing new features, and driving enhancements in the streaming services delivery
Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation to maintain highly scalable and reliable CDN services with excellent quality of experience (QoE)
Implement, automate, execute, and analyze the results from a broad range of streaming CDN delivery focused functional, performance, resilience, and fault injection testing
Coordinate, collaborate, and partner across multiple stakeholders for the smooth execution of live-streaming events
Aggregate, analyze, and correlate large amounts of server and application performance data
Use the innovative Netflix Big Data platform as a toolset for service delivery optimization and system reliability improvements
Participate in an on-call rotation and work flexible hours based on live events schedule, including weekends and holidays
Requirements
Proficient in a programming language such as Python or Go
3+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on live-streaming and video-on-demand (VOD) delivery
Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies
Experience supporting live-streaming CDN delivery on a large scale is a plus
Expert-level knowledge of Unix or Linux system engineering fundamentals (networking, storage, operating systems) at scale (specifically FreeBSD)
Proficient understanding of networking principles, transport, and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S
Experience with using distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners
Tech Stack
DNS
Linux
Python
Spark
SQL
TCP/IP
Unix
Go
Benefits
Health Plans
Mental Health support
401(k) Retirement Plan with employer match
Stock Option Program
Disability Programs
Health Savings and Flexible Spending Accounts
Family-forming benefits
Life and Serious Injury Benefits
Paid leave of absence programs
Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off
Full-time salaried employees are immediately entitled to flexible time off