Match score not available

Site Reliability Engineer L4/L5 - Live Streaming Pipeline

extra holidays - extra parental leave
Remote: 
Full Remote
Contract: 
Salary: 
100 - 720K yearly
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

5+ years operational experience in large scale systems, Experience with video transport protocols (RTP, RTMP, SRT, etc.), Expert-level knowledge of Unix/Linux systems, Understanding of networking principles and protocols, Proficient in a programming language like Python or Go.

Key responsabilities:

  • Maintain highly scalable and reliable services worldwide
  • Conduct functional and performance testing for live streaming
  • Collaborate with stakeholders for event execution
  • Analyze server and application performance data
  • Participate in on-call rotation and flexible hours based on live events
Netflix logo
Netflix XLarge https://jobs.netflix.com/
10001 Employees
See more Netflix offers

Job description

Netflix is one of the world’s leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

The Role

Netflix has been changing how people watch shows and movies, enabling on-demand access to thousands of movies and TV shows. Recently, Netflix has expanded its entertainment offering to include Live content, like Chris Rock Comedy Special, the SAG Awards ceremony or The Netflix Slam tennis match. Bringing stories in real-time to 270+ million viewers around the world is a hard challenge, demanding exceptional levels of stability and reliability from dozens of services and systems between camera and device screens. About the role In this role, you will support our live-streaming pipeline team and day-to-day live-streaming operations for Netflix. As a Live Streaming Pipeline SRE, you will be responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin). Instrumenting end-to-end observability and visualizing the data to achieve the desired availability at scale. Working with cross-functional teams in the preparation, validation, and execution of live streaming-focused initiatives. You will impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days. You will be leading innovation initiatives, driving new features that will enhance our live streaming services, encoding & content delivery. Responsibilities:
  • Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation with the primary goal of maintaining highly scalable and reliable services worldwide.
  • Implement, automate, execute, and analyze the results from a broad range of live streaming delivery-focused functional, performance, resilience, and fault injection testing.
  • Coordination, collaboration, and partnership across multiple stakeholders for the smooth execution of live-streaming events.
  • Aggregate, analyze and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized, and efficient toolset for service delivery optimization and system reliability improvements.
  • Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule.
Qualifications:
  • 5+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on live-streaming and video-on-demand (VOD) delivery.
  • Experience with video transport protocols such as RTP, RTMP, SRT, UDP, Zixi, RIST, HLS, MPEG-DASH.
  • Knowledge of and proven experience with HTTP cache/proxy technologies. Experience supporting live-streaming delivery at scale.
  • Expert-level knowledge of Unix or Linux system engineering fundamentals (networking, storage, operating systems) at scale. 
  • Proficient understanding of networking principles, transport, and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S.
  • Experience with using distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc).
  • Proficient in a programming language such as Python or Go.
  • Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners.
  • Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience).
Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top of market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $100,000 - $720,000. Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs.  Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more detail about our Benefits here. Netflix is a unique culture and environment.  Learn more here.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity of thought and background builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication

Site Reliability Engineer (SRE) Related jobs