Site Reliability Engineer L5 - Open Connect

1 Week ago • All levels • DevOps • $100,000 PA - $720,000 PA

Job Summary

Job Description

As a Site Reliability Engineer L5 at Netflix's Open Connect, you'll design, scale, operate, automate, and analyze the globally distributed CDN, focusing on Edge Accelerator services. Responsibilities include improving resilience, security, observability, QoE, monitoring, and automation. You'll analyze massive datasets using Netflix's Big Data platform to optimize service delivery and system reliability. On-call rotation and handling production issues are also key aspects of this role. Experience with *nix, networking, data analysis, and large-scale service operations is essential, along with proficiency in programming languages like Go, C, or Python.

Must have:
  • CDN and HTTP cache/proxy expertise
  • Deep understanding of internet protocols
  • Building and maintaining highly distributed systems
  • Proficiency in Go, C, or Python
  • Experience with distributed analytics
  • Excellent communication skills

Job Details

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-loved stories, Netflix is responsible for a significant portion of global internet traffic. To steward that responsibility, we work collaboratively with ISPs to deploy Open Connect, Netflix’s Content Delivery Network (CDN), our in-house custom-built network and server infrastructure responsible for delivering 100% of Netflix's video traffic.

In addition to streaming video delivery, Open Connect Appliances (OCAs) are ideally situated to also improve the latency between clients and the Netflix services running on AWS. The Open Connect Edge Accelerator is taking advantage of the highly geo-distributed nature of Open Connect to improve the quality of experience. It is the entry point for device and website traffic, putting it on the critical path to delivering and monitoring our product experiences. 

We are seeking a seasoned Reliability Engineer with extensive experience in *nix, networking, data analysis, and large-scale service operations to design, scale, operate, automate, and analyze our globally distributed CDN, with a focus on the Edge Accelerator services. You will work on reliability, resilience, performance, latency measurement, steering solutions, low-latency reverse proxy, failover mechanisms, protocol optimizations, and DDoS protection, to name a few.

Qualifications

  • Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies

  • Deep understanding of Internet protocols like TCP, TLS, HTTP/S, and DNS

  • Experience building and maintaining highly distributed, scalable, low-latency, fault-tolerant production systems with a focus on security and reliability 

  • Proficiency in a programming language such as Go, C, or Python

  • Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc.)

  • Great communication and documentation skills targeted at cross-team collaboration

  • Motivated by “the art of the possible” and able to balance idealism and pragmatism

  • Cool-headed during production issues, able to focus on problem resolution

  • Preferred - BS in Computer Science, Electrical Engineering, or Computer Engineering (or equivalent professional experience)

Responsibilities 

  • Drive continual improvement in resilience, security, observability, quality of experience (QoE), monitoring, instrumentation, and automation with the primary goal of maintaining highly scalable and reliable CDN services worldwide

  • Aggregate, analyze and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized, and efficient toolset for service delivery optimization and system reliability improvements

  • Participate in on-call rotation and handle escalations for service delivery production issues

  • Have lots of discussions about all the great content and your favorite movies and series 

Does this sound interesting? Or does it sound interesting but intimidating? Please don’t self-select; let’s figure it out together. Come join us and play a meaningful role in our journey to entertain the world! We’d love to talk to you!

Netflix is a global company with a diverse member base, which is why the content we produce reflects that: global perspectives and global stories. As we grow globally, we must have the most talented employees with diverse backgrounds, cultures, perspectives, and experiences to support our innovation and creativity. We are an equal opportunity employer and strive to build balanced teams from all walks of life.

Our culture is unique, and we tend to live by our values, so it’s worth learning more about Netflix.

At Netflix, we carefully consider a wide range of compensation factors to determine your personal top of market. We rely on market indicators to determine compensation and consider your specific job, skills, and experience to get it right. These considerations can cause your compensation to vary and will also be dependent on your location. The overall market range for roles in this area of Netflix is typically $100,000 - $720,000. This market range is based on total compensation (vs. only base salary), which is in line with our compensation philosophy.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.
