Principal Site Reliability Engineer

11 Hours ago • 8-10 Years • DevOps

About the job

Job Description

Zscaler seeks an experienced Principal Site Reliability Engineer to join its Engineering Team. This hybrid role (3 days on-site in San Jose, CA) involves working with large-scale distributed systems, cloud platforms (AWS, GCP, Azure), and infrastructure as code. Responsibilities include supporting large-scale services, managing high-pressure situations, participating in on-call rotations, developing and enhancing tools, diagnosing and fixing issues, and developing automation tools. The ideal candidate will have 8-10+ years of experience in SRE, a deep understanding of SRE principles, and experience with incident response and large-scale distributed systems. Proficiency in Python, Golang, Java, or Rust is preferred.
Must have:
  • 8-10+ years SRE experience
  • Large-scale distributed systems expertise
  • Cloud platform (AWS, GCP, Azure) knowledge
  • Incident response and resolution skills
  • Infrastructure as code experience
Good to have:
  • Bachelor's Degree in CS/MIS
  • Proficiency in Python, Golang, Java, or Rust
  • Kubernetes experience
  • Standard SDLC experience
Perks:
  • Various health plans
  • Time off plans
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks

About Zscaler

Serving thousands of enterprise customers around the world including 40% of Fortune 500 companies, Zscaler (NASDAQ: ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world’s largest security cloud, Zscaler accelerates digital transformation so enterprises can be more agile, efficient, resilient, and secure. The pioneering, AI-powered Zscaler Zero Trust Exchange™ platform protects thousands of enterprise customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location. 

Named a Best Workplace in Technology by Fortune and others, Zscaler fosters an inclusive and supportive culture that is home to some of the brightest minds in the industry. If you thrive in an environment that is fast-paced and collaborative, and you are passionate about building and innovating for the greater good, come make your next move with Zscaler. 

Our Engineering team built the world’s largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your vision and passion to our team of cloud architects, software engineers, security experts, and more who are enabling organizations worldwide to harness speed and agility with a cloud-first strategy.

We're looking for an experienced Principal Site Reliability Engineer to join our Engineering Team, reporting to the VP of Engineering.  This is a hybrid role going onsite in our San Jose, CA office 3 days a week.  In this role, you will:
  • Work with large scale distributed systems, cloud platforms (AWS, GCP, Azure) and infrastructure as code.
  • Support large-scale services, manage high-pressure situations, and participate in on-call rotations.
  • Develop and enhance tools for large scale services technologies, ensuring high standards in system design and code quality.
  • Diagnose and fix issues by editing code, modifying infrastructure configurations, conducting network and performance analysis and creating reusable tooling.
  • Develop automation tools and optimize services through version-controlled infrastructure-as-code.

What We're Looking for (Minimum Qualifications)

  • U.S. citizenship is required for this position due to the nature of the customers assigned to this role. 
  • 8-10+ years of relevant experience working in SRE teams, supporting mission critical production service.
  • Deep understanding of SRE principles, practices, and tools.
  • Experience with large scale distributed systems, cloud platforms (AWS, GCP, Azure) and infrastructure as code.
  • Experience with incident response including resolving system failures and outages, with a focus on engineering solutions is support of production reliability.

What Will Make You Stand Out (Preferred Qualifications)

  • Bachelor's Degree in Computer Science, Management Information Systems, or equivalent experience.
  • Proficiency in Python, Golang, Java, or Rust. Experience working in a standard SDLC.
  • Experience with Kubernetes on multiple cloud provider platforms.

#LI-Hybrid

#LI-BH1

Zscaler’s salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training.

The base salary range listed for this full-time position excludes commission/ bonus/ equity (if applicable) + benefits.

Base Pay Range
$161,000$230,000 USD

At Zscaler, we believe that diversity drives innovation, productivity, and success. We are looking for individuals from all backgrounds and identities to join our team and contribute to our mission to make doing business seamless and secure. We are guided by these principles as we create a representative and impactful team, and a culture where everyone belongs. For more information on our commitments to Diversity, Equity, Inclusion, and Belonging, visit the Corporate Responsibility page of our website.

Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including:

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!

By applying for this role, you adhere to applicable laws, regulations, and Zscaler policies, including those related to security and privacy standards and guidelines.

Zscaler is proud to be an equal opportunity and affirmative action employer. We celebrate diversity and are committed to creating an inclusive environment for all of our employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy or related medical conditions), age, national origin, sexual orientation, gender identity or expression, genetic information, disability status, protected veteran status or any other characteristics protected by federal, state, or local laws.

See more information by clicking on the Know Your Rights: Workplace Discrimination is Illegal link.

Pay Transparency

Zscaler complies with all applicable federal, state, and local pay transparency rules. For additional information about the federal requirements, click here.

Zscaler is committed to providing reasonable support (called accommodations or adjustments) in our recruiting processes for candidates who are differently abled, have long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support.

View Full Job Description
$161.0K - $230.0K/yr (Outscal est.)
$195.5K/yr avg.
San Jose, California, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Axonius gives customers the confidence to control complexity by mitigating threats, navigating risk, automating response actions, and informing business-level strategy. With solutions for both cyber asset attack surface management (CAASM) and SaaS management, Axonius is deployed in minutes and integrates with hundreds of data sources to provide a comprehensive asset inventory, uncover gaps, and automatically validate and enforce policies. Cited as one of the fastest-growing cybersecurity startups, with accolades from CNBC, Forbes, and Fortune, Axonius covers millions of assets, including devices and cloud assets, user accounts, and SaaS applications, for customers around the world. For more, visit Axonius.com.

Bengaluru, Karnataka, India (On-Site)

San Jose, California, United States (Remote)

South Korea (Remote)

Tennessee, United States (Remote)

Escazu, San José Province, Costa Rica (Hybrid)

Pennsylvania, United States (Remote)

Texas, United States (Remote)

Sahibzada Ajit Singh Nagar, Punjab, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Axinous

Similar Jobs

Meta - Software Engineer, Infrastructure

Meta, United States (On-Site)

Sinch - Senior Software Engineer (Java)

Sinch, India (Hybrid)

Microsoft - Member of Technical Staff - MacOS Engineer

Microsoft, United States (Hybrid)

Rackspace Technology - Job Opportunity :  Lead Database Engineer : Night Shift

Rackspace Technology, India (Hybrid)

Info Stretch - Full Stack Developer – (React / Node)

Info Stretch, United Kingdom (On-Site)

Picarro - DevOps Manager

Picarro, India (On-Site)

The Walt Disney Company - Sr Site Reliability Engineer

The Walt Disney Company, United States (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

CloudHire - Sr. Database Engineer

CloudHire, India (Remote)

InMobiInMobi - Senior Solutions Engineer

InMobiInMobi, United Kingdom (On-Site)

Luxoft - Senior Java Developer

Luxoft, Colombia (Remote)

Axinous - Android Software Engineer (Networking)

Axinous, United States (Hybrid)

Alpha Sense - Join AlphaSense India Talent Community

Alpha Sense, India (On-Site)

GoTo Group - Senior Software Engineer (Backend) - DPI

GoTo Group, Indonesia (On-Site)

ION - Cloud Engineer Kubernetes

ION, Italy (Hybrid)

Paytm - Business Analyst - AM/DM

Paytm, India (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in San Jose, California, United States

Meta - Production Engineering

Meta, United States (Hybrid)

Hasbro - Digital Game Event Coordinator

Hasbro, United States (Remote)

Aristocrat Gaming - Customer Success Associate, Interactive

Aristocrat Gaming, United States (Hybrid)

Microsoft - Research Intern - Azure Research – Systems

Microsoft, United States (On-Site)

Fabric - Applied Cryptographer, ZKP Research

Fabric, United States (Remote)

Samsung Semiconductor - Senior Manager, Customer Quality and Reliability

Samsung Semiconductor, United States (On-Site)

Magic Media - Business Development Manager

Magic Media, United States (Remote)

Nintendo - Senior Engineer, Multimedia (NTD)

Nintendo, United States (On-Site)

Reversing Labs - Accounting Manager

Reversing Labs, United States (Remote)

Get notifed when new similar jobs are uploaded

DevOps Jobs

Egnyte - Database Administrator

Egnyte, India (Remote)

Publicis Groupe - Openlink Endur Architect/Senior Architect

Publicis Groupe, India (On-Site)

Zones - Cloud Engineer

Zones, India (On-Site)

Playtika - Senior DATA/AI SRE Engineer

Playtika, Poland (On-Site)

Ubisoft - Machine Learning Developer

Ubisoft, Canada (On-Site)

Microsoft - Site Reliability Engineer II

Microsoft, India (On-Site)

Glean - Site Reliability Engineer (India)

Glean, India (On-Site)

Barracuda Networks  Inc  - Senior Site Reliability Engineer

Barracuda Networks Inc , India (On-Site)

Get notifed when new similar jobs are uploaded