Senior Site Reliability Engineer

4 Months ago • 4-10 Years • DevOps

Job Summary

Job Description

Senior Site Reliability Engineer with 4+ years of experience in software development, running Kubernetes or equivalent technology in a public or private cloud, building high-availability applications, and proficiency in C# or Java. Strong understanding of service-oriented architectures, virtualization, monitoring, and automation.
Must have:
  • Kubernetes experience
  • High-availability apps
  • C# or Java
  • Service-oriented arch
Good to have:
  • Scripting languages
  • Orchestration tools
  • Load balancing tech
  • Monitoring tools

Job Details

About the job

Summary

Description Summary of This Role

Responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. Creates a bridge between development and operations by applying a software engineering mindset to system administration topics. Splits time between operations/on-call duties and developing systems and software that help increase site reliability and performance.

What Part Will You Play?

  • Chaos engineering - you’re expected to think laterally about how our systems might fail in theory, design tests to demonstrate how they behave in practice, and then formulate and implement remediation plans, as appropriate.
  • Pushing our systems to their limits, and then coming up with designs for how to get them to the next performance tier.
  • Safeguarding reliability. Ensuring that our services are highly available, resilient against disasters, self-monitoring, and self-healing.
  • Running “game days” to test assumptions about reliability and learn what will break before it matters to customers.
  • Reviewing designs with an eye toward increasing the holistic stability of our platform and identifying potential risks.
  • Building systems to proactively monitor the health, performance and security of our production and non-production virtualized infrastructure.
  • Improving our monitoring and alerting systems to make sure engineers get paged when it matters (and don’t get paged when it doesn’t).
  • Troubleshooting systems and network issues, alongside our Technical Operations Team.
  • Mentoring other engineers in reliability-related skills.
  • Evolving our SDLC, practices, and tooling to account for Site Reliability considerations and best practices.
  • Developing runbooks and improving documentation.

What Are We Looking For in This Role?

Minimum Qualifications

  • Bachelor's Degree in computer science or other relevant field of study
  • Typically Minimum 4 Years Relevant Exp in software development

What Are Our Desired Skills and Capabilities?

  • Skills / Knowledge - A seasoned, experienced professional with a full understanding of area of specialization; resolves a wide range of issues in creative ways. This job is the fully qualified, career-oriented, journey-level position.
  • Job Complexity - Works on problems of diverse scope where analysis of data requires evaluation of identifiable factors. Demonstrates good judgment in selecting methods and techniques for obtaining solutions. Networks with senior internal and external personnel in own area of expertise.
  • Supervision - Normally receives little instruction on day-to-day work, general instructions on new assignments.
  • Technical Acumen: Experience running kubernetes or equivalent technology in a public or private cloud. Building and maintaining high-availability applications including redundancy, fail over, scalability, monitoring and performance. Proficiency in coding in either C# or Java. Proficiency in scripting languages such as Shell, Python, Perl, Ruby, etc. Service Oriented or microservice architectures. Experience with virtualization, monitoring and automation. Hands-on experience with orchestration and system configuration tools such as Salt, Ansible, Fabric, Puppet, Chef, Terraform, etc. Load balancing, storage, and clustering technologies. System-level monitoring and alerting tools such as PRTG, Nagios or Zabbix. Linux and/or indows System Administration. Systems and Network Engineering. Continuous Integration tools (eg. Jenkins, TeamCity, Bamboo). Experience with cloud infrastructure and networking in a production context. Experience with physical data centers and networks

Similar Jobs

Riot Games - Staff Software Engineer - Developer Connections

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
Nielsen Holdings - Data Engineer

Nielsen Holdings

Mumbai, Maharashtra, India (Hybrid)
1 Month ago
Sporty Group - OpsTech Backend Engineer

Sporty Group

India (Remote)
3 Months ago
Interactive Brokers - Senior Systems Engineer- Microsoft M365/Active Directory

Interactive Brokers

Chicago, Illinois, United States (Hybrid)
4 Months ago
Netflix - Software Engineer (L5) - Experimentation Platform

Netflix

Los Gatos, California, United States (On-Site)
3 Months ago
Pixar Animation Studios - Build & Release Engineer

Pixar Animation Studios

Emeryville, California, United States (Hybrid)
2 Weeks ago
N-iX - Technical Lead Data Engineer

N-iX

Ukraine (Hybrid)
2 Days ago
Dream11 - Lead Engineer - Cloud Security

Dream11

Mumbai, Maharashtra, India (On-Site)
3 Months ago
HiLabs - Sr. DevOps Engineer

HiLabs

Pune, Maharashtra, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ION - Senior DevSecOps Engineer, Italy

ION

Pisa, Tuscany, Italy (On-Site)
4 Months ago
Google - Software Engineer, Java and Kotlin Ecosystem

Google

(On-Site)
2 Months ago
PwC - Experienced Associate - Forensics Services

PwC

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
4 Months ago
ByteDance - Backend Software Engineer - Global E-Commerce Supply Chain Merchant Platform

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Epic Games - Web Programmer

Epic Games

Vancouver, British Columbia, Canada (On-Site)
1 Week ago
ION - Data Associate - KYC6

ION

Budapest, Hungary (On-Site)
4 Months ago
Kira Studio - Android Developer (GAMP)

Kira Studio

Bengaluru, Karnataka, India (Remote)
4 Months ago
Google - Student Training in Engineering Program (STEP) Intern, 2025

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
Info Stretch - Senior Java Engineer

Info Stretch

Dublin, County Dublin, Ireland (On-Site)
2 Months ago
Microsoft - Data Scientist II

Microsoft

Hyderabad, Telangana, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Pune, Maharashtra, India

Gameopedia - Marketing Manager

Gameopedia

Hyderabad, Telangana, India (On-Site)
2 Months ago
SparkCognition - Senior DevOps Engineer

SparkCognition

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Infoblox - Manager, Enterprise Support

Infoblox

Thiruvananthapuram, Kerala, India (On-Site)
3 Months ago
CloudHire - Senior Content Strategist (Americas)

CloudHire

Delhi, India (Remote)
3 Months ago
Xentrix Studios - Compositing – Junior Artist

Xentrix Studios

India (On-Site)
3 Months ago
Rackspace Technology - DEVOP Engineer (AWS Terraform)-PSDE III

Rackspace Technology

India (Remote)
2 Months ago
WalkingTree Technologies - Senior Software Engineer - QA

WalkingTree Technologies

Noida, Uttar Pradesh, India (On-Site)
7 Months ago
Paytm - Regional- HRBP (Jaipur)

Paytm

Jaipur, Rajasthan, India (On-Site)
1 Month ago
Eccentric - Product Manager

Eccentric

Mumbai, Maharashtra, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

CD PROJEKT RED - DevOps Engineering Manager

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
5 Months ago
Wind River Systems - Cloud Solutions Architect

Wind River Systems

Tokyo, Japan (On-Site)
3 Months ago
Dream Sports - SDE - 1 - DevOps

Dream Sports

Mumbai, Maharashtra, India (On-Site)
3 Months ago
Luxoft - Senior Java engineer (with oncall support)

Luxoft

Ukrainka, Kyiv Oblast, Ukraine (Remote)
1 Month ago
ION - Cloud Engineer/Architect (DevOps)

ION

London, England, United Kingdom (On-Site)
4 Months ago
Quizizz - Platform Engineer

Quizizz

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Xsolla - Cloud Gaming Support Engineer

Xsolla

Montreal, Quebec, Canada (Remote)
3 Weeks ago
SmileGate - Lost Ark Build Manager

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
1 Month ago
Revolgy - GCP Engineer

Revolgy

Prague, Czechia (Hybrid)
1 Month ago
Rockstar Games - Senior DevOps Engineer

Rockstar Games

North Carolina, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded