Site Reliability Engineer

4 Months ago • 8-10 Years • DevOps

Job Summary

Job Description

Seeking a talented SRE with 8-10 years of experience in building scalable and reliable systems. Must have strong programming skills in Python, Go, Java, or Ruby, and experience with monitoring tools like ELK, Dynatrace, Cloudwatch, etc. Strong problem-solving and communication skills required.
Must have:
  • SRE experience
  • Programming skills
  • Monitoring tools
  • Problem-solving skills
Good to have:
  • Cloud experience
  • On-call management
  • Release management
  • Agile development

Job Details

About the job

About the role

We are seeking a talented Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a strong background in software engineering and systems administration, with a passion for building scalable and reliable systems. As an SRE, you will collaborate with development and operations teams to ensure our services are reliable, performant, and highly available.


Key Responsibilities

  • Experience maintaining and supporting solutions in a Cloud based environment (GCP or AWS)
  • Experience working with various monitoring tools. (eg. ELK, Dyntrace, Cloudwatch, Cloud logging, Cloud Monitoring, BMC Surveyor, BMC Patrol, Grafana, Prometheus)
  • Ensure monitoring and self-healing strategies are implemented and maintained to proactively prevent production incidents.
  • Perform root cause analysis of production issues
  • Design and manage on call and escalation processes – Nice to Have
  • Participate in design reviews and production reviews for new features, products, or pieces of infrastructure
  • Designing and implementing ELK (Elasticsearch, Logstash and Kibana) stack, Prometheus and Grafana solutions for monitoring and alerting.
  • Debug production issues across services and levels of the stack.
  • Establish KPIs to demonstrate maturity, efficiency, and value to our business partners
  • Works as an integral part of the DevOps team with complimentary skills and common goals
  • L3 Support experience is an asset.
  • Work to create a Release management process and help with Out-of-business-hour deployments and support (Rotation with team members)
  • Familiar and comfortable with agile development techniques.


Technology skills (Mandatory)

ELK, Dyntrace, Cloudwatch, Cloud logging, Cloud Monitoring, BMC Surveyor, BMC Patrol, Grafana, Prometheus


Required qualifications to be successful in this role:

  • Bachelor’s degree in computer science engineering, or related field.
  • 8 -10 years of experience as a SRE.
  • Proven experience as an SRE, DevOps engineer, or similar role.
  • Strong programming skills in languages such as Python, Go, Java, or Ruby.
  • Strong problem-solving skills and ability to work under pressure.
  • Excellent communication and collaboration skills.
  • Flexible to work in EST time zones ( 9-5 EST)

Similar Jobs

Meta - Software Engineer, Infrastructure

Meta

Atlanta, Georgia, United States (Remote)
3 Months ago
Enphase Energy - Sr. Staff Engineer - Enlighten Cloud Backend

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
1 Month ago
The Walt Disney Company - Senior Systems Engineer, Data Services [Database Administration]

The Walt Disney Company

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
Quizizz - Platform Engineer

Quizizz

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Nagarro - Senior Staff Engineer (Python Azure Synapse)

Nagarro

India (On-Site)
3 Months ago
Oportun - Senior ML Engineer

Oportun

India (Remote)
3 Months ago
PwC - ETIC, Cloud Solution Architect - Manager

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
3 Months ago
SmileGate - Build Manager [LOST ARK Mobile]

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
4 Weeks ago
GoTo Group - Senior Software Engineer - Event Platform

GoTo Group

Bengaluru, Karnataka, India (On-Site)
3 Months ago
PwC - ETIC, Cloud Solution Architect (Multi-Cloud, DevOps Focus) - Senior Manager

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Demonware - Data Engineering Co-op

Demonware

Vancouver, British Columbia, Canada (Hybrid)
1 Week ago
PhonePe - Software Engineer Backend (Exp. Bucket 7-10 Yrs)

PhonePe

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Trend Micro - Large Language Models (LLM) Expert (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
4 Months ago
Axinous - Principal Software Engineer (ZDX Platform Engineering)

Axinous

San Jose, California, United States (Hybrid)
2 Months ago
Next Level Business Services - Salesforce Tech Lead

Next Level Business Services

San Jose, California, United States (On-Site)
3 Months ago
The Walt Disney Company - Lead Software Engineer, Scala

The Walt Disney Company

Seattle, Washington, United States (On-Site)
1 Month ago
Lulalend - Senior Software Engineer

Lulalend

Cape Town, Western Cape, South Africa (Remote)
4 Months ago
ByteDance - Senior Site Reliability Engineer, CDN

ByteDance

Singapore (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

PwC - IN- Manager_ Employee Central_Enterprise Apps SAP_Advisory_Noida

PwC

Noida, Uttar Pradesh, India (On-Site)
3 Months ago
Sportskeeda - Content Manager - MLB

Sportskeeda

India (On-Site)
4 Weeks ago
Quizizz - Marketing Campaign Specialist

Quizizz

Bengaluru, Karnataka, India (On-Site)
6 Days ago
Luxoft - Murex Datamart Reporting Consultant

Luxoft

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Tenstorrent - CPU Core Regression Debug Engineer

Tenstorrent

Karnataka, India (Hybrid)
4 Months ago
Keywords Studios (Player Support) - Software Engineer II - DevOps (On Contract)

Keywords Studios (Player Support)

Maharashtra, India (Hybrid)
1 Month ago
BlueJeans - Lead Engineer - API/Platform

BlueJeans

Bengaluru, Karnataka, India (On-Site)
3 Months ago
PhonePe - Associate Manager - Content Strategy

PhonePe

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Sportskeeda - Social Media Executive

Sportskeeda

India (Remote)
2 Weeks ago
Hitachi - Azure DevOps CICD

Hitachi

Hyderabad, Telangana, India (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

ByteDance - Senior Site Reliability Engineer - Data Infrastructure (San Jose)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Microsoft - Digital Technology Specialists App Innovation ( Spanish Speaker)

Microsoft

Dublin, County Dublin, Ireland (Hybrid)
1 Month ago
Netflix - Solutions Support Engineer (L5) - Observability

Netflix

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Month ago
EvoPlay - Senior Java Developer

EvoPlay

Limassol, Limassol, Cyprus (On-Site)
1 Month ago
Electronic Arts - [EA Sports FC] DevOps Engineer

Electronic Arts

Seoul, South Korea (On-Site)
2 Months ago
Ness Digital - DevOps Engineer

Ness Digital

Timișoara, Timiș, Romania (Hybrid)
1 Month ago
Microsoft - Senior System Electrical Engineer

Microsoft

Taipei City, Taiwan (On-Site)
1 Month ago
Smart Food Safe  - Sr Devops Engineer

Smart Food Safe

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Admin Looks - Release Manager

Admin Looks

Hyderabad, Telangana, India (Remote)
3 Months ago
Egnyte - Senior Build Engineer - Python - Jenkins

Egnyte

India (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded