Senior Site Reliability Engineer

2 Months ago • All levels • Administrative • DevOps

About the job

Job Description

Senior Site Reliability Engineer at HHAeXchange with expertise in maintaining PostgreSQL, MySQL, SQL Server, MongoDB, DataDog, CloudWatch, and Solarwinds, with strong incident response and root cause analysis skills.
Must have:
  • PostgreSQL, MySQL, SQL Server
  • DataDog, CloudWatch, Solarwinds
  • Incident Response & RCA
  • Capacity Planning & Optimization
Good to have:
  • AWS VPN, Workspace, Backup
  • System Performance Optimization
  • DR Drills & Planning
  • Cloud Operation Solutions
Perks:
  • Competitive Health Plans
  • 401K Retirement Program
HHAeXchange is the leading technology platform for home and community-based care. Founded in 2008, HHAeXchange was born out of an idea to create a fully comprehensive end-to-end homecare solution to help people who are aging or have disabilities thrive in their homes and communities. Our employees are passionate about transforming the healthcare space by building the only homecare ecosystem that fully connects patients, personal care providers, managed care organizations, and states.  

The Sr. SRE will work with the SRE team on shared full-stack ownership of a collection of services and technologies in the cloud and in our 2 data centers. The individual in this role should be able to work independently and understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.

To perform this job successfully, an individual must be able to perform each essential job duty satisfactorily with or without reasonable accommodation.  Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

Essential Job Duties

    • Maintain PostgreSQL, MySQL, SQL Server, MongoDB.
    • Maintain and implement observability systems, especially DataDog, CloudWatch, and Solarwinds.
    • Act as point of contact and first responder for production issues during business hours.
    • Engage in incident calls and help resolve issues as soon as possible.
    • Engage in capacity planning of resources to make sure there is enough infrastructure available.
    • Conduct root cause analysis and a post-incident review after each incident.
    • Review SRE Jira tickets daily and ensure they are on track according to the defined goals.
    • Maintain and update System Operation Procedures (SOP) for production systems.
    • Act as liaison between onshore and offshore SRE teams.
    • Conduct daily morning production inspections before the start of business hours.
    • Actively coordinate with director of cloud and onshore team during US business hours.
    • Participate in DR drills to make sure a proper disaster recovery plan is in place.
    • Identify and implement changes to optimize system performance.
    • Develop and maintain SRE documentation in internal wiki.
    • Validate and maintain certificates and licenses for different applications.
    • Design and develop cloud operation solutions (AWS VPN, Workspace, AWS Backup).
    • Analyze trends, dive into system dashboards and review key performance metrics to identify anomalies, and proactively address any potential issues.

Other Job Duties

    • Other duties as assigned by supervisor or HHAeXchange leader.
The base salary range for this US-based, full-time, and exempt position is $125,000-135,000, not including variable compensation. An employee’s exact starting salary will be based on various factors including but not limited to experience, education, training, merit, location, and the ability to exemplify the HHAeXchange core values.
 
This is a benefits-eligible position. HHAeXchange offers competitive health plans, paid time off, company-paid holidays, a 401K retirement program with a Company-elected match, and other company-sponsored programs.

HHAeXchange is an equal-opportunity employer. The Company offers employment opportunities to all applicants and employees without regard to race, color, religion, national origin, sex, sexual orientation, gender identity or expression, age, disability, medical condition, marital status, veteran status, citizenship, genetic information, hairstyles, or any other status protected by local or federal law.
New York, New York, United States


