Senior Site Reliability Engineer

6-10 Years • DevOps

About the job

Job Description

The Senior Site Reliability Engineer (SRE) at Microsoft's Windows Servicing & Delivery (WSD) team will be responsible for ensuring the reliability and performance of Windows client, Windows Update, and Windows Autopatch. This role involves troubleshooting complex problems, resolving customer issues, collaborating with engineering teams, performing technical investigations, and proactively identifying and preventing issues affecting millions of clients. Responsibilities include leading troubleshooting, liaising between customers and engineering teams, identifying trends, implementing changes, collaborating with other engineering teams, using advanced tools for analysis and problem-solving, and consistently providing excellent customer service. The ideal candidate will possess strong debugging skills, experience in a customer-facing SRE role, and proficiency in troubleshooting various technologies.
Must have:
  • 6+ years experience in relevant field
  • 4+ years in customer-facing SRE role
  • Proficient troubleshooting and data analysis
  • Debugging and code analysis skills
  • Collaboration and communication skills
Good to have:
  • C/C++/C# code reading and analysis
  • WinDbg debugging experience
  • Windows on Azure cloud platform experience
  • Microsoft Intune, Entra, and Device Management knowledge
  • PowerShell/VBScript scripting experience
  • Knowledge of Windows Update, Autopatch, WUfB
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

We are looking to expand our team of next-generation Site Reliability Engineers (SREs), who enable the success of Microsoft solutions for our Commercial & Enterprise customers. The Senior Site Reliability Engineer will be responsible for solving complex problems and ensuring the reliability of our products whilst maintaining customer satisfaction. As a technical leader, you will have many opportunities to assist in the growth of your colleagues through one-on-one mentoring, one-to-many education scenarios, and incident response. We’ll provide you with abundant resources, including a rich content library and advanced diagnostic tools. As a member of this organization, you will benefit from access to the most comprehensive collection of experts as well as the opportunity to work directly with the Product Managers and Software Engineers who design and build Microsoft products.

The Windows Servicing & Delivery (WSD) SRE Team uses diagnostic data and deep technical experience to optimize the reliability and performance of our product offerings, with a focus on Windows client, Windows Update, and Windows Autopatch.

We are looking for a Senior Site Reliability Engineer to join our team and bring valuable site reliability contributions to Windows and associated services. You will collaborate with multiple Microsoft engineering teams, especially in the Windows area. As a member of this team, you will be responsible for debugging, troubleshooting, filing bugs, and resolving customer issues. You will work directly with developers and with customers who range from local to global corporations. The SRE team performs technical investigations to solve customer and service incidents and works on proactive service improvements. Work on this team isn’t just about fixing one system but about thinking at scale to help fix or prevent problems affecting millions of client systems.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required Qualifications:

  • 6+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration.
  • 4+ years’ experience in a customer-facing site reliability, service engineer, or support engineer role.

Other Requirements:

Ability to meet the Microsoft, customer, and/or government security screening requirements for this role. These requirements include, but are not limited to, the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications: 

  • Able to read and analyze C/C++/C# code and perform source-code-level investigations.
  • Familiar with debugging native C/C++ and managed C# code using WinDbg.
  • Reliability and performance of Windows on the Azure cloud platform (Virtual Machines/Containers/Hypervisor/Virtualization).
  • Technical proficiency, troubleshooting skills, and a learning attitude towards Microsoft 365 (M365) technologies.
  • Working knowledge of Microsoft Intune, Microsoft Entra and Device Management.
  • Experience with scripting language-based development (PowerShell, VBScript).
  • Knowledge of the Windows Update space, specifically Windows Autopatch and Windows Update for Business (WUfB), including feature upgrades, quality updates, and driver updates.
  • Proficient troubleshooting and data/log analysis skills (Perfmon/XPerf/ETL/ETW). 
  • Experience with networking protocols and knowledge of troubleshooting network issues, infrastructure components, and cloud services.

Site Reliability Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

Microsoft will accept applications for the role until November 17, 2024.

Responsibilities

Site Reliability Engineers (SRE) within Windows Servicing and Delivery are responsible for quickly and accurately responding to the problems our customers are experiencing, creating solutions that scale, and proactively identifying problem indicators and acting before issues occur. As a Senior SRE, you are expected to: 

  • Lead troubleshooting investigations to bring quicker issue resolution to complex problems impacting our customers, to improve our customer experience and contribute to the growth of our products.
  • Liaise between our customers and other engineering teams across Microsoft when required.
  • Identify emerging trends or recurring scenarios for our technologies and drive improvement feedback/repair items into our engineering team.
  • Perform methodical change implementations and validation tests to measure effectiveness and expected outcomes.
  • Exhibit leadership through personal responsibility, accountability, and teamwork.
  • Collaborate with other Engineering teams regularly to provide technical reviews and action plans for the customers.
  • Use trace analysis, debugging skills, source code, and other proprietary tools to analyze problems and develop solutions that meet customer requirements.
  • Display a customer-obsessed mindset and provide a great customer experience.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
$117.2K - $229.2K/yr (Outscal est.)
$173.2K/yr avg.
Atlanta, Georgia, United States

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
