Senior Datacenter Incident Manager

1 Month ago • 3-5 Years • Operations • $117,200 PA - $250,200 PA

Job Summary

Job Description

The Senior Datacenter Incident Manager at Microsoft coordinates multiple workstreams during crises, leveraging diagnostic expertise to resolve major incidents impacting customers and businesses. Responsibilities include on-call incident response, troubleshooting, deploying fixes, driving automations to prevent recurrence, and conducting post-mortems to identify opportunities for improvement. The role requires end-to-end expertise in service/system design, understanding of technology layers and dependencies, and maintaining advanced knowledge of evolving technology landscapes.
Must have:
  • Advanced technical expertise and judgment
  • Experience in critical environments
  • Incident response and resolution skills
  • Troubleshooting and root cause analysis
  • Post-mortem analysis and reporting
Good to have:
  • 5+ years Data Center experience
  • Service Engineering IC4 experience
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Microsoft Cloud Infrastructure and Operations (CO+I) is the engine that powers Microsoft's cloud services, and we are hiring a Senior Datacenter Incident Manager. The group is responsible for designing, building, and operating Microsoft’s global datacenters; managing the programmatic delivery of our critical infrastructure design, equipment procurement, construction delivery, infrastructure innovation, demand planning, and capacity utilization of our unified infrastructure; and running all operations needed for the physical infrastructure.

We focus on smart growth with an emphasis on automation, data-driven engineering, cost-effectiveness, and environmental sustainability. We deliver the core infrastructure and foundational technologies for Microsoft's 200+ online businesses including Azure, Office 365, Bing, Xbox Live, Skype, and OneDrive. Our portfolio is built and managed by a team of subject matter experts working 24x7x365 to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide.

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.  

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Empower Billions!

Qualifications

Required/Minimum Qualifications

  • Bachelor's Degree in Electrical Engineering, Mechanical Engineering, or related field AND 3+ years technical experience in Critical Environments

    • OR equivalent experience in Datacenter Operations within Critical Environments.

Other Requirements:

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Additional or Preferred Qualifications

  • 5+ years of Data Center Critical Environments experience

Service Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

Microsoft will accept applications and process offers for these roles on an ongoing basis.

#COICareers | #EPCCareers | #DCDCareers

Responsibilities

  • Leverages advanced technical expertise, judgment, and decision making to coordinate multiple work streams and resources in highly complex crisis situations, driving the mitigation plan and resolving the crisis by engaging the necessary teams and escalating to the appropriate stakeholders. Applies diagnostic expertise.
  • Responds to incidents during regular on-call rotations, including highly complex issues with major customer or business impact, by identifying the level of impact, troubleshooting, making difficult decisions based on business impact, deploying appropriate fixes to resolve root cause(s), and driving automations for prevention of recurring issues through managing multiple workstreams and/or resources required for incident resolution (e.g., product teams and owners, organization leadership, engineering teams). 
  • Drives post-mortems and shares insights related to highly complex incidents and their resolution through postmortem reports and regular review meetings to identify opportunities to adopt similar solutions that can prevent incident recurrence in similar systems, platforms, and products across organizations. 
  • Applies end-to-end expertise in service and/or system design, interactions between technology layers and components, functions of infrastructure, and dependencies at scale.
  • Maintains advanced knowledge and expertise as technology landscape evolves, leveraging industry norms and deep understanding to drive the adoption of innovative solutions across the team. 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
