Site Reliability Engineer II

1 Month ago • 3 Years + • DevOps

Job Summary

Job Description

As a Site Reliability Engineer II at Microsoft's Azure Data group, you'll ensure the smooth and reliable operation of Azure SQL Database and Managed Instance. This involves designing and implementing automated solutions for resolving issues, collaborating with feature teams on reliable releases, and possessing a holistic understanding of the cloud service. Responsibilities include acting as a subject matter expert for troubleshooting and monitoring, implementing automation to reduce incidents, designing solutions for improved service health, and investigating and resolving service issues. You will also work on diagnostics infrastructure, maintain documentation, define processes, interact with customers, mentor team members, and participate in on-call rotations. The role requires deep expertise in SQL Server/Azure SQL Database, Windows OS concepts, debugging tools, and programming languages like C# or Java.
Must have:
  • 3+ years software development or SQL support experience
  • Debugging skills using Windbg, Visual Studio, Xperf
  • Troubleshooting experience in SQL Server/Azure SQL Database
  • Deep understanding of Windows OS concepts
  • Proficient in C#/Java programming
  • Experience working with multiple teams
Good to have:
  • Troubleshooting experience in Azure SQL Database or Managed Instance
  • Automation-related experience
  • Git experience
  • Experience running large-scale online systems
  • Familiarity with x86/x64 assembly languages and Managed IL
  • Practical Powershell, Java, Python development experience
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Overview

At Microsoft, our mission is to empower every person and every organization on the planet to achieve more.  As a member of our engineering team within Azure Data group, you will play an integral part in making that happen, navigating us into the future and impacting the lives of people all around the world.

 

Job Purpose:

 

Are you passionate about building state of the art software and running it as a service?   Do you enjoy debugging and solving complex problems? Do you want to work in a world class engineering team with an exceptional record of innovation and execution that has made us one of the largest businesses within Microsoft? If yes, keep reading…

The SQL Database team delivers world class relational database server technologies used across the globe on everything from small devices, desktops, and servers to some of the largest Cloud deployments in the world. Everywhere you look, the amount of data being generated in the world is soaring. With this dramatic growth, the importance of the data platform continues to grow tremendously. Data is at the heart of the Cloud. Winning in the Cloud requires rich data services that provide deep insights from data. All of this must be done while supporting high availability, predictability, and elasticity; Supporting 100s of terabytes of data; ability to both scale up and scale out; supporting 1000’s of users on a single machine or a query that utilizes 1000’s of cores. Does this sound like an interesting problem? If yes, talk to us in the SQL team. We have made a big bet on our super-successful business winning in the Cloud. We will win by building the world’s best data management Cloud platform and transforming how people use databases. But to do that, we need your help. Our competition is fierce, and our aspirations are bold!

 

Running software as a service means more than just developing and releasing features.  Ensuring reliability and serviceability is critical part of software cycle.  This is where you come into the picture.   As a Site Reliability Engineer, you will ensure the service of Azure SQL Database or Managed Instance runs smoothly with required reliability and availability.  You will design and implement software to automatically resolve issues.   You will work closely with feature teams to design, implement and release features that are reliable and serviceable.   You will be a cross-domain expert who has a holistic view of our cloud service

 

This is an opportunity to work with some of the best engineers in the industry to continue to innovate and deliver Azure SQL Database and Managed Instance for the Cloud. The challenges span the entire stack of database technology - connectivity, high availability, programming language, query processing, transaction processing and data management involving 100s of nodes.   You will learn what it takes to deploy and run software as a 24x7 enterprise grade cloud service!

 

We are the Azure Data team which is part of the C+AI Azure organization.  This is a once in a lifetime opportunity to be part of a very agile team to tackle hard problems involving distributed systems, and service highly impactful features at a rapid pace.

Qualifications

Qualifications

Required Qualifications:

 

  • 3+ years of software development or SQL product support experience.
  • 3+ years of using debugging tools such as Windbg, Visual Studio and Xperf to debug user dumps or live applications.
  • Demonstrated troubleshooting skills in SQL Server/Azure SQL Database with deep understanding in one or more of the following areas:
  • Query Processing
  • Storage Engine
  • SQL Operating System (SQL OS) layer (threading, scheduling, and memory management)
  • High Availability
  • Replication
  • Connectivity
  • Deep understanding of Windows Operating System level concepts such as processes, threading, memory allocation, and the network stack; understanding of how applications are affected by the above, and ability to debug same.
  • Demonstrated influence outside their own immediate team without managerial authority
  • Experience in working with multiple teams, including coordinating large projects, communicating well, and exercising initiative when presented with problems.
  • Proficient programming skills using managed code such as C#/Java. Ability to read native C/C++ code to debug issues and find answers not documented.
  • BS/MS in Computer Science, Engineering and/or equivalent industry experience is preferred.

 

Preferred Qualifications:

 

  • Troubleshooting experience in Azure SQL Database or Managed instance.
  • Industry experience in software development with automation-related experience.
  • Source control software such as git.
  • Practical experience running large scale online systems is always an advantage.
  • Familiarity withx86 and x64 assembly languages and Managed IL
  • Familiarity with CodeQL, Code Coverage and other quality capabilities.
  • Knowledge of various memory management designs and applications.
  • Skilled with troubleshooting approaches such as a Bayesian network.
  • Practical Powershell, Java, Python development experience.
  • Experience with:
  • AI and Machine Learning skills
  • Live event streams
  • State machines, Event and Health ModelsMulti-machine deployment operations.
  • Large scale testing
  • Kusto query language
  • Data Warehousing and Reporting

Responsibilities

Responsibilities

  • Act as subject matter expert for configuring, troubleshooting, and monitoring Azure Database/Managed Instance services.
  • Identify opportunities and implement automation to resolve and reduce live-site incidents.
  • Design and implement solutions to improve service health, manageability, reliability, and telemetry.
  • Design and implement diagnostics infrastructure to improve product quality and customer satisfaction.
  • Own, triage, investigate, and resolve service issues with emphasis on broad communications, learning, and teaching throughout the process.
  • Author and maintain functional and technical documentation. Define and maintain process and procedures to run enterprise service.
  • Interact with customers as result of escalation from support for issues including performance and availability.
  • Mentor, develop and deliver training.
  • Ability to meet on-call responsibilities periodically.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

Straumann Group - System Engineer

Straumann Group

Bengaluru, Karnataka, India (On-Site)
3 Months ago
The Walt Disney Company - Data Engineer II, Ad Intelligence

The Walt Disney Company

Glendale, California, United States (On-Site)
1 Day ago
Luxoft - Java Expert Software Developer

Luxoft

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)
2 Months ago
Meta - Research Scientist Intern, Language and Multimodal Research for MetaAI (PhD)

Meta

Menlo Park, California, United States (On-Site)
3 Months ago
Saviynt - Sr. Engineer, IGA Operations

Saviynt

El Segundo, California, United States (Hybrid)
3 Months ago
OtherSide Entertainment - Senior Online Engineer

OtherSide Entertainment

United States (Remote)
2 Weeks ago
Rackspace Technology - Site Reliability Engineer / Observability Engineer

Rackspace Technology

Giza, Giza Governorate, Egypt (Remote)
1 Month ago
The Walt Disney Company - Principal Data Engineer

The Walt Disney Company

Santa Monica, California, United States (On-Site)
6 Days ago
Flexera - Senior Site Reliability Engineer

Flexera

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
Meltwater - Backend & Cloud Engineer – Javascript

Meltwater

Hyderabad, Telangana, India (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Attentive - Senior Software Engineer, Platform

Attentive

United States (Remote)
3 Months ago
Warner Bros Discovery - Staff Software Engineer in Test - (GQA Team),Bangalore

Warner Bros Discovery

Bengaluru, Karnataka, India (On-Site)
2 Months ago
bosh group india - MS/ECO-XC_Android_Automotive_Expert

bosh group india

Bengaluru, Karnataka, India (On-Site)
2 Months ago
S&P Global - Senior SQA Analyst

S&P Global

Hyderabad, Telangana, India (On-Site)
3 Months ago
The Walt Disney Company - Senior Database Engineer - Oracle

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
3 Months ago
BlackRock - Linux System Engineer -Vice President

BlackRock

Gurugram, Haryana, India (Hybrid)
4 Months ago
Riot Games - Principal Software Engineer, Product Tech-Lead - Unpublished R&D Product

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Microsoft - Software Engineer II

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
Luxoft - Backend Java Developer

Luxoft

New Delhi, Delhi, India (Remote)
2 Months ago
Applike Group - Tech Lead (Playtime Supply) (f/m/d)

Applike Group

Hamburg, Hamburg, Germany (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

PwC - IN-Senior Associate_IICS Devreloper_Data & Analytics_Advisory_PAN India

PwC

Hyderabad, Telangana, India (On-Site)
1 Month ago
Garena - Data Analyst

Garena

Pune, Maharashtra, India (On-Site)
2 Months ago
MLVeda - Senior Software Engineer

MLVeda

Ahmedabad, Gujarat, India (On-Site)
7 Months ago
Rackspace Technology - Director Software Engineering GTM Application- India

Rackspace Technology

Gurugram, Haryana, India (Hybrid)
2 Months ago
PwC - Associate - Kolkata Y-14 - Technology Consulting

PwC

Kolkata, West Bengal, India (On-Site)
4 Months ago
JIFFYai - STAFF ENGINEER SRE

JIFFYai

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Unity - Payroll Specialist

Unity

Karnataka, India (Hybrid)
5 Months ago
InMobiInMobi - Chief Of Staff - People Strategy & Operations

InMobiInMobi

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Acceldata - Senior Product Support Engineer - Cloud Support

Acceldata

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Enphase Energy - Sr. Staff Engineer Cloud

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

DEVOTEAM - Distributed Cloud | AWS DevOps Engineer

DEVOTEAM

Lisbon, Lisbon, Portugal (Remote)
3 Months ago
Warner Bros Discovery - Staff Software Engineer

Warner Bros Discovery

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Razer - Lead Site Reliability Engineer

Razer

Shanghai, Shanghai, China (On-Site)
4 Months ago
Omnissa - Staff Engineer (C++ Windows)

Omnissa

Chennai, Tamil Nadu, India (On-Site)
3 Months ago
CapSpire - Senior Consultant – Endur Technical

CapSpire

Bengaluru, Karnataka, India (Remote)
2 Months ago
ION - Cloud Engineer/Architect (DevOps)

ION

London, England, United Kingdom (On-Site)
3 Months ago
Microsoft - Support Engineer (Azure DevOps and Developer Support)

Microsoft

Seoul, South Korea (Remote)
2 Weeks ago
Microsoft - Software Engineer II - DevOps

Microsoft

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Playtech - Product Operations Team Leader

Playtech

Kyiv, Kyiv City, Ukraine (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

London, England, United Kingdom (Hybrid)

London, England, United Kingdom (On-Site)

Jakarta, Jakarta, Indonesia (On-Site)

Gurugram, Haryana, India (On-Site)

Prague, Prague, Czechia (On-Site)

Montreal, Quebec, Canada (On-Site)

Dublin, County Dublin, Ireland (On-Site)

London, England, United Kingdom (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug