Data Reliability Engineer

1 Week ago • Upto 10 Years
Create a profile and let recruiters contact you

About the job

SummaryBy Outscal

Bungie seeks a Data Reliability Engineer to design, deploy, and maintain highly available data infrastructure, including Kafka, RabbitMQ, Redis, Elasticsearch, and Graphite. You'll troubleshoot issues, ensure data security, and collaborate with engineering teams on projects and services. Must have experience with Linux, infrastructure automation, and distributed production environments.

Data Reliability Engineering at Bungie is a core team of the Central Tech area that keeps our games and tooling running at scale. Our team owns the overall scalability, observability and resilience of the databases, data processing platforms and in-memory key-value stores used throughout the Bungie ecosystem. We partner with our engineering teams and business units on projects, services, designs, and processes We are the stewards of architecture and provide tools and services to enable engineering teams to meet their design requirements.

RESPONSIBILITIES

  • Design, deploy, and maintain highly available and scalable data infrastructure components including Kafka, RabbitMQ, Redis, Elasticsearch, and Graphite
  • Perform capacity planning and scalability assessments for data platforms
  • Troubleshoot and resolve issues related to data processing pipelines, message queuing, and performance including participation in on-call rotation
  • Ensure data security, integrity, and compliance with industry best practices and regulatory requirements
  • Document system configurations, procedures, and operational knowledge
  • Advise service owners on industry and company standards and best practices
  • Maintain reliability and performance levels for core data platform infrastructure
  • Data observability strategy and implementation
  • Data ownership strategy and documentation

REQUIRED SKILLS

  • Strong understanding of Linux operating systems and their administration
  • Effective communication skills and ability to collaborate effectively in a team environment
  • Experience with infrastructure automation and configuration management (e.g., Ansible, Terraform…)
  • Excellent troubleshooting skills and the ability to analyze and resolve complex infrastructure resource and application deployment issues
  • Experience working in a distributed production environment
  • Deep understanding of cluster management areas, such as scaling, consistency tuning, replication, and multi-datacenter configuration
  • Familiarity with time-series monitoring systems & tools (e.g., Datadog, Prometheus, Grafana and ELK)
  • Experience designing and implementing logging and metric pipelines

About The Company

Explore gaming industy jobs in one of the leading Game Studios.

View All Jobs

Similar Jobs

VGW - Senior Site Reliability Engineer

Mecklenburg-Vorpommern, Germany (On-Site)

Electronic Arts - Site Reliability Engineer

Telangana, India (On-Site)

2K - Staff Site Reliability Engineer

California, United States (Hybrid)

Keywords Studios (Player Support) - Site Reliability Engineer (SRE) - Intermediate

County Dublin, Ireland (On-Site)

2K - Senior Site Reliability Engineer

California, United States (Hybrid)

Guerrilla - SENIOR SITE RELIABILITY ENGINEER

North Holland, Netherlands (On-Site)

2K - Staff Site Reliability Engineer

Texas, United States (Hybrid)

2K - Senior Site Reliability Engineer

Texas, United States (Hybrid)

Tencent - Senior Site Reliability Engineer

California, United States (On-Site)

Moon Active - Site Reliability Engineer

Masovian Voivodeship, Poland (On-Site)

Similar Skill Jobs

Infogain - iOS Developer (Senior)

Maharashtra, India (On-Site)

Electronic Arts - Software Engineer - UGX - Platform

British Columbia, Canada (On-Site)

Electronic Arts - Software Engineer - 12 month

British Columbia, Canada (On-Site)

Samsung Semiconductor - Senior Staff, Data Scientist

California, United States (Hybrid)

Conduent - Lead C++ Developer

Karnataka, India (On-Site)

Arrow Electronics - React JS

Gujarat, India (On-Site)

PlayStation Global - Systems Admin: Motion Capture Support

California, United States (On-Site)

Blizzard Entertainment - Principal Software Engineer, Server

California, United States (Hybrid)

Kyndryl - Batch & Console Monitoring Operator - AS400

Uttar Pradesh, India (Hybrid)

Evolution - QA Engineer (Game Team)

Masovian Voivodeship, Poland (Hybrid)

Software Engineering Jobs

DroneStark Technologies - Drone Firmware Engineer & Test Pilot

Maharashtra, India (On-Site)

Infogain - iOS Developer (Senior)

Maharashtra, India (On-Site)

Infogain - Frontend React Developer (Lead)

Karnataka, India (On-Site)

Infogain - Frontend VueJS Developer (Senior)

Maharashtra, India (On-Site)

Keywords Studios (Player Support) - Senior Artist - ZBrush

British Columbia, Canada (Hybrid)

Ubisoft - Tools Programmer - Snowdrop Paris - F/H/NB

Île-de-France, France (Hybrid)

Hacksaw Studios - Engineering manager

Stockholm County, Sweden (On-Site)

Electronic Arts - Mocap Operator

California, United States (On-Site)

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug