Data Reliability Engineer

2 Weeks ago • Up to 10 Years

About the job

Summary by Outscal

Bungie seeks a Data Reliability Engineer to design, deploy, and maintain highly available data infrastructure, including Kafka, RabbitMQ, Redis, Elasticsearch, and Graphite. You'll troubleshoot issues, ensure data security, and collaborate with engineering teams on projects and services. Must have experience with Linux, infrastructure automation, and distributed production environments.

Data Reliability Engineering at Bungie is a core team of the Central Tech area that keeps our games and tooling running at scale. Our team owns the overall scalability, observability, and resilience of the databases, data processing platforms, and in-memory key-value stores used throughout the Bungie ecosystem. We partner with our engineering teams and business units on projects, services, designs, and processes. We are the stewards of architecture and provide tools and services that enable engineering teams to meet their design requirements.

RESPONSIBILITIES

  • Design, deploy, and maintain highly available and scalable data infrastructure components including Kafka, RabbitMQ, Redis, Elasticsearch, and Graphite
  • Perform capacity planning and scalability assessments for data platforms
  • Troubleshoot and resolve issues related to data processing pipelines, message queuing, and performance, including participation in an on-call rotation
  • Ensure data security, integrity, and compliance with industry best practices and regulatory requirements
  • Document system configurations, procedures, and operational knowledge
  • Advise service owners on industry and company standards and best practices
  • Maintain reliability and performance levels for core data platform infrastructure
  • Develop and implement the data observability strategy
  • Develop and document the data ownership strategy

REQUIRED SKILLS

  • Strong understanding of Linux operating systems and their administration
  • Effective communication skills and the ability to collaborate well in a team environment
  • Experience with infrastructure automation and configuration management (e.g., Ansible, Terraform…)
  • Excellent troubleshooting skills and the ability to analyze and resolve complex infrastructure resource and application deployment issues
  • Experience working in a distributed production environment
  • Deep understanding of cluster management areas, such as scaling, consistency tuning, replication, and multi-datacenter configuration
  • Familiarity with time-series monitoring systems & tools (e.g., Datadog, Prometheus, Grafana and ELK)
  • Experience designing and implementing logging and metric pipelines
