Sr. Datadog Developer

16 Hours ago • All levels • DevOps

Job Summary

Job Description

As a Sr. Datadog Developer at Rackspace Technology, you'll be a key member of the Managed Public Cloud software development team, collaborating globally on various projects including cloud integrated services, customer interaction platforms, and backend business systems. You'll collaborate with product teams, architect production-ready software, and champion observability. Agile development, TDD, code reviews, and DevOps practices are essential. You will lead research, design, and implement monitoring strategies using Datadog, troubleshoot complex issues, and communicate insights to stakeholders. Experience with Datadog features (dashboards, monitors, logs, alerting), IaC tools (Terraform/CloudFormation), and scripting languages (Python, Bash, PowerShell) is required.
Must have:
  • In-depth Datadog knowledge
  • Datadog agent configuration
  • Custom metrics, traces, logs
  • IaC experience (Terraform/CloudFormation)
  • Scripting (Python, Bash, PowerShell)
  • Observability principles
  • Agile development & DevOps
  • Excellent communication skills
Good to have:
  • Custom Datadog integrations
  • Experience with AWS CloudWatch, Kubernetes, Azure Monitor
  • Security best practices in monitoring

Job Details

At Rackspace Technology, we are experts in multi-cloud solutions. Our deep technical expertise with leading technologies and multi-cloud environments—spanning applications, data, and security—enables businesses to grow, increase efficiency, and drive innovation. We don’t just solve workload problems; we create competitive advantages by empowering you to work faster, smarter, and stay ahead of the curve.

Key Responsibilities

    • Be a key member of the Managed Public Cloud software development team, collaborating globally.
    • Work on a variety of projects including cloud integrated services, customer interaction platforms, and backend business systems.
    • Collaborate with Product teams to assess functional requirements for new offerings, analyze technical feasibility, and coordinate task assignments with agility to deliver innovative software.
    • Proven ability to architect production ready software with minimal direction, prioritizing system observability.
    • Strong background in agile development and project planning, including TDD and code reviews.
    • Establish and adhere to coding and process best practices, including conducting code reviews.
    • Regularly contribute to engineering standards and best practices, motivating the team to deliver their best work.
    • Lead research, proof of concept, and prototype efforts within the project team.
    • Gain support for complex architectures and negotiate solution/architectural tradeoffs.
    • Write and review design documents and actively participate in project discussions.
    • Work within a DevOps culture, including participating in on call rotations and maintenance schedules.

Skills

    • In depth knowledge of Datadog features such as dashboards, monitors, log management, and alerting.
    • Expertise in setting up and configuring Datadog agents across various environments (e.g., AWS, Azure, GCP, on-premise).
    • Experience with creating and managing custom metrics, traces, and logs.
    • Ability to integrate Datadog with various services and tools (e.g., AWS CloudWatch, Kubernetes, Azure Monitor, etc).
    • Experience in automating monitoring and alerting setups using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
    • Proficiency in scripting languages such as Python, Bash, or PowerShell to create custom checks, scripts, and automation tools.
    • Experience with developing custom Datadog integrations and plugins.
    • Strong understanding of observability principles, including metrics, logging, and tracing.
    • Ability to design and implement monitoring strategies that provide deep visibility into system performance and application health.
    • Experience with monitoring application and infrastructure performance using Datadog.
    • Skills in identifying performance bottlenecks and optimizing system performance based on Datadog insights.
    • Experience in setting up and managing alerts, anomaly detection, and incident response workflows in Datadog.
    • Ability to troubleshoot complex issues by analyzing metrics, logs, and traces in Datadog.
    • Ability to work with cross functional teams to define monitoring requirements and implement Datadog solutions.
    • Strong communication skills to convey insights and recommendations to both technical and nontechnical stakeholders.
    • Knowledge of security best practices related to monitoring, such as ensuring data privacy and compliance with regulatory requirements.
    • Experience with integrating Datadog into security operations, including monitoring for security threats and vulnerabilities.
    • Commitment to continuously improving monitoring setups, staying updated with Datadog’s latest features and best practices.
    • Ability to lead or contribute to efforts to enhance observability across the organization.
    • Excellent oral and written English communication skills.
undefined

Similar Jobs

The Walt Disney Company - System Application Development & Sustainment Analyst

The Walt Disney Company

Orlando, Florida, United States (On-Site)
1 Week ago
PhonePe - SRE - Big Data (OnPrem)

PhonePe

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Patterned Learning Career - Lead Python AWS Developer

Patterned Learning Career

(Remote)
3 Days ago
Larian Studios - DevOps Build Engineer

Larian Studios

Dublin, County Dublin, Ireland (On-Site)
3 Months ago
Playrix - Senior Release Support Engineer

Playrix

Ireland (Remote)
3 Months ago
Trend Micro - (Sr.) Software Engineer in Linux

Trend Micro

Taipei City, Taiwan (On-Site)
4 Months ago
SparkCognition - Senior DevOps Engineer

SparkCognition

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Info Stretch - Senior Engineer

Info Stretch

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Rackspace Technology - Lead Engineer - Multi-Cloud Platforms and Infrastructure

Rackspace Technology

United States (Remote)
1 Month ago
Sperasoft - Release Engineer

Sperasoft

(Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Nintendo - Security Engineer

Nintendo

Redmond, Washington, United States (Hybrid)
2 Months ago
Dream Sports - Director System IT

Dream Sports

Mumbai, Maharashtra, India (On-Site)
2 Months ago
Microsoft - Linux security and Release Management Engineer

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
SKYDANCE - Mid Systems Administrator

SKYDANCE

Madrid, Community Of Madrid, Spain (Hybrid)
1 Week ago
Rackspace Technology - Senior DataDog Developer

Rackspace Technology

India (Remote)
3 Months ago
Blinkhealth - Senior Manager, Cloud Engineering

Blinkhealth

(Remote)
1 Day ago
Moon Active - IT Infrastructure & Cloud Engineer

Moon Active

Warsaw, Masovian Voivodeship, Poland (On-Site)
4 Days ago
UXBERT Labs - Senior DevOps Engineer

UXBERT Labs

Riyadh, Riyadh Province, Saudi Arabia (Hybrid)
2 Weeks ago
AbZorba Games  - Dev Ops Engineer

AbZorba Games

Athens, Greece (On-Site)
8 Months ago
Garena - Garena - Database Administator 遊戲資料維運工程師

Garena

Taipei City, Taiwan (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Mexico City, Mexico City, Mexico

Buckman - PT Account Manager

Buckman

Mexico (On-Site)
2 Months ago
Amber - Senior Concept Artist (Project Based)

Amber

Guadalajara, Jalisco, Mexico (Remote)
1 Week ago
Google - Software Engineer, Java and Kotlin Ecosystem

Google

Mexico City, Mexico City, Mexico (On-Site)
3 Months ago
Nielsen Holdings - Schedule Editor, Metadata Production

Nielsen Holdings

Mexico City, Mexico City, Mexico (Remote)
1 Month ago
Lion Bridge Games - Technical Test Associate

Lion Bridge Games

Mexico City, Mexico City, Mexico (On-Site)
3 Days ago
Brillio - QA Engineer - R01542503

Brillio

Guadalajara, Jalisco, Mexico (Hybrid)
3 Months ago
Nagarro - Associate Principal Engineer, Delivery

Nagarro

Mexico (Remote)
3 Months ago
Scale AI - Operations Specialist (New Grads)

Scale AI

Mexico City, Mexico City, Mexico (Remote)
4 Months ago
Paypal - Business Program Management

Paypal

Mexico City, Mexico City, Mexico (Hybrid)
3 Months ago
PwC - Associate 2 External Audit

PwC

Monterrey, Nuevo Leon, Mexico (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Lost Boys Interactive - DevOps - GoLang Developer

Lost Boys Interactive

(Remote)
3 Weeks ago
Ubisoft - Site Reliability Engineer [Game Security]

Ubisoft

Düsseldorf, North Rhine-Westphalia, Germany (Hybrid)
2 Weeks ago
N-iX - Senior Azure DevOps Engineer

N-iX

Poland (Remote)
1 Week ago
Global Payments  Inc  - Senior Site Reliability Engineer

Global Payments Inc

Pune, Maharashtra, India (On-Site)
4 Months ago
Luxoft - Murex Technical Developer - Lead

Luxoft

Toronto, Ontario, Canada (On-Site)
2 Months ago
CorroHealth - Site Reliability Engineer

CorroHealth

Noida, Uttar Pradesh, India (On-Site)
4 Months ago
Rackspace Technology - Sr Big Data Engineer Airflow and Oozie (GCP)

Rackspace Technology

United States (Remote)
1 Month ago
SuperPlay - DEVOPS ENGINEER

SuperPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Velotio Technologies - Lead Devops Engineer

Velotio Technologies

Maharashtra, India (Remote)
1 Month ago
Omnissa - Member of Technical Staff (C++ Windows)

Omnissa

Chennai, Tamil Nadu, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Mexico City, Mexico (Remote)

Mexico City, Mexico City, Mexico (Remote)

Alexandria, Alexandria Governorate, Egypt (Remote)

United States (Remote)

Mexico City, Mexico City, Mexico (Remote)

Gurugram, Haryana, India (Hybrid)

View All Jobs

Get notified when new jobs are added by Rackspace Technology

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug