Middle Data Engineer (Python)

46 Minutes ago • 2 Years + • Data Analyst

Job Summary

Job Description

This role involves designing and implementing data pipelines using Python and PySpark within an Azure environment. Responsibilities include collecting, cleaning, and transforming data from various sources; building and maintaining data storage and processing systems (databases, data warehouses, data lakes); adhering to data governance policies; collaborating with data analysts and scientists; participating in code reviews and performance tuning; working with Big Data Solution Architects to optimize data ingestion; ensuring solutions meet production-ready standards; and participating in daily project meetings. The project focuses on a large-scale data transformation to improve data accuracy, consistency, and accessibility for reporting, analytics, and machine learning.
Must have:
  • 2+ years big data experience
  • Python & PySpark proficiency
  • 2+ years Azure experience
  • Data querying & manipulation skills
  • Unit & integration testing
  • CI/CD pipeline experience
  • Excellent communication skills
Good to have:
  • Data visualization tools (SSRS, Power BI)
  • Machine learning knowledge
  • Data privacy regulation knowledge (GDPR, CCPA)
Perks:
  • Flexible working format
  • Competitive salary
  • Personalized career growth
  • Professional development tools
  • Education reimbursement
  • Corporate events

Job Details

We are looking for a Middle Big Data Engineer (Python+Azure) to join our team!

Client Overview:
Our client is involved in a large-scale Data Transformation project, with a focus on solidifying the foundation of their data operations. They are aiming to ensure that data is accurate, consistent, and available at critical times to support their business needs. 

Project Objectives:
The project aims to build and maintain robust data pipelines, scalable storage systems, and efficient processing mechanisms using Azure technology. 

The goal is to support the client's data-driven decision-making by ensuring clean, transformed, and readily accessible data for reporting, analytics, and machine learning across the organization.

Responsibilities:

  • Design and implement data pipelines to collect, clean, and transform data from various sources.
  • Build and maintain data storage and processing systems, including databases, data warehouses, and data lakes.
  • Follows data governance policies and procedures.
  • Collaborate with data analysts, data scientists, and other stakeholders to understand and meet their data needs.
  • Participate in code reviews, performance tuning, and best practice discussions within the team and brain-storm session
  • Work with Big Data Solution Architects to design, prototype, implement, and optimize data ingestion pipelines.
  • Ensure solutions are production-ready in terms of operational, security, and compliance standards.
  • Participate in daily project and agile meetings, providing technical support for issue resolution.
  • Communicate clearly and concisely with the business about item status and blockers.
  • Maintain comprehensive knowledge of the client's data landscape.

Requirements:

  • 2+ years of design & development experience with big data technologies.
  • Proficiency in Python and PySpark.
  • 2+ years of development experience in cloud technologies like Azure.
  • Strong skills in querying and manipulating data from various databases (relational and big data).
  • Experience in writing effective and maintainable unit and integration tests for ingestion pipelines.
  • Familiarity with static analysis and code quality tools, and experience building CI/CD pipelines. 
  • Excellent communication, problem-solving, and leadership skills.
  • Experience working on high-traffic and large-scale software products.

Nice to Have:

  • Experience with data visualization tools (e.g., SSRS, Power BI).
  • Knowledge of machine learning algorithms and their applications in big data.
  • Familiarity with data privacy regulations (e.g., GDPR, CCPA).

We offer:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

Similar Jobs

Head Digital Works - Data Scientist

Head Digital Works

Hyderabad, Telangana, India (On-Site)
• 6 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model, Speech & Audio) - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
• 3 Months ago
Digital Extremes - Senior Graphics Programmer

Digital Extremes

London, Ontario, Canada (Remote)
• 3 Months ago
Fluxon - Staff Software Engineer

Fluxon

Hyderabad, Telangana, India (Remote)
• 4 Months ago
Google - Senior Software Engineer, Multiplatform, Core

Google

(On-Site)
• 3 Months ago
N-iX - SENIOR DATA ENGINEER (DATABRICKS) (#2703)

N-iX

Poland (Remote)
• 1 Month ago
Balbix - Staff AI Engineer

Balbix

Bengaluru, Karnataka, India (On-Site)
• 4 Months ago
ByteDance - Seed - LLM Performance Operation Analyst (Non-safety)

ByteDance

Singapore (On-Site)
• 2 Months ago
Fortis Games - Analyst, People Analytics & Systems

Fortis Games

Canada (Remote)
• 2 Months ago
The Walt Disney Company - Vice President, Data Science

The Walt Disney Company

Santa Monica, California, United States (On-Site)
• 4 Days ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Dataquad  Inc  - Data Scientist

Dataquad Inc

Telangana, India (Hybrid)
• 6 Months ago
Google - Software Engineer, University Graduate, 2025

Google

Taipei City, Taiwan (On-Site)
• 3 Months ago
Netflix - Research Scientist (L6) - Identity Algorithms

Netflix

Los Gatos, California, United States (On-Site)
• 3 Months ago
Whoop - Staff Data Science Tech Lead (Training)

Whoop

Boston, Massachusetts, United States (On-Site)
• 4 Months ago
Blind Squirrel Games - Senior Generalist Engineer

Blind Squirrel Games

Austin, Texas, United States (Hybrid)
• 1 Month ago
Google - Staff Software Engineer, Infrastructure, Google Cloud Data Management

Google

Sunnyvale, California, United States (On-Site)
• 1 Month ago
Samsung Semiconductor - Intern, Architecture Research Engineer

Samsung Semiconductor

San Jose, California, United States (Hybrid)
• 1 Month ago
Tencent - Research Intern

Tencent

Palo Alto, California, United States (On-Site)
• 2 Days ago
Airlab Inc  - Gameplay Programmer (Mobile)

Airlab Inc

Montreal, Quebec, Canada (On-Site)
• 7 Months ago
Genies - Senior Software Engineer (3D Graphics)

Genies

Los Angeles, California, United States (On-Site)
• 5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Ukraine

PwC - Senior Associate - Accounting Advisory Services

PwC

Kyiv, Kyiv City, Ukraine (On-Site)
• 4 Months ago
N-iX - Talent Sourcer (Leadership Hiring)

N-iX

Ukraine (Remote)
• 2 Weeks ago
Luxoft - Java Intern

Luxoft

Kyiv, Kyiv City, Ukraine (On-Site)
• 3 Months ago
N-iX - Senior Automation (JS, Cypress) Test Engineer

N-iX

Ukraine (Remote)
• 1 Week ago
PwC - Senior market analyst in government and public sector practice

PwC

Kyiv, Kyiv City, Ukraine (On-Site)
• 4 Months ago
GoReel - Service Desk Specialist

GoReel

Kyiv, Kyiv City, Ukraine (On-Site)
• 3 Weeks ago
N-iX - Senior DevOps Engineer (Azure AD B2C)

N-iX

Ukraine (Remote)
• 2 Weeks ago
Luxoft - Data Engineer for Market Data Projects (with Streamlit Expertise)

Luxoft

Ukrainka, Kyiv Oblast, Ukraine (Remote)
• 3 Months ago
CloudHire - Shopify Developer - Remote

CloudHire

Ukraine (Remote)
• 4 Months ago
Every matrix - Middle Data QA Engineer (Python)

Every matrix

Lviv, Lviv Oblast, Ukraine (Hybrid)
• 2 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

The Walt Disney Company - Lead Data Engineer

The Walt Disney Company

Santa Monica, California, United States (On-Site)
• 2 Weeks ago
PwC - Senior Associate _Java Developer _Data & Analytics _Advisory _PAN India

PwC

Kolkata, West Bengal, India (On-Site)
• 4 Months ago
LeoVegas - Data Scientist - Sportsbook

LeoVegas

Stockholm, Stockholm County, Sweden (Hybrid)
• 2 Months ago
Wargaming - Game Data Analyst (World of Tanks)

Wargaming

Vilnius, Vilnius County, Lithuania (On-Site)
• 3 Months ago
Maersk Careers - Vendor Master Data Lead

Maersk Careers

Bengaluru, Karnataka, India (On-Site)
• 340 Years ago
Hasbro - Sr Manager, Digital and Entertainment Analytics

Hasbro

London, England, United Kingdom (On-Site)
• 2 Months ago
ByteDance - LLM Coding Trainer - Specialist

ByteDance

Singapore (On-Site)
• 3 Months ago
Modulate - Senior Data Engineer

Modulate

Somerville, Massachusetts, United States (Hybrid)
• 1 Month ago
PwC - Data Scientist

PwC

Brno, South Moravian Region, Czechia (On-Site)
• 3 Months ago
Next Level Business Services - BI Tech Project Manager - Full Time

Next Level Business Services

Redmond, Washington, United States (On-Site)
• 4 Months ago

Get notifed when new similar jobs are uploaded