Data Engineer - Python & Databricks

2 Months ago • 5 Years + • Data Analyst

Job Summary

Job Description

As a Data Engineer Developer, you will design, develop, and maintain data pipelines using Python and Databricks to process large-scale datasets. You'll collaborate with data scientists, analysts, and business stakeholders. Responsibilities include data pipeline development (batch and real-time), ETL processes, data integration from various sources, performance optimization, data validation, cloud integration (AWS, Azure, or Google Cloud), automation & scheduling, and documentation. The role requires building scalable solutions enabling advanced analytics and reporting, working with large datasets, and ensuring data integrity and quality.
Must have:
  • 5+ years Data Engineering experience with Python expertise
  • Databricks or similar big data platform experience
  • Strong understanding of data pipelines, ETL, data integration
  • Cloud platform experience (AWS, Azure, or GCP)
  • SQL proficiency and relational/non-relational database experience
  • Experience with big data technologies (Spark, Kafka, Hadoop)
  • Data modeling, warehousing, and database design knowledge
  • Git and CI/CD pipeline experience
Good to have:
  • Delta Lake, Lakehouse architecture experience
  • Machine learning and data science workflow familiarity
  • DevOps or DataOps experience
  • Terraform, Docker, or Kubernetes knowledge
  • Data governance, privacy regulations (GDPR, CCPA) knowledge

Job Details

Project description

As a Data Engineer Developer, you will design, develop, and maintain data pipelines using Python and Databricks to process large-scale data sets. You will collaborate with data scientists, analysts, and business stakeholders to gather data requirements and build efficient, scalable solutions that enable advanced analytics and reporting.

Responsibilities

Data Pipeline Development: Design, develop, and implement scalable data pipelines using Python and Databricks for batch and real-time data processing.

ETL Processes: Build and maintain ETL (Extract, Transform, Load) processes to gather, transform, and store data from multiple sources.

Data Integration: Integrate structured and unstructured data from various internal and external sources into data lakes or warehouses, ensuring data accuracy and quality.

Collaboration: Work closely with data scientists, analysts, and business teams to understand data needs and deliver efficient solutions.

Performance Optimization: Optimize the performance of data pipelines and workflows to ensure efficient processing of large data sets.

Data Validation: Implement data validation and monitoring mechanisms to ensure data quality, consistency, and reliability.

Cloud Integration: Work with cloud platforms like AWS, Azure, or Google Cloud to build and maintain data storage and processing infrastructure.

Automation & Scheduling: Automate data pipelines and implement scheduling mechanisms to ensure timely and reliable data delivery.

Documentation: Maintain comprehensive documentation for data pipelines, processes, and best practices.

Skills

Must have

5+ years of experience as a Data Engineer with strong expertise in Python.

Bachelor's degree in Computer Science, Data Engineering, or a related field (or equivalent experience).

Hands-on experience with Databricks or similar big data platforms.

Strong understanding of data pipelines, ETL processes, and data integration techniques.

Experience with cloud-based platforms such as AWS, Azure, or Google Cloud, particularly with services like Data Lakes, S3, or Azure Blob Storage.

Proficiency in SQL and experience with relational and non-relational databases.

Familiarity with big data technologies like Apache Spark, Kafka, or Hadoop.

Strong understanding of data modeling, data warehousing, and database design principles.

Ability to work with large, complex datasets, ensuring data integrity and performance optimization.

Experience with version control tools like Git and CI/CD pipelines for data engineering.

Excellent problem-solving skills, attention to detail, and the ability to work in a collaborative environment.

Nice to have

Experience with Delta Lake, Lakehouse architecture, or other modern data storage solutions.

Familiarity with machine learning and data science workflows.

Experience with DevOps or DataOps practices.

Knowledge of Terraform, Docker, or Kubernetes for cloud infrastructure automation.

Familiarity with data governance, data privacy regulations (e.g., GDPR, CCPA), and data security best practices.

Other

Languages

English: B2 Upper Intermediate

Seniority

Regular

Similar Jobs

SSC Technologies - UI Technical Lead (Angular) – Product and Innovation Team

SSC Technologies

Bucharest, Bucharest, Romania (On-Site)
3 Months ago
Zeta - Senior Software Development Engineer in Test

Zeta

Mumbai, Maharashtra, India (On-Site)
3 Months ago
Intele health - Data Scientist

Intele health

Maharashtra, India (Remote)
4 Months ago
Playrix - Lead Unity Software Engineer (Gameplay)

Playrix

Serbia (Remote)
3 Months ago
ION - Data Engineer, Italy

ION

Italy (Hybrid)
3 Months ago
Postman - Senior Product Analyst

Postman

San Francisco, California, United States (On-Site)
3 Months ago
EXUSIA - Data Governance Developer - Collibra & Ab Initio

EXUSIA

India (Remote)
3 Months ago
PwC - IN-Associate _ Data Analyst _Captive Financial Services_Advisory_Mumbai

PwC

Mumbai, Maharashtra, India (On-Site)
3 Months ago
PwC - Senior Associate -ETL Testing_D&A_Advisory_Kolkata

PwC

Kolkata, West Bengal, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

WaveApps - Senior Manager, AI & Data Platform

WaveApps

Toronto, Ontario, Canada (Remote)
3 Months ago
Varonis  - DevOps Support Engineer

Varonis

Herzliya, Tel Aviv District, Israel (On-Site)
3 Months ago
Ajmera Infotech - React Developer

Ajmera Infotech

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Extreme Network - Staff Backend Developer (Python, Microservices, GenAI - 92890)

Extreme Network

Toronto, Ontario, Canada (Remote)
3 Months ago
Cisco - Senior Software Engineer - C, Linux, L2, L3 Networking, Sonic, Control Plane

Cisco

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Accurate - Manager, Software Engineering

Accurate

Hyderabad, Telangana, India (Hybrid)
3 Months ago
Lytx,  Inc  - Performance Automation Test Engineer

Lytx, Inc

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Playrix - Golang Tech Lead

Playrix

Ukraine (Remote)
2 Months ago
Nisum - Application Support (IOS) - W5712

Nisum

Hyderabad, Telangana, India (Hybrid)
3 Months ago
Luxoft - Automation Tester

Luxoft

New Delhi, Delhi, India (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Chennai, Tamil Nadu, India

Aptiv - Android Audio - Technical Lead

Aptiv

Bengaluru, Karnataka, India (On-Site)
6 Months ago
PwC - SAP - BODS - Senior Associate-Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
3 Months ago
BOT VFX - Environment Artist - Contractual

BOT VFX

Chennai, Tamil Nadu, India (On-Site)
3 Months ago
Nasdaq - Senior Analyst - Quality Assurance, FinTech

Nasdaq

Mumbai, Maharashtra, India (On-Site)
3 Months ago
PwC - Senior Associate_Azure Data Engineer_Data & Analytics_Advisory_PAN  India

PwC

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Paytm - Agency Engagement - Mumbai/Delhi

Paytm

Mumbai, Maharashtra, India (On-Site)
3 Months ago
Rambus - Lead MTS Systems Engineering

Rambus

Bengaluru, Karnataka, India (On-Site)
4 Months ago
PwC - IN-Manager_SAP ABAP_Enterprise Apps SAP_Advisory_Pan India

PwC

Pune, Maharashtra, India (On-Site)
3 Months ago
Rocket - Technical Support Engineer

Rocket

Bengaluru, Karnataka, India (On-Site)
5 Years ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

eBay - DataScience Lead-Global Consumer Insight

eBay

San Jose, California, United States (Hybrid)
4 Months ago
OKX - Head of Compliance Data Analytics

OKX

San Jose, California, United States (On-Site)
3 Months ago
PwC - ETIC, Data Solution Architect - Senior Manager

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
3 Months ago
The Walt Disney Company - Sr Machine Learning Engineer

The Walt Disney Company

San Francisco, California, United States (On-Site)
2 Months ago
Meta - Data Scientist, Product Analytics STE 18 Month Contract

Meta

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
Google - Strategy and Operations Analyst Lead, Go-To-Market

Google

Reston, Virginia, United States (On-Site)
3 Months ago
PublicisGroupe - Copy of Senior Associate Data Engineering L2 DE - Big Data AWS

PublicisGroupe

Hyderabad, Telangana, India (On-Site)
3 Months ago
Voodoo - Senior Data Analyst - Growth

Voodoo

Paris, Île-de-France, France (Hybrid)
2 Months ago
The Walt Disney Company - Lead Data Engineer, Data Reliability

The Walt Disney Company

Seattle, Washington, United States (On-Site)
3 Months ago
HoYoverse - Data Engineer - Fresh Grad

HoYoverse

Singapore (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Luxoft, a DXC Technology Company (NYSE: DXC), is a digital strategy and software engineering firm providing bespoke technology solutions that drive business change for customers the world over. Acquired by U.S. company DXC Technology in 2019, Luxoft is a global operation in 44 cities and 21 countries with an international, agile workforce of nearly 18,000 people. It combines a unique blend of engineering excellence and deep industry expertise, helping over 425 global clients innovate in the areas of automotive, financial services, travel and hospitality, healthcare, life sciences, media and telecommunications.

DXC Technology is a leading Fortune 500 IT services company which helps global companies run their mission critical systems. Together, DXC and Luxoft offer a differentiated customer-value proposition for digital transformation by combining Luxoft’s front-end digital capabilities with DXC’s expertise in IT modernization and integration. Follow our profile for regular updates and insights into technology and business needs.

Gothenburg, Västra Götaland County, Sweden (On-Site)

New Delhi, Delhi, India (Remote)

Poland, Ohio, United States (Remote)

Kraków, Lesser Poland Voivodeship, Poland (On-Site)

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)

Ukrainka, Kyiv Oblast, Ukraine (Remote)

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)

Bengaluru, Karnataka, India (On-Site)

Chennai, Tamil Nadu, India (On-Site)

View All Jobs

Get notified when new jobs are added by Luxoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug