Big Data Engineer - CapCut

2 Months ago • 5-7 Years • Data Analyst

Job Summary

Job Description

CapCut, a leading video editing app, seeks a Big Data Engineer to build and maintain its data warehouse. Responsibilities include designing and optimizing ETL jobs for batch and streaming data, developing data products, collaborating with business teams on data metrics and dashboards, and implementing data system changes. The role requires proficiency in distributed computing engines (e.g., Spark, Flink), NoSQL databases (e.g., HBase), and data modeling. The ideal candidate will have 5+ years of software engineering experience and 2+ years in data engineering, with experience in building and maintaining complex, reliable, and secure ETL pipelines. The engineer will leverage data mining, modeling, NLP, and machine learning techniques to extract insights from large datasets. This role involves translating business requirements into technical implementations and contributing to the continuous improvement of CapCut's data infrastructure.
Must have:
  • 5+ years software engineering, 2+ years data engineering
  • Proficient in ETL pipeline creation and maintenance
  • Experience with distributed computing engines (Spark, Flink)
  • Data warehouse design and modeling experience
  • Data product development and maintenance
Good to have:
  • NoSQL database experience (e.g., HBase)
  • Excellent communication and collaboration skills

Job Details

Responsibilities
About CapCut CapCut is an all-in-one video editing app that empowers creators to express themselves and transform videos into creative masterpieces. In addition to its basic features, such as video editing, text, stickers, filters, colors and music, CapCut offers free advanced features, including keyframe animation, smooth slow-motion effects, chroma key, Picture-in-Picture (PIP), and stabilization to help you capture and snip moments. Why Join Us Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us. How we become the NO.1 video editing app CapCut is an all-in-one video editing solution that helps you create incredible videos. With the mission of making content creation easier and more engaging, CapCut was first launched on mobile platforms in April 2020. In less than a year, CapCut was released in Brazil, US, Indonesia, Japan and 170 countries/regions. To better serve the diverse needs, CapCut released its online and PC version in 2022. Starting in 2023, CapCut has continued to invest in AI technology to provide product features that are more accessible and easier to use. As of today, CapCut's global monthly users have exceeded 500 million. It has remained at the top of the download list in several app stores. Team Introduction We are an incredible team with passion. We enjoy learning new things and taking on challenges. Team culture here is open and inclusive. Everyone can make things happen, good ideas will always win. Today, we are continuously increasing investment in AI technology to make content creation much easier for everyone, and we always take the protection of user privacy and data security very seriously. In CapCut , our goal is to build a Data Warehouse that can cater to batch and streaming data, Data Products that provide useful information to build efficient data metrics & dashboards which will be used to make smarter business decisions to support business growth. If you're looking for a challenging ground to push your limits, this is the team for you! - Translate business requirements & end to end designs into technical implementations and responsible for building batch and real-time data warehouse - Manage data modeling design, writing, and optimizing ETL jobs - Collaborate with the business team to building data metrics based on data warehouse - Responsible for building and maintaining data products - Involvement in rollouts, upgrades, implementation, and release of data system changes as required for streamlining of internal practices - Develop and implement techniques and analytics applications to transform raw data into meaningful information using data-oriented programming languages and visualisation software. - Apply data mining, data modelling, natural language processing, and machine learning to extract and analyse information from large structured and unstructured datasets. - Visualise, interpret, and report data findings and may create dynamic data reports as well.
Qualifications
Minimum qualifications - 5 years in software engineering and 2 years of relevant experience in data engineering. - Proficient in creating and maintaining complex ETL pipeline end-to-end while maintaining high reliability and security. Preferred qualifications - Familiar with data warehouse concept and have production experience in modeling design. - Familiar with at least 1 distributed computing engine (e.g. Hive, Spark, Flink). - Familiar with at least 1 NoSQL database is a plus (e.g. HBase). - Excellent interpersonal and communication skills with the ability to engage and managing internal and external stakeholders across all levels of seniority . - Strong collaboration skills with the ability to build rapport across teams and stakeholders. ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Similar Jobs

Bally's Interactive - Data Engineer

Bally's Interactive

(On-Site)
3 Months ago
Netflix - Data Engineer (L5) - Customer Service

Netflix

United States (Remote)
3 Months ago
Barracuda Networks  Inc  - Senior Site Reliability Engineer

Barracuda Networks Inc

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Rackspace Technology - Presales Data Science Architect – AWS Cloud

Rackspace Technology

Aguascalientes, Aguascalientes, Mexico (On-Site)
3 Months ago
Activision - Stagiaire - Coordonateur.trice de produit

Activision

San Francisco, California, United States (On-Site)
3 Months ago
Barracuda Networks  Inc  - Senior Machine Learning Engineer

Barracuda Networks Inc

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Nagarro - Staff Engineer

Nagarro

Sri Lanka (Remote)
3 Months ago
Luxoft - Senior Data Engineer/Analyst

Luxoft

Zürich, Zurich, Switzerland (On-Site)
1 Month ago
Publicis Groupe - Growth Data Analyst

Publicis Groupe

(Hybrid)
2 Months ago
Luxoft - Data Business Analyst

Luxoft

(On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ION - Internship - Data Science

ION

Milan, Lombardy, Italy (On-Site)
4 Months ago
Yahoo - Sr Software Engineer

Yahoo

Ireland (Remote)
4 Months ago
Egnyte - Sr Product Manager - AI/ML

Egnyte

India (Remote)
1 Month ago
PwC - Data Engineer - Financial Crime team

PwC

Prague, Prague, Czechia (On-Site)
4 Months ago
Microsoft - Principal Applied Scientist

Microsoft

Mountain View, California, United States (On-Site)
1 Month ago
Trend Micro - Data Scientist

Trend Micro

Manila, Metro Manila, Philippines (On-Site)
15 Years ago
Dream11 - Senior Security Engineer - Application Security

Dream11

Mumbai, Maharashtra, India (On-Site)
6 Months ago
Fluence - Battery Data Engineer

Fluence

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
HP - HP Spark Management Associate - Supplies Process Engineer

HP

Singapore, Singapore (On-Site)
1 Month ago
Match Group - Senior Data Scientist (Product Analytics)

Match Group

Palo Alto, California, United States (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Singapore

OKX - (Senior/Principal) Product Manager, Blockchain Explorer

OKX

Singapore, Singapore (On-Site)
3 Months ago
PwC - Tax NewLaw - Associate

PwC

Singapore (On-Site)
4 Months ago
ByteDance - Operation Team Lead - Training Operation (Safety)

ByteDance

Singapore (On-Site)
3 Months ago
ByteDance - Software Engineer (Messaging Middleware), Cloud Infrastructure

ByteDance

Singapore (On-Site)
3 Months ago
Ubisoft - Senior Financial Planning Analyst - Projects

Ubisoft

Singapore (Hybrid)
3 Months ago
Axinous - Marketing Executive Program Manager, APJ

Axinous

Singapore, Singapore (On-Site)
1 Month ago
ByteDance - Backend Software Engineer (SRE) Intern - 2025 Start

ByteDance

Singapore (On-Site)
1 Month ago
Interactive Brokers - Fraud Prevention Analyst

Interactive Brokers

Singapore (Hybrid)
4 Months ago
Limit Break - Sr. Mobile Game Designer

Limit Break

Singapore, Singapore (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Duolingo - Marketing Technology Manager

Duolingo

New York, New York, United States (On-Site)
4 Months ago
PwC - IN_Senior Associate_PBI Custom Chat Developer_Data &  Analytics_Advisory_PAN India

PwC

Kolkata, West Bengal, India (On-Site)
4 Months ago
CloudHire - Data Labeler

CloudHire

Bengaluru, Karnataka, India (Remote)
3 Months ago
The Walt Disney Company - Data Engineer F/H/NB - CDI

The Walt Disney Company

Montévrain, Île-de-France, France (On-Site)
2 Months ago
Keywords Studios (Player Support) - Associate Data Scientist

Keywords Studios (Player Support)

Nottingham, England, United Kingdom (Hybrid)
1 Month ago
Inkittt - Product Analyst

Inkittt

San Francisco, California, United States (Hybrid)
1 Month ago
Aristocrat Gaming - Sr Project Coordinator

Aristocrat Gaming

Gurugram, Haryana, India (Hybrid)
2 Months ago
Microsoft - Principal Software Engineering Manager

Microsoft

Barcelona, Catalonia, Spain (On-Site)
1 Month ago
Thinkproject - Data Scientist (m/f/d)

Thinkproject

Pune, Maharashtra, India (Hybrid)
4 Months ago
Equivalent Jobs - ML ENGINEER

Equivalent Jobs

(Remote)
2 Months ago

Get notifed when new similar jobs are uploaded