LLM Global Data - LLM Coding Trainer Intern - 2025 Start

44 Minutes ago • Upto 1 Years • Data Analyst

About the job

Job Description

As an LLM Global Data - LLM Coding Trainer Intern at ByteDance, you'll be a core member of the LLM Global Data Team, gaining hands-on experience in training Large Language Models (LLMs). You'll work on quality assurance, developing case studies for coding data challenges (with occasional math), collaborating with engineers to identify effective coding data, conducting data research, and gaining experience in human feedback data production. The internship involves contributing to data generation and quality assurance initiatives, potentially leading to larger-scale projects. The role requires proficiency in programming languages (Python, Java, Go, C) and strong problem-solving and communication skills.
Must have:
  • Bachelor's degree in CS or related field
  • Proficiency in Python, Java, Go, or C
  • Strong communication & problem-solving skills
  • Quality assurance and case study development
  • Collaboration with engineers and product managers
Good to have:
  • Experience with large codebases
  • Algorithm optimization skills
  • Operations and technical writing experience
  • Leadership and mentoring skills
  • Interest in LLMs, human behavior, and UX
Responsibilities
About ByteDance Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Why Join Us Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us. About the team As a core member of our LLM Global Data Team, you will be at the heart of our coding operations. This role offers a unique opportunity to gain first-hand experience in understanding the intricacies of training Large Language Models (LLMs) with diverse data sets. Through our carefully designed rotation program, you will witness how different verticals of high-quality data are meticulously crafted and used. Upon completion, you will contribute to initiatives in coding for data generation and quality assurance, paving your way to lead, train, or oversee large-scale coding data QA and operation projects. Your Role Will Involve: 1. Perform quality assurance and develop case studies to tackle intricate data challenges involving coding, with occasional work in mathematics. 2. Collaborate with product managers and algorithmic engineers to identify the most effective coding data for improving our LLMs. 3. Engage in data research to craft strategic insights that guide our data production. 4. Gain direct experience in human feedback data production to understand the synergy between humans and data in LLM training. Candidates can apply to a maximum of two positions and will be considered for jobs in the order you apply. The application limit is applicable to ByteDance and its affiliates' jobs globally. Applications will be reviewed on a rolling basis - we encourage you to apply early. Successful candidates must be able to commit to at least 3 months long internship period. Successful candidates must be able to commit to either of the following Internships - From January to June - From May to August (Summer)
Qualifications
Minimum Qualifications 1. Bachelor's degree in Computer Science, Information Science or a related technical discipline 2. Proficiency in one or more programming languages, including but not limited to Python, Java, Go, and C. 3. Strong communication and problem-solving skills; effective execution and enforcement; and adeptness in document writing. Preferred Qualifications: 1. Experience with large scale codebases or advanced coding skills in algorithm optimisation. 2. Experience in operations and technical writing. 3. Proven leadership skills, including mentoring team members and facilitating the swift onboarding of new hires. 4. Strong sense of responsibility and the ability to adapt to a high-intensity work environment. 5. A deep interest in LLMs, human behaviour, and user experience. The ideal candidate is an enthusiastic learner who finds engagement with diverse case studies and annotators stimulating. Note: This role requires a paper test prior to interviews. ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too. By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here: https://jobs.bytedance.com/en/legal/privacy. If you have any questions, please reach out to us at apac-earlycareers@bytedance.com
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Where imagination meets innovation, delivering limitless gaming experiences.

View All Jobs

Get notified when new jobs are added by ByteDance

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Dun & Bradstreet - 2025 Summer Internship Program - Technology

Dun & Bradstreet, United States (On-Site)

Epic Games - Senior Backend Engineer

Epic Games, United States (On-Site)

Guidewire Software - Site Reliability Engineer - Cloud Platform

Guidewire Software, India (Hybrid)

Luxoft - Java Technical Support L2 Engineer

Luxoft, India (On-Site)

Microsoft - Principal Software Engineer- AI Search

Microsoft, United States (On-Site)

Dream Sports - SDE 2 - React Native

Dream Sports, India (On-Site)

N-iX - Senior Node.JS Engineer (#2555)

N-iX, Poland (Remote)

Get notifed when new similar jobs are uploaded

Jobs in Singapore

ByteDance - Database Administrator - Game

ByteDance, Singapore (On-Site)

Limit Break - Sr. Mobile Game Designer

Limit Break, Singapore (On-Site)

ByteDance - Lark Integrated Marketing Intern - 2025 Start

ByteDance, Singapore (On-Site)

ByteDance - Food Safety Manager, APAC

ByteDance, Singapore (On-Site)

The Walt Disney Company - Intern, Public Relations, Disney+ – May to Aug 2025

The Walt Disney Company, Singapore (On-Site)

Tencent - Senior Big Data Solution Architect

Tencent, Singapore (On-Site)

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Rockstar Games - Senior Data Scientist, GTA+ Subscriptions

Rockstar Games, United States (On-Site)

Notion - Software Engineer, Data Platform

Notion, United States (On-Site)

Wargaming - Game Data Analyst (World of Warships)

Wargaming, Czechia (Hybrid)

PwC - Senior Associate - D&A - GDC

PwC, India (On-Site)

Ericsson - Data Scientist

Ericsson, India (On-Site)

Animoca Brands - Digital Asset Researcher

Animoca Brands, Hong Kong (On-Site)

Token Metrics - Crypto Senior Backend Engineer (Remote)

Token Metrics, Colombia (Remote)

Get notifed when new similar jobs are uploaded