Network Engineer, High Performance GPU Network Direction - Ashburn, VA

2 Months ago • 3 Years + • Network Engineering • Undisclosed

About the job

Job Description

ByteDance is seeking a skilled Network Engineer with a focus on High Performance GPU Network Direction to join our team in Ashburn, VA. You'll play a crucial role in designing, validating, implementing, and operating our global HPC networks, working closely with cross-functional teams to drive innovation and evolution. This position requires expertise with HPC network topologies, including RDMA over Converged Ethernet (RoCE) or InfiniBand (IB), and a strong understanding of network protocols like TCP/IP, DHCP, BGP, OSPF/IS-IS, and MPLS. You'll also be involved in building HPC networks, ensuring the reliability of ByteDance's global network by participating in on-call rotation, and collaborating with external vendors to explore cutting-edge architecture and next-generation technology.
Must have:
  • Bachelor's in CS/related field
  • 3+ years experience
  • HPC network topology expertise
  • Network protocol understanding
  • HPC network building experience
  • Self-driven, good communication skills
Good to have:
  • Experience with RoCE or InfiniBand
Responsibilities
About ByteDance Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Why Join Us Creation is the core of ByteDance's purpose. Our products are built to help imagination thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact-for ourselves, our company, and the users we serve. About the Team ByteDance Networking brings together innovative ideas and technologies from network architecture, software-defined networking (SDN), network virtualization, switch software and hardware co-design, and high-speed networking, to create hyperscale data-center networking solutions that power several of the most popular apps of the world such as Douyin and TikTok which serve hundreds of millions of users around the globe. ByteDance Networking is responsible for designing, building, and operating the global, intelligent network infrastructure to meet the requirements of high availability, scalability, and high-performance. By joining this team, you will gain marketable software development and/or network operation experience in data center networking on a massive scale. Responsibilities - Responsible for the design, validation, implementation and operation of ByteDance's global high performance computing (HPC) networks. - Work with cross-functional teams, including but not limited to machine learning (ML), compute and storage, driving the innovation and evolution of the HPC network. - Work closely with external vendors to explore state-of-the-art architecture and next-gen technology. - Build software and tools to improve the reliability and availability of HPC network infrastructure. - Ensuring the reliability of ByteDance global network by participating in on-call rotation.
Qualifications
Minimum Qualifications - Bachelor's in Computer Science, Information Science, Engineering, Mathematics, or a related field, or experience equivalent to a Bachelor's degree. - 3 years of working experience and above. - Expertise with HPC network topologies, like RDMA over Converged Ethernet (RoCE) or InfiniBand (IB). - Good understanding of network protocols including TCP/IP, DHCP, BGP, OSPF/IS-IS and MPLS related technologies. - Experience with building HPC networks. - Be self-driven, possess good communication and written skills. Preferred Qualifications - Candidates with experience in high-performance computing (HPC) network topologies, particularly those familiar with Remote Direct Memory Access over Converged Ethernet (RoCE) or InfiniBand (IB), are preferred. About ByteDance ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too. ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/cdpT2
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Where imagination meets innovation, delivering limitless gaming experiences.

Bangkok, Bangkok, Thailand (On-Site)

Jakarta, Jakarta, Indonesia (On-Site)

San Jose, California, United States (On-Site)

Seoul, South Korea (Hybrid)

San Jose, California, United States (On-Site)

Singapore (On-Site)

Seoul, South Korea (On-Site)

View All Jobs

Get notified when new jobs are added by ByteDance

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Rockstar Games - Online System Administrator

Rockstar Games, India (On-Site)

Penumbra - IT Systems Engineer, Operations

Penumbra, United States (On-Site)

SparkCognition - Senior IT Cloud Engineer

SparkCognition, India (On-Site)

InvenioLSI - SAP Senior Cloud Engineer (US Citizen)

InvenioLSI, United States (On-Site)

The Walt Disney Company - System Engineer

The Walt Disney Company, India (On-Site)

Trendyol - Teknoloji Destek Uzmanı

Trendyol, Türkiye (Hybrid)

Get notifed when new similar jobs are uploaded

Jobs in Ashburn, Virginia, United States

Zoox - Software Engineering  - Returnship

Zoox, United States (Hybrid)

Epic Games - Senior Outsource Artist

Epic Games, United States (On-Site)

Fanatics - Data Analyst III

Fanatics, United States (Hybrid)

Crunchyroll - Senior Marketing Manager, eCommerce (Contract)

Crunchyroll, United States (Hybrid)

CD PROJEKT RED - Senior Cinematic Animator

CD PROJEKT RED, United States (Hybrid)

The Walt Disney Company - Embroidery Specialist - Full Time, Walt Disney World

The Walt Disney Company, United States (On-Site)

Lirio - Senior Cloud Engineer

Lirio, United States (Remote)

Get notifed when new similar jobs are uploaded

Network Engineering Jobs

The Walt Disney Company - Senior Network Operations Engineer

The Walt Disney Company, United States (On-Site)

Zones - Manager, Information Technology

Zones, United States (Hybrid)

The Walt Disney Company - Senior Network Peering Engineer

The Walt Disney Company, United States (On-Site)

Tencent - Senior Software Engineer - Network

Tencent, China (On-Site)

Meta - Network Production Engineer

Meta, United States (On-Site)

Bohemia Interactive - Engine Network Programmer Prague/Brno

Bohemia Interactive, Czechia (On-Site)

The Walt Disney Company - Network Operations II

The Walt Disney Company, United States (On-Site)

Get notifed when new similar jobs are uploaded