Tech Lead (SRE) - Cloud Infrastructure

2 Months ago • 5 Years + • DevOps

Job Summary

Job Description

ByteDance seeks a Tech Lead (SRE) for its Cloud Infrastructure team in Singapore. This role involves leading and mentoring a team of software and system engineers, establishing efficient processes, and collaborating with other teams. Responsibilities include team management, developing software systems, creating technical strategies, developing PoCs, establishing operational protocols (access management, disaster recovery), building automated monitoring frameworks, and collaborating with development teams to ensure system reliability. Candidates should have a Bachelor's degree in Computer Science or a related field, 5+ years of professional experience (including 3+ years in R&D), Linux/networking expertise, and experience with large-scale distributed systems. Cloud computing platform experience is preferred.
Must have:
  • 5+ years experience (3+ in R&D)
  • Bachelor's degree in CS or related field
  • Linux systems and networking proficiency
  • Large-scale distributed system management
  • Team and project management skills
Good to have:
  • Experience with large-scale distributed storage
  • Experience with cloud computing platforms
  • Experience in big data computing systems

Job Details

Responsibilities
ByteDance will be prioritising applicants who have a current right to work in Singapore and do not require ByteDance sponsorship of a visa.

About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

Team Introduction
The Site Reliability Engineering (SRE) team is a fusion of software and systems engineering techniques used to design and operate large-scale, extensively distributed, and resilient systems. Within Infrastructure SRE at TikTok, our primary focus is to ensure that the reliability and uptime of our infrastructure services meet the needs of our users and support rapid improvement iterations. Our software development efforts are deeply committed to optimising existing systems, constructing essential infrastructure, and streamlining operations through automation.

The Role
In the role of a Tech Lead, you will assume responsibility for guiding and assembling a team of software and system engineers, leveraging your exceptional technical leadership skills. Your role will involve establishing efficient processes for project execution and promoting sound engineering practices. Additionally, you will maintain regular coordination and communication with other infrastructure teams and our user community.

What you will be doing:
1. Establish and oversee the SRE team, which encompasses tasks such as team recruitment, the training of new talent, system operation and maintenance, coordination efforts, and fostering a cohesive team culture;
2. Oversee the acquisition and development of software systems in organisational units. Establish a comprehensive long-term technical strategy with well-defined implementation steps and milestones to continually enhance the team's competitiveness and technological capabilities;
3. Oversee the development of Proof-of-Concept/solutions and provide technical expertise on the development of software and platform features, ensuring that appropriate security and risk factors are considered;
4. Create protocols and strategies for critical aspects of the operating platform, including access management, configuration, disaster recovery, and fault handling;
5. Devise and implement software platforms and monitoring frameworks that promote efficient, automated, and intelligent governance within a service-oriented architecture (SOA);
6. Collaborate closely with the system development team to guarantee the reliability of systems from initial design through to launch. Consistently advance automated operations and maintenance facilities and platforms;
7. Foster improved communication and collaboration with business teams, enhance cross-team coordination, and persistently refine and optimize business processes. Drive the evolution of business architecture design.
Qualifications
What you should have:
  • At least a Bachelor's Degree in Computer Science or a closely related technical field, along with more than 5 years of professional experience (including at least 3 years in Research and Development);
  • Demonstrates a systematic approach to operations and maintenance, with proficiency in Linux systems and networking. Brings practical expertise in managing and maintaining large-scale distributed systems;
  • Self-motivated with strong planning and summarisation skills. Possesses a track record of project and team management;
  • Exhibits a high level of responsibility, a proactive team-oriented attitude, and exceptional problem-solving abilities;
  • Prior experience with extensive cloud-computing platforms is a plus.
  • Preferred qualifications include experience in the development of large-scale distributed storage, scheduling, big data computing systems, or intelligent operations and maintenance.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
