ByteDance

Applied Machine Learning SRE Intern (AML) - 2026 Start (BS/MS)

ByteDance  •  Singapore, SG (Onsite)  •  2 months ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Team Introduction
The Site Reliability Engineering (SRE) team for the Applied Machine Learning (AML) organization is at the heart of what we do. We are a global team of engineers who blend deep systems knowledge with software engineering to build and run the large-scale, distributed systems that power our machine learning products. Our mission is to ensure that ByteDance's core machine learning services are reliable, scalable, and efficient.

As an SRE intern, you will be immersed in a fast-paced, high-impact environment. You will have the opportunity to work on real-world challenges, receive mentorship from experienced engineers, and contribute to systems that serve millions of users worldwide. This is a unique chance to develop your skills in coding, performance analysis, and large-scale system operations.

We are looking for talented individuals to join our team in 2026. As an intern, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at ByteDance.
Successful candidates must be able to commit to an onboarding date by end of year 2026. Please state your availability and graduation date clearly in your resume.
Candidates can apply to a maximum of two positions and will be considered for jobs in the order you apply. The application limit is applicable to ByteDance and its affiliates' jobs globally. Applications will be reviewed on a rolling basis - we encourage you to apply early.

Responsibilities
As an SRE intern, you will partner with your mentor and team members to support our core infrastructure. Your work will focus on improving the reliability and performance of our services. Key responsibilities include:
- Assisting in the design and development of software and tools to enhance system automation, monitoring, and operational efficiency.
- Participating in troubleshooting and resolving system issues, analyzing root causes, and implementing preventative measures.
- Contributing to the enhancement of existing software by updating capabilities and supporting testing and validation procedures.
- Collaborating with software and hardware engineers to understand system requirements and contribute to performance improvements.
- Learning and applying principles of computer science, engineering, and mathematical analysis to solve real-world problems.
ByteDance

About ByteDance

ByteDance is a global incubator of platforms at the cutting edge of commerce, content, entertainment and enterprise services - over 2.5bn people interact with ByteDance products including TikTok.

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.

Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. We are committed to building a safe, healthy and positive online environment for all our users.

We have over 110,000 employees based in more than 30 countries globally. Join us.

Industry
IT & Software
Company Size
10,000+ employees
Headquarters
China, CN
Year Founded
Unknown
Social Media