ByteDance

Research Engineer Graduate (AI Training Systems & RL Infrastructure - Seed Infra) - 2026 Start (PhD)

ByteDance  •  $233k - $428k/yr  •  Seattle, WA (Onsite)  •  16 hours ago
Apply
AI can make mistakes so check important info. Chat history is never stored.
58
AI Success™

Job Description

About the team
The Seed Infrastructures team oversees the distributed training, reinforcement learning framework, high-performance inference, and heterogeneous hardware compilation technologies for AI foundation models.

We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at our Company.

Successful candidates must be able to commit to an onboarding date by end of year 2026. Please state your availability and graduation date clearly in your resume.

Responsibilities
- Conduct research and development on large-scale AI infrastructure to support efficient training and post-training of foundation models, multimodal LLMs, and image/video generation models.
- Design and optimize distributed training strategies, including data/model/tensor/pipeline/expert parallelism, computation–communication overlap, and large-scale GPU cluster scaling.
- Prototype and improve end-to-end reinforcement learning (RL) training systems, covering rollout generation, policy optimization, evaluation, and iterative deployment workflows.
- Build scalable and fault-tolerant infrastructure that operates reliably under dynamic workloads and heterogeneous compute environments.
- Analyze performance bottlenecks across the training stack (e.g., networking, scheduling, GPU memory management), and develop principled optimization approaches to improve throughput, efficiency, and stability.
- Develop tooling, monitoring, debugging, and observability frameworks to ensure reliability of large-scale training and RL systems.
- Collaborate with researchers and engineers on system–algorithm co-design, translating research prototypes into scalable, production-ready infrastructure systems.

The base salary range for this position in the selected city is $232560 - $427500 annually.
ByteDance

About ByteDance

ByteDance is a global incubator of platforms at the cutting edge of commerce, content, entertainment and enterprise services - over 2.5bn people interact with ByteDance products including TikTok.

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.

Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. We are committed to building a safe, healthy and positive online environment for all our users.

We have over 110,000 employees based in more than 30 countries globally. Join us.

Industry
IT & Software
Company Size
10,000+ employees
Headquarters
China, CN
Year Founded
Unknown
Social Media