ByteDance

Student Researcher [LLM Post Training – Agent & Reinforcement Learning] - 2026 Start (PhD)

ByteDance  •  San Jose, CA (Onsite)  •  3 months ago
Expired

Job Description

About the team
The Seed LLM Post Training team researches cutting-edge post-training technologies and provides core post-training capabilities for unified multimodal large models. The team's goal is to explore next-generation techniques for the post-training phase, such as SFT, RM, RL, and self-learning, while significantly improving key areas including reasoning, coding, agents, and omni models.
PhD internships at ByteDance provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies. Our dynamic internship experience blends hands-on learning, enriching community-building and development events, and collaboration with industry experts.
Applications will be reviewed on a rolling basis - we encourage you to apply early. Please state your availability clearly in your resume (Start date, End date).

Responsibilities
- Develop generalized agents capable of solving complex real-world tasks through long-horizon reasoning, memory, and multi-turn interaction.
- Tackle the challenges of large-scale reinforcement learning, building systems that can scale across compute, data, and environments to improve model intelligence and alignment with human preferences.
- Advance agent capabilities in long-horizon, multi-step reasoning across diverse domains, aiming to match or surpass expert-level performance.
- Explore planning, tool use, and feedback mechanisms to enhance agent robustness and adaptability across domains.

About ByteDance

ByteDance is a global incubator of platforms at the cutting edge of commerce, content, entertainment and enterprise services - over 2.5bn people interact with ByteDance products including TikTok.

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.

Together, we inspire creativity and enrich life - a mission we strive to achieve every day. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. We are committed to building a safe, healthy and positive online environment for all our users.

We have over 110,000 employees based in more than 30 countries globally. Join us.

Industry: IT & Software
Company Size: 10,000+ employees
Headquarters: China, CN