XPENG

强化学习算法实习生

XPENG  •  Onsite  •  2 months ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

强化学习算法实习生上海实习研发 - 算法职位描述1、研发面向自动驾驶的强化学习算法,解决奖励设计、环境交互、安全约束与样本效率等核心挑战;
2、设计奖励模型与仿真环境,构建从虚拟训练到真实迁移(Sim-to-Real)的可靠路径;
3、探索离线强化学习、模仿学习与RL的融合方案,提升算法在复杂交通场景中的泛化能力;
4、参与真实车端/机端的算法部署与闭环验证,推动RL在物理世界的落地;
5、与数据,仿真,infra团队协同,构建高效的RL开发框架,提升模型迭代效率。职位要求1、27届-28届毕业同学,计算机/自动化/机器人等相关专业,硕博优先;
2、对物理AI有强烈兴趣,愿意深入解决RL落地的真实难题;
3、扎实的强化学习基础,熟悉PPO、GRPO、SAC等算法,有MuJoCo、Isaac Gym、CARLA等仿真平台经验;
4、有基于Autoregression、diffusion、flow matching 生成式模型算法经验者优先;
5、熟悉大模型微调(LoRA, DPO, SFT),有VLA/VLM模型训练实际经验者优先;
6、熟练掌握Python/C++,具备算法工程化与调试能力;
7、有自动驾驶公司、机器人公司等RL算法实习或全职经验优先。 投递
XPENG

About XPENG

XPeng is a leading Chinese Smart EV company that designs, develops, manufactures, and markets Smart EVs that appeal to the large and growing base of technology-savvy middle-class consumers. Its mission is to drive Smart EV transformation with technology and data, shaping the mobility experience of the future. In order to optimize its customers’ mobility experience, XPeng develops in-house its full-stack advanced driver-assistance system technology and in-car intelligent operating system, as well as core vehicle systems including powertrain and the electrical/electronic architecture. XPeng is headquartered in Guangzhou, China. In 2021, the Company established its European headquarters in Amsterdam, along with other dedicated offices in Copenhagen, Munich, Oslo, and Stockholm.The Company’s Smart EVs are mainly manufactured at its plant in Zhaoqing and Guangzhou,Guangdong province.

For more information, please visit https://heyxpeng.com.

Industry
Automotive & Mobility
Company Size
1,001-5,000 employees
Headquarters
Guangzhou, CN
Year Founded
2014
Social Media