XPENG

LLM算法实习生(具身大脑方向)

XPENG  •  Onsite  •  11 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

LLM算法实习生(具身大脑方向)深圳实习互联网 / 电子 / 网游职位描述【关于我们】
我们致力于探索基于 大模型作为具身大脑,能够让机器人在复杂环境中完成 长程、实时的交互任务。
我们关注的机器人在真实或模拟环境中持续分析、决策和行动的能力:模型需要根据多轮反馈维护历史状态,在动态变化的环境中进行任务规划,并在时间约束下做出合理决策。
【职位描述】
1、搭建长程实时交互任务的仿真任务:面向导购、厨房、家庭服务、工厂异常处理等场景,设计具有多步骤依赖、环境不确定性、动态反馈和时间约束的交互任务,用于激发和评测模型的具身决策能力;
2、研究基于环境交互的 agentic RL 训练:构造适用于 SFT / RL / RLHF / Agentic RL 的交互轨迹数据,设计环境反馈、奖励信号和任务验证机制,提升模型在复杂交互任务中的成功率和泛化能力;
3、撰写技术报告与研究总结:跟踪 LLM Agent、Embodied AI、Agentic RL、机器人基础模型等前沿进展,整理实验结果和技术分析,参与团队内部技术讨论与外部研究交流。职位要求1、具备扎实的机器学习基础和强悍的编码能力,能熟练使用 PyTorch;
2、了解大模型或者强化学习中的至少一个方向;
3、对 LLM Agent、多轮交互、长程任务规划、工具调用或机器人智能感兴趣;
4、具备较强的问题抽象能力,能够从真实场景中提炼出可交互、可验证、可扩展的任务。
【加分项】
1、有 ICML、ICLR、NeurIPS、ACL、CVPR 等顶级学术会议发表过有影响力研究成果的优先;
2、在 ACM/ICPC、NOI/IOI、Kaggle 等编程/AI 比赛获奖者优先;
3、主导、参与过 AI 相关的有大影响力的开源/闭源项目的优先。 投递
XPENG

About XPENG

XPeng is a leading Chinese Smart EV company that designs, develops, manufactures, and markets Smart EVs that appeal to the large and growing base of technology-savvy middle-class consumers. Its mission is to drive Smart EV transformation with technology and data, shaping the mobility experience of the future. In order to optimize its customers’ mobility experience, XPeng develops in-house its full-stack advanced driver-assistance system technology and in-car intelligent operating system, as well as core vehicle systems including powertrain and the electrical/electronic architecture. XPeng is headquartered in Guangzhou, China. In 2021, the Company established its European headquarters in Amsterdam, along with other dedicated offices in Copenhagen, Munich, Oslo, and Stockholm.The Company’s Smart EVs are mainly manufactured at its plant in Zhaoqing and Guangzhou,Guangdong province.

For more information, please visit https://heyxpeng.com.

Industry
Automotive & Mobility
Company Size
1,001-5,000 employees
Headquarters
Guangzhou, CN
Year Founded
2014
Social Media