NIO

AI Agent算法评测工程师

NIO  •  Onsite  •  4 hours ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

AI Agent算法评测工程师
上海、北京
社招
全职
数字技术
本科及以上
5-7 年
职位描述
搭建评测框架:构建针对RAG、Tool Calling、多步推理的自动化评测Pipeline,覆盖任务成功率与轨迹合理性。
裁判模型建设:落地LLM-as-a-Judge机制,解决打分偏差问题,实现Agent每日回归测试。
基准与数据集:维护业务评测黄金集,跟进行业Benchmark(如AgentBench),定期输出竞品对比报告。
缺陷归因:深度分析失败轨迹,区分是模型能力不足还是框架设计缺陷,推动底层优化。
职位要求
硬核技能:精通Python,熟悉Pytest,能独立搭建自动化测试工程。
懂大模型:熟悉RAG、Function Calling原理,擅长复杂Prompt设计与调优。
工程基础:熟悉Docker及CI/CD流水线,有数据分析(Pandas/可视化)能力。
加分项:有LLM-as-a-Judge实战经验;参与过AI应用的红队测试;熟悉LangChain或Dify源码。
投递
NIO

About NIO

NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO’s mission is to shape a sustainable and brighter future together. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.

NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.

NIO’s models for sale include the all-new smart electric flagship SUV ES8, the smart electric flagship coupe SUV EC7, the smart electric mid-large SUV ES7, the smart electric flagship sedan ET7, the all-new smart electric all-round SUV ES6, the all-new smart electric coupe SUV EC6, the smart electric mid-sized sedan ET5, and the smart electric tourer ET5T.

Industry
Unknown
Company Size
5,001-10,000 employees
Headquarters
Jiading, CN
Year Founded
Unknown
Website
nio.com
Social Media