TikTok

Senior LLM Evaluation Researcher - TikTok

TikTok  •  San Jose, CA (Onsite)  •  4 hours ago

Job Description

About the Team

We are looking for a passionate and detail-oriented specialist to join our AI experience team. In this role, you will be responsible for defining and driving the evaluation framework for Tako's AI-powered features, ensuring our large language model (LLM) responses meet the highest standards of quality, relevance, and user satisfaction.

Responsibilities

- Develop a deep understanding of LLM capabilities and stay current with the latest research in model evaluation; apply both qualitative and quantitative user research methods to define the ideal response-quality standard for AI across diverse use cases.

- Own the end-to-end online experience quality of Tako; design and build a comprehensive evaluation framework by integrating internal expert assessments, crowdsourced testing, and LLM-based automated evaluation; identify experience gaps and translate findings into prioritized, actionable improvement recommendations for the team.

- Collaborate with international operations teams to drive the execution of evaluation programs, including the maintenance and curation of evaluation datasets, as well as the routine execution and analysis of benchmark assessments.

About TikTok

Inspire Creativity and Bring Joy

Industry: Arts & Entertainment
Company Size: 10,000+ employees
Headquarters: Los Angeles, California
Year Founded: Unknown