Senior LLM Evaluation Researcher - TikTok

TikTok • San Jose, CA (Onsite) • 4 hours ago

Job Description

About the Team:

We are looking for a passionate and detail-oriented specialist to join our AI experience team. In this role, you will be responsible for defining and driving the evaluation framework for Tako's AI-powered features, ensuring our large language model (LLM) responses meet the highest standards of quality, relevance, and user satisfaction.

Responsibilities

- Develop a deep understanding of LLM capabilities and stay current with the latest research paradigms in model evaluation; apply both qualitative and quantitative user research methods to explore and define the ideal response-quality standard for AI across diverse use cases.

- Own the end-to-end online experience quality of Tako; design and build a comprehensive evaluation framework by integrating internal expert assessments, crowdsourced testing, and LLM-based automated evaluation; identify experience gaps and translate findings into prioritized, actionable improvement recommendations for the team.

- Collaborate with international operations teams to drive the execution of evaluation programs, including the maintenance and curation of evaluation datasets, as well as the routine execution and analysis of benchmark assessments.

About TikTok

Inspire Creativity and Bring Joy

Industry

Arts & Entertainment

Company Size

10,000+ employees

Headquarters

Los Angeles, California

Year Founded

Unknown

Website

tiktok.com
