Meta

AI Research Scientist, Media Data Research - MSL FAIR

Meta  •  Menlo Park, CA (Onsite)  •  2 days ago

Job Description

Meta is seeking AI research scientists to help us build the data foundation for Meta's most advanced Large Language and Media Models. We're looking for researchers with LLM/LMM expertise to join us in working with data at scale and pushing beyond the data ceiling. Our team contributes to data curation across all stages of LLM/LMM development (pre-training, mid-training, post-training) and all domains/modalities (image, video, agent, media perception and generation). We are tackling complex challenges at trillion-scale, including organic data curation, synthetic data generation, agent and interaction data, and frontier paradigms that redefine what is possible.

Based in Meta Superintelligence Labs (MSL) within the Fundamental AI Research Organization (FAIR), you'll directly contribute to Meta’s frontier models like Llama, while having the chance to collaborate with researchers and engineers across MSL.

Responsibilities
* Collaborate with cross-functional teams to develop Meta’s next foundational models
* Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
* Fundamentally improve our data velocity across workflows and projects by contributing to quality in data tooling
* Execute on high priority projects in pre-training, mid-training, or post-training data curation
* Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
* Lead complex technical projects end-to-end

Qualifications
* Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
* PhD in Computer Science or a related technical field
* 1+ year of industry research experience in LLM/LMM, computer vision, or related AI/ML models
* Experience owning and/or driving complex technical projects end-to-end
* Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
* Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
* Experience working on frontier-quality/state-of-the-art Large Language or Large Media Models
* First-author publications at top peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV)
* Programming experience in Python and hands-on experience with frameworks like PyTorch or Spark, or related distributed computing frameworks (Ray, DataFlow)
* Familiarity with SQL and table and file formats, such as Hive, Iceberg, Parquet, etc.

About Meta

Meta's mission is to build the future of human connection and the technology that makes it possible.

Our technologies help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology.

For a full listing of our jobs, visit https://www.metacareers.com

Industry: IT & Software
Company Size: 10,000+ employees
Headquarters: Menlo Park, CA
Year Founded: 2004
Website: meta.com