Job Description

Meta is seeking AI Research Scientists to join the Safety Alignment team within Meta Superintelligence Labs, dedicated to advancing the safe development and deployment of superintelligent AI. Our mission is to pioneer robust safety alignment techniques that empower Meta’s most ambitious AI capabilities, ensuring billions of users experience our products and services securely and responsibly.

Responsibilities
Design, implement, and evaluate novel safety alignment techniques for large language models and multimodal AI systems
* Create, curate, and analyze high-quality datasets for safety alignment
* Fine-tune and evaluate LLMs to adhere to Meta’s safety policies and evolving global standards
* Build scalable infrastructure and tools for safety evaluation, monitoring, and rapid mitigation of emerging risks
* Work closely with researchers, engineers, and cross-functional partners to integrate safety alignment into Meta’s products and services
* Lead complex technical projects end-to-end

Qualifications
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
* PhD in Computer Science, Machine Learning, or a relevant technical field
* 3+ years of industry research experience in LLM/NLP, computer vision, or related AI/ML model training
* Experience as a technical lead on a team and/or leading complex technical projects from end-to-end
* Publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
* Programming experience in Python and hands-on experience with frameworks such as PyTorch Hands-on experience applying RL techniques (e.g., RLHF, PPO, DPO, GRPO, RLVF, reward modeling) to fine-tune large language models for safety and policy adherence
* Experience developing, fine-tuning, or evaluating LLMs across multiple languages and modalities (text, image, voice, video)
* Demonstrated experience to innovate in safety alignment, including custom guideline enforcement, dynamic policy adaptation, and rapid hotfixing of model vulnerabilities
* Experience designing, curating, and evaluating safety datasets, including adversarial and borderline prompt pairs for risk mitigation
* Experience with distributed training of LLMs (hundreds/thousands of GPUs), scalable safety mitigations, and automation of safety tooling

About Meta

Meta's mission is to build the future of human connection and the technology that makes it possible.

Our technologies help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology.

To help create a safe and respectful online space, we encourage constructive conversations on this page. Please note the following:

• Start with an open mind. Whether you agree or disagree, engage with empathy.

• Comments violating our Community Standards will be removed or hidden. Please treat everybody with respect.

• Keep it constructive. Use your interactions here to learn about and grow your understanding of others.

• Our moderators are here to uphold these guidelines for the benefit of everyone, every day.

• If you are seeking support for issues related to your Facebook account, please reference our Help Center (https://www.facebook.com/help) or Help Community (https://www.facebook.com/help/community).

For a full listing of our jobs, visit https://www.metacareers.com

Industry

IT & Software

Company Size

10,000+ employees

Headquarters

Menlo Park, CA

Year Founded

2004

Website

meta.com

Social Media

AI Research Scientist - Safety Alignment Team

Job Description

About Meta