Meta

Research Scientist, Multi-Modal

Meta  •  Pittsburgh, PA (Onsite)  •  1 hour ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Meta is seeking a creative, skilled and motivated Research Scientist to advance the state-of-the-art in multi-modal understanding. You will work on developing models that reason across vision, language, and other modalities to enable richer AI experiences across Meta's family of apps and products. You will collaborate with research scientists, software engineers, and data scientists to design technical solutions in a fast-paced multidisciplinary environment.

Responsibilities
Develop and advance multi-modal models that integrate vision, language, audio, and other modalities
* Research novel architectures and training methods for cross-modal reasoning and understanding
* Design and prototype interactive experiences that leverage multi-modal AI capabilities
* Collaborate across teams to develop concepts that advance the entire research pipeline (hardware, software, data collection, machine learning, etc.)
* Publish research findings at top-tier conferences and contribute to the broader research community

Qualifications
Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
* Currently has, or is in the process of obtaining, a PhD degree in Computer Science, Machine Learning, or relevant technical field. Degree must be completed prior to joining Meta
* Experience in multi-modal learning, combining vision, audio, language, or related areas
* Experience working with PyTorch or TensorFlow
* Experience with transformer architectures and large-scale model training
* Technical knowledge across machine learning, deep learning, and statistical modeling
* Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment First-authored publications at leading conferences such as NeurIPS, ICML, and CVPR, or similar
* Experience with large language models (LLMs) and their integration with other modalities
* Experience transferring multi-modal research into shipping products
* Experience working and communicating cross-functionally in a team environment
* Research experience in vision-language models, multi-modal transformers, or cross-modal representation learning
Meta

About Meta

Meta's mission is to build the future of human connection and the technology that makes it possible.

Our technologies help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology.

To help create a safe and respectful online space, we encourage constructive conversations on this page. Please note the following:

• Start with an open mind. Whether you agree or disagree, engage with empathy.

• Comments violating our Community Standards will be removed or hidden. Please treat everybody with respect.

• Keep it constructive. Use your interactions here to learn about and grow your understanding of others.

• Our moderators are here to uphold these guidelines for the benefit of everyone, every day.

• If you are seeking support for issues related to your Facebook account, please reference our Help Center (https://www.facebook.com/help) or Help Community (https://www.facebook.com/help/community).

For a full listing of our jobs, visit https://www.metacareers.com

Industry
IT & Software
Company Size
10,000+ employees
Headquarters
Menlo Park, CA
Year Founded
2004
Website
meta.com
Social Media