Job Description

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Snapshot

We are seeking highly motivated and innovative Research Scientists to join our team in Tokyo, focused on advancing the state-of-the-art in multilingual, multicultural, and multimodal large language models (LLMs). You will conduct cutting-edge research on Gemini, particularly in the multilingual, multicultural, and multimodal domain (speech, vision, and text), with a direct path to impacting billions of users through Google products. This role offers a unique opportunity to contribute to foundational research in multilingual and multimodal LLMs, with a special emphasis on unique challenges in the Asia-Pacific (APAC) region, while collaborating with a world-class team at Google DeepMind around the world. If you are passionate about shaping the future of human-computer interaction through multilingual and multimodal LLMs and are eager to make a significant impact on users in the APAC region and beyond, we encourage you to apply.

About us

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence and build Artificial General Intelligence (AGI). We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

The Tokyo position is based in the Frontier AI group at Google DeepMind Japan. The group conducts research and development on various aspects of multimodal LLM development, particularly focusing on voice-based conversational AI. Their work has also enhanced various Google products such as Gemini App, NotebookLM, Google AI Studio, Google Labs, and Google Cloud.

The role

As a Research Scientist at Google DeepMind, you will:

Key responsibilities

Develop and evaluate multilingual, multicultural, and multimodal LLMs Design, implement, and evaluate multilingual, multicultural, and multimodal capabilities in Gemini and other frontier models at Google.
Collaborate with a world-class team Work closely with other research scientists, engineers, and product teams across Google DeepMind, fostering a collaborative and intellectually stimulating environment.
Contribute to real-world impact See your work contribute to Gemini and other frontier models at Google, with applications across various domains, directly influencing the future of Google products and services.

Contribute to the research community: Contribute to the research community by sharing insights and participating in external academic workshops and conferences.

About you

You are a passionate and talented researcher with a strong foundation and a proven ability to conduct impactful research in AI. You embrace change and thrive under ambiguity. You have a collaborative mindset and are excited to work as part of a team to tackle ambitious research challenges. You are passionate about seeing your research translated into real-world products that improve the lives of users and are eager to work in an environment where research has a direct path to product impact. You are eager to see your research contribute to real-world applications and are driven by a desire to create positive change through AI.

Ph.D. (or equivalent research experience) in Computer Science, Artificial Intelligence, or a related field.
Strong publication record in top-tier AI conferences or journals.
Solid understanding of deep learning, natural language processing, computer vision, and/or speech processing.
Experience with relevant ML frameworks such as JAX, TensorFlow, or PyTorch.
Excellent communication and collaboration skills.

In addition, the following would be an advantage:

Experience with multilingual, multicultural, and/or multimodal learning in LLMs.
Experience with pretraining, post-training techniques, prompt engineering, few-shot learning, and evaluations.
Familiarity with large-scale model training and deployment.
Strong programming skills in Python or similar languages.

About Google DeepMind

We’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

Our long term aim is to solve intelligence, developing more general and capable problem-solving systems, known as artificial general intelligence (AGI).

Guided by safety and ethics, this invention could help society find answers to some of the world’s most pressing and fundamental scientific challenges.

We have a track record of breakthroughs in fundamental AI research, published in journals like Nature, Science, and more.Our programs have learned to diagnose eye diseases as effectively as the world’s top doctors, to save 30% of the energy used to keep data centres cool, and to predict the complex 3D shapes of proteins - which could one day transform how drugs are invented.

Industry

Biotech & Life Sciences

Company Size

5,001-10,000 employees

Headquarters

London, GB

Year Founded

2010

Website

deepmind.com

Social Media