
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.
We are seeking highly motivated and innovative Research Scientists to join our team in Tokyo, focused on advancing the state-of-the-art in multilingual, multicultural, and multimodal large language models (LLMs). You will conduct cutting-edge research on Gemini, particularly in the multilingual, multicultural, and multimodal domain (speech, vision, and text), with a direct path to impacting billions of users through Google products. This role offers a unique opportunity to contribute to foundational research in multilingual and multimodal LLMs, with a special emphasis on unique challenges in the Asia-Pacific (APAC) region, while collaborating with a world-class team at Google DeepMind around the world. If you are passionate about shaping the future of human-computer interaction through multilingual and multimodal LLMs and are eager to make a significant impact on users in the APAC region and beyond, we encourage you to apply.
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence and build Artificial General Intelligence (AGI). We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
The Tokyo position is based in the Frontier AI group at Google DeepMind Japan. The group conducts research and development on various aspects of multimodal LLM development, particularly focusing on voice-based conversational AI. Their work has also enhanced various Google products such as Gemini App, NotebookLM, Google AI Studio, Google Labs, and Google Cloud.
As a Research Scientist at Google DeepMind, you will:
Contribute to the research community: Contribute to the research community by sharing insights and participating in external academic workshops and conferences.
About you
You are a passionate and talented researcher with a strong foundation and a proven ability to conduct impactful research in AI. You embrace change and thrive under ambiguity. You have a collaborative mindset and are excited to work as part of a team to tackle ambitious research challenges. You are passionate about seeing your research translated into real-world products that improve the lives of users and are eager to work in an environment where research has a direct path to product impact. You are eager to see your research contribute to real-world applications and are driven by a desire to create positive change through AI.
In addition, the following would be an advantage:

We’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
Our long term aim is to solve intelligence, developing more general and capable problem-solving systems, known as artificial general intelligence (AGI).
Guided by safety and ethics, this invention could help society find answers to some of the world’s most pressing and fundamental scientific challenges.
We have a track record of breakthroughs in fundamental AI research, published in journals like Nature, Science, and more.Our programs have learned to diagnose eye diseases as effectively as the world’s top doctors, to save 30% of the energy used to keep data centres cool, and to predict the complex 3D shapes of proteins - which could one day transform how drugs are invented.