Sony

Full-Time - Audio-Visual AI Research Scientist_ICASSP

Sony  •  Tokyo, JP / Ōsaki, JP (Onsite)  •  28 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Technology Field

Computer Vision

Speech/Audio Signal Processing

We are seeking Research Scientist to join our fundamental and applied research teams at Sony in Tokyo.
Our aim is to rapidly advance the process of cinematic content creation. To achieve this, we work together with Sony Pictures Entertainment to develop AI technologies that restore and enhance movie content.

With us, you will research and develop innovative computer vision and machine learning technologies for cinematic content creation. You will also have many opportunities to publish your findings and collaborate with a variety of academic institutions worldwide.

See More Here: https://sony.github.io/creativeai/

Responsibilities

■ Research and development of novel computer vision technologies in areas including generative methods, audio-visual scene understanding, audio-visual sound separation/localization, and beyond.
■ Implement findings from computer vision research into real products through collaboration.
■ Work with a strong international team of researchers and engineers with various areas of expertise to develop innovative solutions.
■ Collaborate with Sony`s various branches, including Sony Pictures Entertainment.
■ Collaborate with academic institutions to drive state-of-the-art research.
■ Contribute to the development of research publications to be published at top-tier conferences and journals.

Required qualifications

■ Experience publishing research about machine learning/computer vision at conferences and/or in journals (e.g. CVPR/ICCV/ECCV/NeurIPS/ICLR/ICML/IJCV/PAMI).
■ Experience developing ML/deep learning models for computer vision tasks.
■ Fluency in Python and deep learning frameworks.

Preferred qualifications

■ Ph.D. Degree (graduated or currently pursuing) in computer science, machine learning, or electrical engineering, OR equivalent practical experience.
■ Experience developing ML/deep learning models for audio-visual tasks or other multi-modal tasks.
■ Experience developing ML/deep learning-based generative models.
■ Professional proficiency in English.

Product, Service

Movie production for Sony Pictures Entertainment.


Development Environment

OS: Windows and Linux

Application Requirements

Essay: Required

Coding test: Not Required

Required Skills:

Audio Signal Processing, Computer Vision, Speech Processing

Required Skills:

Audio Signal Processing, Computer Vision, Speech Processing

Optional Skills:

Sony

About Sony

Sony’s purpose is simple. We aim to fill the world with emotion, through the power of creativity and technology. We want to be responsible for getting hearts racing, stirring ambition, and putting a smile on the faces of our customers. That challenge, combined with our spirit of innovation, motivates us to create groundbreaking technology, entertainment, and services for people worldwide.

Our history as a global brand has been built around employees that all have a passion for touching peoples'​ lives, and pride in pushing beyond the status quo to produce truly extraordinary results.

We’re uniquely positioned because we operate in many different industries - from movies and music to video games and electronics. And, with offices around the globe, we benefit from a global workforce that learns and grows together through mutual respect.

If you're ready to join a diverse team at an innovation-led company with the power to change lives, then we encourage you to read up on the different Sony group companies and check out our Life page. Then, get in touch, and together, let’s make the world say wow.

Industry
Arts & Entertainment
Company Size
10,000+ employees
Headquarters
Tokyo, JP
Year Founded
Unknown
Website
sony.com
Social Media