XPENG

Master Thesis Position: Generative AI for synthetic acoustic environments in voice assistants

XPENG  •  Munich, DE (Onsite)  •  1 month ago

Job Description

Munich  •  Internship  •  Intelligent manufacturing / Industrial Internet / Industrial automation - R&D
Responsibilities
Open Position
We are looking for a motivated student to join a paid, on-site, 6-month Master’s thesis project on generative AI, starting in July at the XPENG European R&D Center in Munich. The student will join the AI Model Team and work under the supervision of experts in NLP and Audio Machine Learning.
The project explores how generative audio methods can be used to create realistic background noise and spatial acoustic conditions for data augmentation and controlled evaluation.
This opportunity is well suited for students with a strong interest in speech processing, audio machine learning, and generative AI.
Overview and Motivation
Modern multimodal voice assistants rely on large, high-quality speech datasets recorded in diverse acoustic environments. However, collecting and curating such real-world data is both costly and time-consuming, and often fails to sufficiently cover important edge cases and environmental variations.
Recent advances in generative AI for audio have enabled the synthesis of realistic and controllable acoustic environments, including background noise and spatial sound characteristics. This thesis investigates whether these generative methods can be leveraged to augment or partially replace traditional data collection approaches. The goal is to evaluate whether synthetic acoustic environments can improve model robustness and downstream performance in controlled evaluation settings, thereby enhancing the overall quality of multimodal voice assistants.
Qualifications
Details
- Topic: Exploring Generative AI for Synthetic Acoustic Environment Creation in Multimodal Voice Assistants
- Position Type: Paid Master’s thesis
- Location: XPENG European R&D Center, Weimarer Str. 32, 80807 Munich (on-site, 30 hours per week)
- Team: AI Model Team
- Supervision: Guidance from experts in NLP and Audio Machine Learning
- Duration: 6 months (starting July)
XPENG offers:
- 6-Month Paid Master Thesis Position
- An exciting, unique, and highly diverse role with opportunities at XPENG
- The opportunity to contribute to the growth of an established EV brand in Europe

About XPENG

XPeng is a leading Chinese Smart EV company that designs, develops, manufactures, and markets Smart EVs that appeal to the large and growing base of technology-savvy middle-class consumers. Its mission is to drive Smart EV transformation with technology and data, shaping the mobility experience of the future. To optimize its customers’ mobility experience, XPeng develops its full-stack advanced driver-assistance system technology and in-car intelligent operating system in-house, as well as core vehicle systems including the powertrain and the electrical/electronic architecture. XPeng is headquartered in Guangzhou, China. In 2021, the Company established its European headquarters in Amsterdam, along with other dedicated offices in Copenhagen, Munich, Oslo, and Stockholm. The Company’s Smart EVs are mainly manufactured at its plants in Zhaoqing and Guangzhou, Guangdong province.

For more information, please visit https://heyxpeng.com.

- Industry: Automotive & Mobility
- Company Size: 1,001-5,000 employees
- Headquarters: Guangzhou, CN
- Year Founded: 2014