
We are seeking a highly qualified Phonetic Linguist to join our team working on cutting-edge AI voice synthesis technology. This role involves precise annotation of professional singing recordings to create training data for advanced voice synthesis models.
The position requires segmenting singing audio into individual phonemes with millisecond-level accuracy using spectrogram analysis. You will work with recordings across multiple languages, identifying and labeling vocal techniques including breathing patterns, glottal stops, vocal fry, and silence markers.
This is a remote contract position offering the opportunity to work with proprietary annotation technology while contributing to breakthrough developments in AI-powered vocal synthesis. The role involves collaboration with an international team and handling confidential voice synthesis research.
Key responsibilities include:
Segmenting singing recordings into 5-30 second phrases with precise timing
Correcting automated phoneme predictions using visual and auditory analysis
Labeling special vocal elements (breathing, silences, glottal stops)
Handling cross-linguistic pronunciation variations in musical contexts
Maintaining consistent quality standards across large datasets
Working with proprietary web-based annotation platforms
Technical requirements:
Reliable high-speed internet connection
Professional audio equipment for precise listening
Quiet workspace suitable for detailed audio analysis
Availability for 20-30 hours per week
Comfortable with NDA and confidentiality requirements
Essential qualifications:
Master's or PhD in Linguistics, Phonetics, or related field
Expert knowledge of International Phonetic Alphabet (IPA)
Proven experience with acoustic analysis software (Praat, ELAN)
Strong background in phonetic transcription and spectrogram interpretation
Experience with time-aligned annotation workflows
Native or near-native English proficiency
Additional language skills (Japanese, Mandarin, Korean, Spanish preferred)
Preferred qualifications:
Experience with speech synthesis or voice technology projects
Understanding of vocal techniques and singing styles
Background in computational linguistics or audio signal processing
Previous work with AI training data annotation
Research experience in acoustic phonetics or speech science

At Your Personal AI (YPAI), we specialize in helping businesses harness the transformative power of Artificial Intelligence (AI) and Machine Learning (ML). From foundational data collection to advanced AI model development, and from strategic consulting to team training, we deliver comprehensive solutions that guide organizations through every stage of their AI journey.
What We Do
Data Collection & Preparation: We provide high-quality, diverse, and ethically sourced datasets, including audio recordings, image collections, and detailed annotations, precisely calibrated to drive AI advancement.
AI Consulting: We guide businesses in developing robust AI strategies, designing scalable systems, and identifying opportunities for innovation and optimization.
Custom AI Solutions: We create tailored AI and ML systems for predictive analytics, natural language processing, generative AI, and automation.
Training & Upskilling: We equip teams with essential skills and knowledge, from foundational AI concepts to advanced implementation and optimization.
With our global presence, we combine deep technical expertise with localized insights to support businesses across industries. Whether you're initiating your AI journey, refining existing systems, or building cutting-edge models, YPAI is your trusted partner for impactful, scalable, and ethical AI solutions.
At YPAI, we enable organizations to fully realize AI's potential. Through strategic collaboration, we co-create solutions that align with your business objectives, driving innovation, efficiency, and sustainable growth.
Partner with Your Personal AI to transform your vision into reality. Connect with us today to discover how we can help your business thrive in an AI-driven future.