Job Description
About the Team
At Multimedia Lab, we push the boundaries of what’s possible in multimedia technology. Our mission is to pioneer cutting-edge research across image and video understanding, generation, processing, compression, and transmission—and transform these innovations into real-world products that delight hundreds of millions of users globally.
The ideal candidate combines deep technical expertise with a strong record of innovation, thrives on solving challenging problems at scale, and is passionate about shaping the future of multimedia experiences. This is an opportunity to work alongside top talent, drive frontier research, and turn breakthrough ideas into impactful technologies used around the world.
Topic Content:
Multimodal Foundation Models for Intelligent Multimedia Processing
Explore next-generation multimedia technologies powered by multimodal foundation models, including perceptual quality modeling, generative enhancement, temporal video understanding, user-centric evaluation, and intelligent visual representation/compression, to advance video quality, efficiency, and user experience in future multimedia systems.
Challenges for the analysis, understanding, and quality assessment and enhancement based on multimodal large models:
- Modeling complex time sequences in long multimodal videos
- Building few-shot grounding-based models for quality assessment
- Creating interactive video processing/enhancement models aligned with user preferences
Research value of analysis, understanding, and quality assessment and enhancement based on multimodal large models:
- Enhance semantic understanding and event localization in medium- and long-length videos, improving processing efficiency, and support key areas such as ads recommendation, content comprehension, video value evaluation, and transcoding enhancement
- Lower the cost of quality annotation, enable interpretable assessment of local degradation, boost generalization across different content types, and support pixel-level quality inspection and optimization
We are looking for talented individuals to join our team in 2027. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at our Company.
Successful candidates must be able to commit to an onboarding date by end of year 2027. Please state your availability and graduation date clearly in your resume.
Responsibilities
- Design video analysis (ROI/SOD, content understanding, temporal grounding etc.) and quality assessment algorithms, and participate in database creation, algorithm design/development/optimization, etc.
- Participate in designing strategy and solution for E2E video quality optimization with a combination of video analysis, processing and encoding algorithms
- Apply designed algorithms for VOD / Live streaming monitoring, data analysis, objective evaluation for algorithms etc.
- Collaborate with cross-functional teams to integrate algorithms into production workflows and validate their impact through A/B testing.
The base salary range for this position in the selected city is $212800 - $450000 annually.