Job Description
Team Introduction:
The Applied Machine Learning (AML) team is committed to the research and deployment of the next-generation of machine learning core technologies. This covers large pre-trained models and device-cloud collaboration learning, as well as wide applications in search, recommendation, advertising, auditing, federated learning, and more. The team has a strong foundation in scientific research, engineering and product implementation. Our team members have rich backgrounds covering natural language processing (NLP), computer vision (CV), multimodality, graph computing, search and recommendation, federated learning and other fields, and have published more than 100 top-tier conference papers.
We are looking for talented individuals to join our team in 2027. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at our Company.
Successful candidates must be able to commit to an onboarding date by end of year 2027. Please state your availability and graduation date clearly in your resume.
Topic Content: Large-scale recommendation systems are being increasingly adopted across products such as short-video, text-based community, and image platforms, with modality-specific information playing an ever-growing role in recommendations.
In ByteDance's practice, we have found that modality information serves effectively as generalizable features to support recommendation and other business scenarios. Research on end-to-end ultra-large-scale multimodal recommendation systems holds significant potential.
Building on an algorithm-engineering co-design approach, we aim to further explore directions including multimodal co-training, models with hundreds of billions of parameters, and end-to-end modeling with extended sequence lengths.
On the engineering side, research directions include: multimodal sample representation; high-performance multimodal inference engines built on the PyTorch framework; high-performance multimodal training framework development; and the application of heterogeneous hardware in multimodal recommendation systems.
On the algorithm side, research directions include: designing effective recommendation-and-ads multimodal co-training architectures, sparse MoE, memory networks, and mixed precision.
Topic Challenges:
1. High difficulty in unifying multimodal representations and achieving efficient fusion.
2. Extremely high computational costs for training and inference of ultra-large-parameter models.
3. Difficulty balancing stability and efficiency in end-to-end modeling with long sequences.
4. Significant complexity in algorithm-engineering co-design and heterogeneous hardware adaptation
Topic Value:
1. Technical value:
- Achieve breakthroughs in multimodal representation fusion and training/inference bottlenecks for ultra-large-scale models; refine the co-design framework for algorithms and engineering; advance heterogeneous hardware adaptation and the development and deployment of domestically developed high-performance frameworks.
2. Business value:
- Enhance recommendation accuracy and generalization capability in multimodal scenarios; overcome the modality limitations of existing recommendation systems; empower multiple products including short-video and text-based community platforms; reduce computational costs; and drive scalable business growth.