MiniMax

LLM Traffic Scheduling Engineer

MiniMax  •  Onsite  •  15 days ago

Job Description

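Beijing, Shanghai  •  Experienced hire  •  Full-time  •  Internet / Electronics / Online Games  •  LLM Systems

Responsibilities

1. Design and develop large-scale traffic scheduling frameworks.
2. Improve the efficiency of the traffic distribution system by defining optimization objectives across stability, throughput, latency, and other metrics, and making sound trade-offs among them.
3. Develop and iterate on load-balancing algorithms, selecting and tuning them appropriately for different traffic load patterns.
4. Evolve the traffic-serving model to meet the needs of LLM training and inference services, including but not limited to synchronous, asynchronous, streaming, and batch modes.

Requirements

1. Bachelor's degree or above in computer science or a related field.
2. Experience with cross-cluster and intra-cluster traffic scheduling, traffic proxying, load balancing, and large-scale distributed systems.
3. Familiarity with the LLM ecosystem; experience with LLM architectures such as MoE, PD (prefill/decode) disaggregation, and EP (expert parallelism) is a plus.
4. Solid programming skills and good taste in code, with a strong foundation in data structures and algorithms; awards in competitions such as ACM/ICPC or OI are a plus.
5. Proficiency in at least one programming language, including but not limited to Golang, Python, C++, or C.
6. Understanding of common architectural design principles, including but not limited to service-oriented design, asynchrony, high availability, and scalability.
7. Good team communication and collaboration skills, and a strong sense of responsibility.
8. In-depth knowledge of common cloud-native middleware such as service mesh, Redis, and message queues (MQ).
9. An observability mindset and familiarity with common observability components.
10. Strong self-motivation and ability to learn.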

About MiniMax

MiniMax is a leading global technology company and one of the pioneers of large language models (LLMs) in Asia. Our mission is to build a world where intelligence thrives with everyone.

MiniMax develops proprietary LLMs across various modalities, including a trillion-parameter MoE model, a low-latency speech model with native support for major Asian languages, and state-of-the-art text-to-speech and text-to-video models. Experience them now at https://hailuoai.com/

Leveraging these multi-modality general-purpose models, the MiniMax API Platform offers enterprises and developers secure, flexible, and reliable API services, enabling the rapid deployment of AI applications.

Industry: IT & Software
Company Size: 51-200 employees
Headquarters: Singapore, SG
Year Founded: 2022