MiniMax

Resource Scheduling Engineer (Training/Inference)

MiniMax  •  Onsite  •  15 days ago

Job Description

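Location: Shanghai, Beijing  •  Experienced hire  •  Full-time  •  Internet / Electronics / Online Games  •  Large Model Systems

Responsibilities
1. Manage and schedule large-scale, multi-cluster heterogeneous resources.
2. Build orchestration abstractions for LLM training and inference services, and keep iterating toward more efficient deployment patterns.
3. Work with the traffic-scheduling and task-scheduling layers to ensure the stability and utilization efficiency of training and inference resources.
4. Own service and cluster elasticity, moving resources efficiently between workloads while maintaining the quality of training and inference services.
5. Integrate multiple training and inference resource management systems and frameworks under unified management, moving toward a global optimum.

Requirements
1. Bachelor's degree or above in computer science or a related field.
2. Experience maintaining and managing large-scale heterogeneous clusters, multi-cloud and multi-cluster architectures, large-scale elastic systems, or co-location of online and offline workloads is preferred.
3. Familiarity with the LLM ecosystem; experience with LLM architectures such as MoE, PD separation (prefill/decode disaggregation), and EP (expert parallelism) is preferred.
4. Solid programming skills and good taste in code, with a strong foundation in data structures and algorithms; award winners in competitions such as ACM/ICPC or OI are preferred.
5. Proficiency in at least one programming language, including but not limited to Golang, Python, C++, or C.
6. Understanding of common architecture design principles, including but not limited to service-oriented design, asynchronous processing, high availability, and scalability.
7. Good team communication and collaboration skills, and a strong sense of responsibility.
8. Strong self-motivation and learning ability.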
About MiniMax

MiniMax is a leading global technology company and one of the pioneers of large language models (LLMs) in Asia. Our mission is to build a world where intelligence thrives with everyone.

MiniMax develops proprietary LLMs across various modalities, including a trillion-parameter MoE model, a low-latency speech model with native support for major Asian languages, and state-of-the-art text-to-speech and text-to-video models. Experience it now at https://hailuoai.com/

Leveraging these multi-modality general-purpose models, the MiniMax API Platform offers enterprises and developers secure, flexible, and reliable API services, enabling the rapid deployment of AI applications.

Industry
IT & Software
Company Size
51-200 employees
Headquarters
Singapore, SG
Year Founded
2022