MiniMax

AI Infra研发工程师-模型服务化方向

MiniMax  •  Onsite  •  16 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

AI Infra研发工程师-模型服务化方向北京、上海社招全职互联网 / 电子 / 网游职位描述1. 支持MiniMax模型在全球多机房大规模部署优化。
2. 与模型框架同学合作,优化模型推理中的overhead,构建高性能服务。
3. 参与端到端模型推理服务的研发,在多流量入口,多云多机房等复杂场景下构建稳定高效的服务架构和部署模式,最优化模型服务性能。
4. 构建治理能力,参与可观测,压测,容灾,调试,降级,灰度等能力建设并输出标准文档,以可交付为标准进行持续迭代。职位要求1. 计算机及相关专业,本科及以上学历;
2. 3年以上系统工程或高性能服务工作经验;
3. 理解操作系统基本原理;
4. 能熟练使用一种编程语言,包括不限于Golang/Python/C++/C;
5. 理解常规的架构设计思想,包括不限于服务化、异步、高可用、可扩展等;
6. 有良好的可靠性意识,包括不限于监控、容灾等;
7. 有良好的团队沟通和协作能力,有良好的责任心;
8. 有良好的自驱力和学习能力;
9. 理解常用推理框架如SGLang,Vllm等优先。 投递
MiniMax

About MiniMax

MiniMax is a leading global technology company and one of the pioneers of large language models (LLMs) in Asia. Our mission is to build a world where intelligence thrives with everyone.

MiniMax develops proprietary LLMs across various modalities, including a trillion-parameter MoE model, a speech model with low latency and native support for major Asian languages, and a state-of-the-art text-to-speech and text-to-video models. Experience it now at https://hailuoai.com/

Leveraging these multi-modality general-purpose models, the MiniMax API Platform offers enterprises and developers secure, flexible, and reliable API services, enabling the rapid deployment of AI applications.

Industry
IT & Software
Company Size
51-200 employees
Headquarters
Singapore, SG
Year Founded
2022
Social Media