MiniMax

SRE运维研发工程师-云原生

MiniMax  •  Onsite  •  15 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

SRE运维研发工程师-云原生上海、北京社招全职研发 - 运维职位描述1.负责 Minimax 线上 K8S及云原生周边系统的运维保障和工具开发;
2.负责公司内部大规模 K8S 集群的建设和稳定性保障;
3.负责监控/日志/网络/存储等原生基础设施的保障和工具开发;
4.负责业务容器化部署、互联互通以及疑难问题的排查解决;
5.参与 OnCall 值班,第一时间响应并与研发团队共同解决各类突发事件,保障核心系统的稳定性。职位要求1.大规模 k8s 系统的建设和运维经验,熟悉linux、网络等系统运维的技能;
2.对大规模分布式集群的部署架构设计,分析,故障排查有强烈兴趣;
3.熟悉 Docker/Kubernetes 容器生态核心开源项目和周边服务生态项目,如监控、日志、网络等方案,精通或者有实施经验。
加分项:
1.具有 k8s 二次开发经验,有自定义 operator 的开发经验,或者csi/cni插件的经验;
2.对 k8s 调度系统深入研究,熟悉 volcano,kueue 等组件;
3.具有大规模GPU集群运维经验 投递
MiniMax

About MiniMax

MiniMax is a leading global technology company and one of the pioneers of large language models (LLMs) in Asia. Our mission is to build a world where intelligence thrives with everyone.

MiniMax develops proprietary LLMs across various modalities, including a trillion-parameter MoE model, a speech model with low latency and native support for major Asian languages, and a state-of-the-art text-to-speech and text-to-video models. Experience it now at https://hailuoai.com/

Leveraging these multi-modality general-purpose models, the MiniMax API Platform offers enterprises and developers secure, flexible, and reliable API services, enabling the rapid deployment of AI applications.

Industry
IT & Software
Company Size
51-200 employees
Headquarters
Singapore, SG
Year Founded
2022
Social Media