Xiaomi Technology

大模型训练与推理Infra工程师-MiMo

Xiaomi Technology  •  Remote  •  8 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

大模型训练与推理Infra工程师-MiMo北京社招全职职位 ID:A14015职位描述1. 模型训练基础设施开发
- 设计和实现支持大规模分布式训练的计算平台,优化模型训练效率和资源利用率。
- 维护和扩展现有的分布式训练框架,确保平台的高性能和稳定性(如基于 PyTorch、TensorFlow 或 JAX)。
- 集成和优化高性能计算技术(如 CUDA、MPI、NCCL 等)。
2. 模型推理基础设施开发
- 构建高效的推理框架,支持大模型的在线和离线推理需求。
- 优化推理速度、内存占用和能耗,支持多种硬件架构(GPU、NPU等 )。
- 实现PD分离、Context Caching、模型量化、推敲编码等推理优化技术。
3. 性能监控与优化
- 开发工具链和监控系统,跟踪训练与推理过程的性能瓶颈。
- 分析并优化数据加载、通信效率和硬件利用率等关键环节。
4. 跨团队协作
- 与模型研究团队密切合作,理解模型需求,定制训练和推理策略。
- 支持产品团队的模型部署需求,推动大模型在实际场景中的落地应用。职位要求基本要求:
- 计算机科学、软件工程、机器学习或相关领域的本科及以上学历,硕士或博士优先。
- 深入理解深度学习原理和分布式训练框架(如 Horovod、DeepSpeed、Ray 等)。
- 熟练掌握至少一种主流深度学习框架(如 PyTorch、TensorFlow 或 JAX)。
- 熟悉高性能计算技术(CUDA、NCCL、cuDNN 等)及硬件架构(GPU、NPU 等)。
- 具有扎实的编程能力,精通 Python 和至少一种系统级编程语言(如 C++)。
优先条件:
- 有参与或主导过大规模模型(如 Transformer、大语言模型)的训练和部署经验。
- 熟悉模型优化技术(如混合精度训练、剪枝、量化等)。
- 对云计算和容器化技术(如 Kubernetes、Docker、Terraform)有实际经验。
- 对新兴 AI 硬件(如 H卡)有实操经验。
- 具备优秀的系统设计和性能调优能力。 投递
Xiaomi Technology

About Xiaomi Technology

Xiaomi Corporation was founded in April 2010 and listed on the Main Board of the Hong Kong Stock Exchange on July 9, 2018 (1810.HK). Xiaomi is a consumer electronics and smart manufacturing company with smartphones and smart hardware connected by an IoT platform at its core.

Embracing our vision of “Make friends with users and be the coolest company in the users’ hearts”, Xiaomi continuously pursues innovations, high-quality user experience and operational efficiency. The company relentlessly builds amazing products with honest prices to let everyone in the world enjoy a better life through innovative technology.

Xiaomi is one of the world's leading smartphone companies. The company has also established the world’s leading consumer AIoT (AI+IoT) platform,reached 558 million smart devices connected to its platform (excluding smartphones,laptops and tablets) as of September 30 2022. Xiaomi products are present in more than 100 countries and regions around the world. In August 2022, Xiaomi was included in the Fortune Global 500 list for the fourth year in a row, ranking 266th. The company is the fastest-rising Chinese technology conglomerate during the four-year period.

Xiaomi is a constituent of the Hang Seng Index, Hang Seng China Enterprises Index, Hang Seng TECH Index and Hang Seng China 50 Index.

Industry
IT & Software
Company Size
10,000+ employees
Headquarters
Beijing, CN
Year Founded
2010
Social Media