Job Description
We are looking for talented individuals to join our team in 2027. As a graduate, you will have opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at our Company.
Successful candidates must be able to commit to an onboarding date by the end of 2027. Please state your availability and graduation date clearly in your resume.
Team Introduction:
The Applied Machine Learning Enterprise team combines system engineering and machine learning to develop and operate Large Language Model (LLM) service platforms that offer businesses Model-as-a-Service (MaaS) solutions, serving both large model providers and downstream users. The US team drives the design, development, and operation of MaaS solutions across the US and international markets outside mainland China. We are building full-stack, end-to-end solutions spanning text and multimodal LLM algorithms, LLM training/fine-tuning/inference frameworks, prompt engineering, model alignment, and intelligent agent systems. Beyond model serving, we operate large-scale log analytics pipelines that process massive volumes of invocation logs from text models, multimodal models, and agent systems — extracting usage patterns, quality signals, and actionable insights to inform model improvement, system optimization, and product decisions through continuous, data-driven feedback loops. We are actively seeking talented engineers and researchers specializing in Large Language Models and AI Agent systems to join our dynamic team.
Topic Content:
As foundation models are increasingly applied in real ToB scenarios, AI system optimization now extends beyond the foundation model itself to the broader business system composed of the model, prompt, memory, tools, skills, workflow, and the external environment. Compared with offline benchmarks, real-world cases offer greater potential for optimization but also present challenges such as larger data volumes, higher noise levels, more diverse scenarios, greater structural heterogeneity, and limited user feedback, all of which make them difficult to use directly.
Relying on the real-world data accumulated on the Volcano Ark case platform, this project aims to unify logs, cases, feedback, and environmental information into structured objects that are understandable, attributable, and optimizable. By integrating AI-assisted tools that guide users to give feedback efficiently, it aims to build an AI data flywheel system tailored to real scenarios. This system will both support foundation model iteration and address issues related to environment, memory, tools, and workflows within the business system, with a focus on developing agent optimization capabilities that help SA/FDE support customers more efficiently.
Responsibilities:
- Build a next-generation Model-as-a-Service (MaaS) platform that serves hundreds of LLM-based applications;
- Develop and maintain the MaaS platform, including offline training/fine-tuning, online inference, model management, and resource orchestration;
- Manage large-scale GPU resources and provide computing power efficiently.
The base salary range for this position in the selected city is $202,160 - $368,220 annually.