
Our story might surprise you. We’re the world’s largest restaurant company, encompassing KFC, Pizza Hut, Taco Bell and Habit Burger & Grill, but there’s a lot more going on behind the scenes than just frying chicken, baking pizzas, and serving up tacos. We put this delicious food in the hands of customers through apps, websites, kiosks, point-of-sale (POS) systems, and other digital dining experiences.
We are looking for a skilled AI Support Specialist (MLE Focus) to join our 24/7 operations team and to maintain and optimize machine learning pipelines, infrastructure, and deployments. This role involves troubleshooting model deployment issues, ensuring system scalability, and working closely with machine learning engineers (MLEs) to resolve infrastructure-related challenges.
Key Responsibilities:
Operational Support:
Monitor machine learning pipelines, APIs, and deployment environments for errors and performance degradation.
Troubleshoot issues related to model inference, deployment failures, or infrastructure bottlenecks.
Perform root cause analysis for system incidents, documenting findings and implementing preventive measures.
Support CI/CD workflows for model updates and pipeline changes.
Incident Management:
Act as the first responder for MLE-related incidents detected via monitoring tools or reported by users.
Escalate unresolved issues to MLEs or engineering teams and follow through to resolution.
Track incident metrics (e.g., mean time to resolution) and provide insights for operational improvement.
Collaboration:
Partner with MLEs to support the deployment of new models and infrastructure changes.
Collaborate with Data Scientists and AI Engineers to ensure seamless handoffs and alignment on system requirements.
Continuous Improvement:
Contribute to the development of operational playbooks for model deployments and infrastructure support.
Required Qualifications:
Bachelor’s degree in Computer Science, Engineering, or a related field.
1–2 years of experience in machine learning engineering (MLE), DevOps, or AI system operations roles.
Strong knowledge of cloud platforms (AWS, GCP, or Azure) and of containerization and orchestration tools (e.g., Docker, Kubernetes).
Familiarity with MLOps tools and frameworks (e.g., MLflow, Kubeflow, SageMaker).
Experience with CI/CD pipelines and infrastructure as code (e.g., Terraform, CloudFormation).
Excellent problem-solving skills and ability to work in a fast-paced, 24/7 support environment.
Preferred Qualifications:
Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog).
Knowledge of scripting and automation with Python or Bash.

Yum! Brands, Inc., based in Louisville, Kentucky, and its subsidiaries franchise or operate a system of over 60,000 restaurants in more than 155 countries and territories under the Company’s concepts – KFC, Taco Bell, Pizza Hut and the Habit Burger Grill. The Company's KFC, Taco Bell and Pizza Hut brands are global leaders of the chicken, Mexican-style food, and pizza categories, respectively. The Habit Burger Grill is a fast casual restaurant concept specializing in made-to-order chargrilled burgers, sandwiches and more.
What makes Yum! a great place to work? It's our people. As the world's largest restaurant company, we invest in people capability so that our global workforce can make the most of their careers. With ongoing opportunities for personal and professional success, we've built a culture that rewards and recognizes great effort while providing the flexibility that is so important to all of us.