Komodor

AI Engineer Team Lead

Komodor  •  Tel Aviv, IL (Onsite)  •  2 months ago

Job Description

Who are we?

Komodor is a cutting-edge Kubernetes platform built by developers, for developers. We help engineering and infrastructure teams manage complex systems with ease, efficiency, and transparency — so they can focus more on innovation and less on firefighting Kubernetes challenges.

Our platform is trusted by thousands of teams worldwide. Its standout capabilities include Klaudia, our AI-powered Kubernetes failure detection and analysis engine that delivers real-time insights to dev and infra teams. Behind the platform are the Cost team, which helps companies dramatically reduce cloud spend; the Health team, which builds industry-leading troubleshooting features; and our Operations group, which crafts powerful Kubernetes-native agents, operators, and controllers.

Your mission:

Lead the team building the agentic infrastructure powering Klaudia — our AI SRE. You'll own the architecture, set the technical direction, and push experimental ideas into production.

This isn't a coordination role. You'll be a technical leader who builds alongside your team — setting the bar for architecture quality, driving experimental bets, and turning research into production-grade systems.

What will you do?

• Own the agentic infrastructure- You're responsible for the architecture of Klaudia's core: the systems that make autonomous investigation, root cause analysis, and self-healing possible at scale.

• Build and grow your team- Hire great engineers, mentor them, run meaningful design reviews, and build a culture of high ownership and engineering excellence.

• Drive experimental development- Run structured bets — LLM evaluation frameworks, new agent architectures, novel tool-use patterns — and bring the best ones to production.

• Stay hands-on- You'll design systems, review PRs critically, and write code where it matters. Your technical credibility is the foundation of your leadership.

• Shape our AI strategy- Work closely with product and R&D leadership to define what Klaudia becomes next.

Why this role rocks?

• Category-defining work- AI SRE is the hottest space in cloud infra. You'll be leading the team that defines what it means — not catching up to anyone.

• Real autonomy- You'll set the technical roadmap for Core. No pre-solved problems handed down to you — you decide what gets built and how.

• Build something that lasts- Hire and shape a team from an early stage. Your engineering culture decisions here will compound for years.

• Serious engineering culture- High bar for craft, strong team around you, and leadership that respects technical depth over process theater.

Requirements

What will you bring?

• Proven tech leader- 8+ years of backend engineering experience, including at least 2 years in a tech lead or team lead capacity. You've owned systems end-to-end, not just delivered tickets.

• Proven ability to build teams- You've hired engineers you're proud of, mentored people to grow beyond where they thought they could, and created an environment where ownership is the default.

• Deep experience with distributed systems- Concurrency, reliability, observability — you design for failure, not around it.

• Experience building with AI-assisted tools (e.g. Cursor, Claude Code)- You treat these as serious multipliers, not toys.

• Strong communicator- You can hold a design review, align with product leadership, and explain a tricky architecture decision to an engineer in their first week — all with the same clarity.

• Comfortable with open-ended problems- You enjoy tackling challenges that don't have a simple one-prompt AI answer — where the path forward requires judgment, not just execution.

Bonus points:

  • AI/LLM product experience
  • Kubernetes & cloud infra
  • Agent evaluation frameworks

What We Offer:

  • Work with the latest tech (AI, Agents, cloud-native tools)
  • Room to grow and learn
  • Flexible work setup
  • Competitive pay and benefits
  • Budget for conferences, courses, and professional development

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

About Komodor

Komodor enables enterprises to unlock the full potential of Kubernetes at scale. Our pioneering Kubernetes Management Platform eliminates complexity across the entire Kubernetes stack, to drive efficiency, empower developers, and optimize cost and performance.

Komodor is the leading Autonomous AI SRE Platform for cloud native infrastructure and operations. Powered by Klaudia Agentic AI, Komodor automatically visualizes, troubleshoots, and optimizes Kubernetes-based platforms at scale.

Trusted by companies like Dell, Cisco, BlackRock, Balyasny, Rockwell Automation, Priceline, and OpenTable, Komodor is the go-to platform for innovative enterprises who need a comprehensive, autonomous solution for managing and optimizing cloud-native infrastructure and operations at scale.

Industry: IT & Software
Company Size: 51-200 employees
Headquarters: Tel Aviv, IL
Year Founded: 2020