About Us:
Zenity is the first and only holistic platform built to secure and govern AI Agents from buildtime to runtime. We help organizations defend against security threats, meet compliance, and drive business productivity. Trusted by many of the world’s F500 companies, Zenity provides centralized visibility, vulnerability assessments, and governance by continuously scanning business-led development environments. We recently raised $38 million in a Series B funding, solidifying our position as a leader in the industry and enabling us to accelerate our mission of securing AI Agents everywhere.
About The Role:
This is a research‑first role focused on deeply understanding LLM internals to improve the security of AI agents. You’ll design careful experiments on activations and interpretable features- e.g., probing, attribution & ablation/patching, representation‑geometry analyses-to uncover mechanisms behind jailbreak, indirect prompt injection, and other attacks. Then translate those insights into signals that can be used for detection and analysis of a model response.
The field of LLM interpretability at scale is exploding, with several major publications in the last months, and major opportunities for innovation.
What You’ll Do

Zenity is the first security and governance platform purpose-built for AI agents - spanning SaaS, home grown platforms (Cloud), and end-user devices (Endpoint). Trusted by Fortune 500 enterprises, Zenity helps security teams confidently adopt AI by delivering defense in depth with full-lifecycle coverage: from agent discovery and posture management to real-time detection, prevention, and response. As enterprises adopt Microsoft Copilot, Salesforce Agentforce, AWS Bedrock, and developer tools like GitHub Copilot, Zenity eliminates blind spots and enforces consistent policy across environments so organizations can innovate with AI, without compromising security. Learn more at www.zenity.io.