Alice’s Innovation team builds adversarial RL environments that train the world’s most advanced AI models to be safer. Our customers are the leading frontier AI labs, who use these environments for post-training reinforcement learning and safety evaluation. This is the bleeding edge of AI safety technology: the environments you build will directly shape how next-generation models learn to resist adversarial attacks.
We’re looking for a Principal Software Engineer to own the RL Gym platform end-to-end: from architecting multi-site web environments that simulate real-world attack surfaces, to optimizing our in-house orchestration harness (AgenticVerse) for high-performance delivery into customer training pipelines.
This is a builder role. You’ll lead a small team (including a dedicated web environments engineer), operating with high autonomy, moving fast from concept to working prototype to production system. You’ll interact directly with customer engineering teams to understand their infrastructure constraints and deliver environments that meet their scale and reliability requirements.
Why this role
This is one of the few roles in the industry where your code directly influences how the next generation of AI models are trained. You’ll be at the center of advancing AI safety, building systems that the world’s top labs depend on to make their models more robust. The work is technically deep, the problem space is genuinely novel, and the field is moving faster than any team can keep up with alone. There’s no playbook. You’ll write it.
What you’ll do:
Platform & performance
Customer delivery
Rapid prototyping
Must have
Nice to have
Alice is a trust, safety, and security company built for the AI era. We safeguard the communicative technologies people use to create, collaborate, and interact—whether with each other or with machines.
In a world where AI has fundamentally changed the nature of risk, Alice provides end-to-end coverage across the entire AI lifecycle. We support frontier model labs, enterprises, and UGC platforms with a comprehensive suite of solutions: from model hardening evaluations and pre-deployment red-teaming to runtime guardrails and ongoing drift detection.

ActiveFence is the leading provider of AI security and safety solutions, protecting online experiences and AI applications for over 3 billion users, top foundation models, and the world’s largest enterprises and tech platforms.
As a trusted partner to major technology companies and Fortune 500 brands, we secure user-generated and GenAI products against prompt injection, adversarial attacks, and harmful content through Real-Time Guardrails, continuous Red Teaming, and the industry’s most advanced threat intelligence.
With unmatched detection capabilities in 117+ languages, ActiveFence empowers organizations to deliver engaging, safe, and trustworthy experiences globally, helping them innovate responsibly while staying ahead of emerging threats.