ActiveFence

GenAI Analyst/Prompt Engineer

ActiveFence  •  Warsaw, PL (Onsite)  •  5 months ago

Job Description

ActiveFence is seeking a driven, detail-focused professional to join our team as a Generative AI Analyst. In this role, you'll work at the cutting edge of technology, meticulously analyzing various content infringements to help secure the new wave of Generative AI tools. You will collaborate with experts in diverse fields such as Hate Speech, Misinformation, Intellectual Property and Copyright, and Child Safety, among others.

Your tasks will involve writing adversarial prompts to identify weaknesses in various AI models, including Large Language Models (LLMs), Text-to-Image, Text-to-Video, and beyond. You'll also oversee data management to guarantee the highest quality of outputs.

Responsibilities

  • Developing adversarial and risky prompts across several areas of abuse to expose potential vulnerabilities in models.
  • Handling extensive datasets across multiple languages and areas of abuse, ensuring precision and meticulous attention to detail.
  • Continuously investigating new tactics for circumventing foundation models' safety measures.
  • Working alongside diverse teams—engineering, product, policy—to tackle new challenges and craft forward-thinking strategies and resolutions.
  • Promoting a culture of knowledge exchange and continual learning within the team.

Requirements

Must have:

  • Familiarity with Generative AI models is essential, though direct technical experience is not a prerequisite.
  • Command of English at a near-native level.
  • Attention to detail, organizational capabilities, and the capacity to juggle numerous tasks concurrently.
  • Data analysis skills.

Nice to have:

  • Experience with various model types (Text-to-Text, Text-to-Image) is desirable.
  • Prior experience with OSINT (Open Source Intelligence) will be considered an asset.
  • A self-starter attitude, with the energy to excel in a fast-moving and variable environment.

About ActiveFence

ActiveFence is the leading provider of security and safety solutions for online experiences, safeguarding more than 3 billion users, top foundation models, and the world’s largest enterprises and tech platforms every day. As a trusted ally to major technology firms and Fortune 500 brands that build user-generated and GenAI products, ActiveFence empowers security, AI, and policy teams with low-latency Real-Time Guardrails and a continuous Red Teaming program that pressure-tests systems with adversarial prompts and emerging threat techniques. Powered by deep threat intelligence, unmatched harmful-content detection, and coverage of 117+ languages, ActiveFence enables organizations to deliver engaging and trustworthy experiences at global scale while operating safely and responsibly across all threat landscapes.

ActiveFence is the leading provider of AI security and safety solutions, protecting online experiences and AI applications for over 3 billion users, top foundation models, and the world’s largest enterprises and tech platforms.

As a trusted partner to major technology companies and Fortune 500 brands, we secure user-generated and GenAI products against prompt injection, adversarial attacks, and harmful content through Real-Time Guardrails, continuous Red Teaming, and the industry’s most advanced threat intelligence.

With unmatched detection capabilities in 117+ languages, ActiveFence empowers organizations to deliver engaging, safe, and trustworthy experiences globally, helping them innovate responsibly while staying ahead of emerging threats.

Industry: IT & Software
Company Size: 201-500 employees
Headquarters: New York
Year Founded: 2018