ActiveFence

Software Development

Protect your users. Protect your platform.

See jobs Follow

View all 301 employees

About us

ActiveFence protects online platforms and their users from the widest spectrum of harms and abuses with one complete Trust and Safety solution. From efficient operations to scaled detection, in-depth investigations, and the latest generative AI safety measures, we enable Trust and Safety at scale, empowering platforms to protect their communities and thrive in a rapidly evolving digital landscape.

Website: https://www.activefence.com/
External link for ActiveFence
Industry: Software Development
Company size: 201-500 employees
Headquarters: New York
Type: Privately Held
Founded: 2018

Locations

Primary

New York, US

Get directions
New York, NY, US

Get directions

Employees at ActiveFence

See all employees

Updates

ActiveFence

25,217 followers
4h
Report this post
𝗨𝗱𝗲𝗺𝘆 – 𝗘𝗺𝗽𝗼𝘄𝗲𝗿𝗶𝗻𝗴 𝗦𝗸𝗶𝗹𝗹𝘀 𝗳𝗼𝗿 𝘁𝗵𝗲 𝗙𝘂𝘁𝘂𝗿𝗲 Udemy, a leader in online learning, needed to maintain course quality and keep up with rising customer expectations. They partnered with ActiveFence to create a scalable content management solution. 𝗧𝗵𝗲 𝗿𝗲𝘀𝘂𝗹𝘁𝘀? ✅ Increased customer trust ✅ Fewer escalations ✅ And improved content quality ratings Find out how Udemy is enhancing the learning experience for millions of users with proactive content moderation 👉 https://lnkd.in/d2ZtAzF2

Like Comment Share
ActiveFence reposted this
Noam Schwartz

CEO @ ActiveFence | UGC and Generative AI Alignment
18h
Report this post
🤖 Can AI fake being good? Anthropic’s research dives into the fascinating (and a little chilling) concept of 'alignment faking' in LLMs. It turns out, AI can strategically act aligned with training objectives while secretly planning to behave differently when unmonitored. This raises big questions about trust and transparency in AI systems. Read on to explore how AI might not just follow our rules, but play by them - until it doesn’t.

The Sneaky Brain of AI: How Alignment Faking Works

Noam Schwartz on LinkedIn

Like Comment Share
ActiveFence

25,217 followers
2d Edited
Report this post
🤝 𝗖𝗹𝗼𝘀𝗶𝗻𝗴 𝗢𝘂𝘁 𝟮𝟬𝟮𝟰: 𝗣𝗿𝗲𝗽𝗮𝗿𝗶𝗻𝗴 𝗳𝗼𝗿 𝟮𝟬𝟮𝟱 🤝 Trust & Safety teams have faced an incredibly challenging year. From the democratization of harmful content creation driven by #GenerativeAI to navigating major elections and global conflicts, the landscape has shifted dramatically. These challenges unfolded under heightened scrutiny and with leaner teams, demanding both resilience and innovation. Looking ahead to 2️⃣ 0️⃣ 2️⃣ 5️⃣ , the stakes are higher than ever. We’re here to help. Our commitment is to empower platforms with the tools, insights, and expertise needed to face these challenges. 🥂 𝗛𝗲𝗿𝗲’𝘀 𝘁𝗼 𝗮 𝘀𝗮𝗳𝗲𝗿 𝟮𝟬𝟮𝟱. 𝗖𝗼𝗻𝘁𝗮𝗰𝘁 𝘂𝘀 𝘁𝗼 𝗹𝗲𝗮𝗿𝗻 𝗺𝗼𝗿𝗲 🥂 https://lnkd.in/d5fBzTBr

Like Comment Share
ActiveFence reposted this
Noam Schwartz

CEO @ ActiveFence | UGC and Generative AI Alignment
2d
Report this post
Minutes before 2025, I want to wish all my friends, partners, customers and the incredible team at ActiveFence a year of achievements that go beyond our most ambitious milestones. May we continue to push boundaries, innovate, and deliver on the promises we’ve made to ourselves and the world. The journey into the trust and safety space, had a profound impact on my life, shaping not just who I am today but also the one I aspire to be in the years to come. At its core, trust and safety work is about safeguarding people, fostering healthy communities, and ensuring that technology serves humanity responsibly. For me, and for all of us at ActiveFence achieving our milestones isn’t just about hitting numbers or delivering a product. It’s about protecting people, keeping harmful content at bay, ensuring alignment in generative AI, and empowering our customers to make the digital world safer and better. Every goal met is a step toward a healthier, safer, and more open online environment, and for that, I’m endlessly grateful. To my incredible team, thank you for your resilience, creativity, and commitment. You are a lighthouse of innovation and an inspiration. Your ability to tackle complex challenges and work together toward a shared vision, no matter what, is motivating me every single day. I’m excited to see what we’ll accomplish in 2025 and beyond. Let’s continue to build something extraordinary together. Here’s to a successful and impactful year ahead. Happy 2025!

2 Comments

Like Comment Share
ActiveFence

25,217 followers
4d Edited
Report this post
𝗔𝗜 𝗙𝗿𝗼𝗻𝘁𝗶𝗲𝗿: 𝗧𝗵𝗿𝗲𝗮𝘁𝘀 𝗼𝗳 𝗦𝘆𝗻𝘁𝗵𝗲𝘁𝗶𝗰 𝗩𝗶𝗱𝗲𝗼 Since the early days of AI video-generating models, threat actors have been watching closely, ready to exploit this transformative technology to fuel their harmful agendas. As these models are released, ActiveFence has been at the forefront, uncovering how these dangerous communities are gearing up to act. Here’s what we’ve found: 🔍 Child predators are developing technical solutions to bypass AI safety measures, attempting to manipulate this powerful tech for their own exploitation. 🔍 Terrorist organizations are testing AI-generated content to project strength, glorify violent acts, and amplify their propaganda. 🔍 Hate groups are animating hateful texts with emotive videos, increasing their reach and driving harmful narratives further than ever before. As the capabilities of AI grow, so does the urgency to protect platforms, users, and communities from these evolving threats. To Learn More 👉https://lnkd.in/d_N3sUhU
Like Comment Share
ActiveFence

25,217 followers
1w Edited
Report this post
🎄📚 𝗛𝗼𝗹𝗶𝗱𝗮𝘆 𝗥𝗲𝗮𝗱𝗶𝗻𝗴 𝗟𝗶𝘀𝘁: 𝗦𝘁𝗮𝘆 𝗔𝗵𝗲𝗮𝗱 𝗶𝗻 𝗧𝗿𝘂𝘀𝘁 & 𝗦𝗮𝗳𝗲𝘁𝘆 🎁✨ As the year comes to a close, it's the perfect time to reflect, recharge, and dive into some insightful reads. We’ve curated a #holiday #reading list packed with our most thought-provoking content from the past year. These resources are designed to inspire and prepare you for the challenges and opportunities of #2025. 𝗛𝗲𝗿𝗲’𝘀 𝘄𝗵𝗮𝘁’𝘀 𝗼𝗻 𝘁𝗵𝗲 𝗹𝗶𝘀𝘁: 1️⃣ 𝗧𝗵𝗲 𝗕𝘂𝘆𝗲𝗿’𝘀 𝗚𝘂𝗶𝗱𝗲 𝘁𝗼 𝗧𝗿𝘂𝘀𝘁 𝗮𝗻𝗱 𝗦𝗮𝗳𝗲𝘁𝘆 𝗧𝗼𝗼𝗹𝘀: help you decide between building a custom toolkit or buying an existing solution 👉https://lnkd.in/dDCw97_5 2️⃣ 𝗨𝗻𝗶𝗻𝘁𝗲𝗻𝗱𝗲𝗱 𝗖𝗼𝗻𝘀𝗲𝗾𝘂𝗲𝗻𝗰𝗲𝘀 𝗼𝗳 𝗗𝗲𝗽𝗹𝗼𝘆𝗶𝗻𝗴 𝗔𝗜 𝗠𝗼𝗱𝗲𝗹𝘀: Learn about deceptive behaviors in AI, and how to detect and prevent this risky behavior 👉 https://lnkd.in/dFs4tBKt 3️⃣ 𝗠𝗮𝘀𝘁𝗲𝗿𝗶𝗻𝗴 𝗚𝗲𝗻𝗔𝗜 𝗥𝗲𝗱 𝗧𝗲𝗮𝗺𝗶𝗻𝗴: Insights from the Frontlines, our most comprehensive research yet on AI red teaming 👉 https://lnkd.in/d4TvDDSv 4️⃣ 𝗔𝗰𝗰𝗲𝗹𝗲𝗿𝗮𝘁𝗶𝗻𝗴 𝗔𝗜 𝗺𝗼𝗱𝗲𝗹 𝘀𝗮𝗳𝗲𝘁𝘆 𝗿𝗲𝗹𝗲𝗮𝘀𝗲𝘀: Cohere, a leader in AI language technology, leverages ActiveFence’s Generative AI Safety solution👉 https://lnkd.in/dnuxBgxW 5️⃣ 𝗢𝘂𝗿 𝗻𝗲𝘄𝗲𝘀𝘁 #𝘄𝗲𝗯𝗶𝗻𝗮𝗿 𝗼𝗻 𝗧𝗵𝗲 𝗥𝗘𝗣𝗢𝗥𝗧 𝗔𝗰𝘁: How to Detect Child Trafficking on Your Platform 👉 https://lnkd.in/dhzKGN4S From all of us at ActiveFence, we wish you a safe and joyful holiday season. Here’s to another year of making the online world safer together! 🌟
Like Comment Share
ActiveFence reposted this
Noam Schwartz

CEO @ ActiveFence | UGC and Generative AI Alignment
1w
Report this post
Is it too late to get a branded holiday sweater? Happy new year and happy holidays 🙏
7 Comments

Like Comment Share
ActiveFence

25,217 followers
1w Edited
Report this post
𝗨𝗻𝗶𝗻𝘁𝗲𝗻𝗱𝗲𝗱 𝗖𝗼𝗻𝘀𝗲𝗾𝘂𝗲𝗻𝗰𝗲𝘀 𝗶𝗻 𝗗𝗲𝗽𝗹𝗼𝘆𝗶𝗻𝗴 𝗔𝗜 𝗠𝗼𝗱𝗲𝗹𝘀 𝘞𝘩𝘢𝘵 𝘚𝘤𝘩𝘦𝘮𝘪𝘯𝘨 𝘪𝘴 𝘢𝘯𝘥 𝘏𝘰𝘸 𝘵𝘰 𝘍𝘪𝘹 𝘪𝘵? Agentic AI systems are now capable of using deception to achieve their goals. As foundation models grow smarter, this raises critical questions about AI safety. Misaligned #AI behaviors can lead to harmful and unethical consequences, making it essential for organizations to adopt proactive #safety measures. One key solution? 🔺 AI Red Teaming 🔺 This approach involves monitoring models and detecting unwanted actions before deployment. To learn more about how AI Red Teaming helps mitigate risks and builds safer AI systems. Read Here 👉 https://lnkd.in/dFs4tBKt
Like Comment Share
ActiveFence reposted this
Noam Schwartz

CEO @ ActiveFence | UGC and Generative AI Alignment
1w
Report this post
Much of my time these days is spent (or rather, invested!) in advising our partners and customers on what the UK's Online Safety Act means for them. There’s a wealth of information available, but it can be overwhelming if you’re not accustomed to navigating content alignment and online safety issues regularly. Under the Act, fines of up to £18 million or 10% of qualifying worldwide revenue (whichever is greater) can be imposed on Online Service Providers that fail to understand their responsibilities and act accordingly. If you fall into one of the following categories: 💡 User-to-User Services (U2U Services) – Platforms that enable users to interact with one another, such as: - Social media platforms - Online forums - Video-sharing websites - Consumer cloud storage and file-sharing platforms - Dating apps - Instant messaging services 💡 Search Services – Services with search engine functionality, allowing users to search multiple websites or databases. The Act requires you to take measures to assess your exposure to, detect, and counter illegal content. This includes, but is not limited to: - Terrorism - Child Sexual Exploitation and Abuse (CSEA) offenses - Grooming - CSAM images - CSAM URLs - Hate - Harassment - Stalking, threats, and abuse - Intimate image abuse and sexual exploitation - Human trafficking - Fraud -Proceeds of crime - Animal cruelty - Self-harm - State-sponsored interference The first step is to complete comprehensive risk assessments for illegal content, identifying potential exposure to these issues and estimating their impact. The deadline for this critical task is March 2025. The first step, and the deadline is March 2025, is completing comprehensive risk assessments for illegal content, examining potential exposure to these topics, and estimating their impact. If this applies to you, make sure to visit https://lnkd.in/e_z89REQ and send it to your safety/compliance team. This should be taken very seriously. ActiveFence and I are here to answer any questions and to support your process, from assessment to implementation of the required safety measures to protect your platform, community, and business.
Like Comment Share
ActiveFence

25,217 followers
1w Edited
Report this post
🎉 𝟮𝟱𝗞 𝘀𝘁𝗿𝗼𝗻𝗴! 𝗧𝗵𝗮𝗻𝗸 𝘆𝗼𝘂 𝗳𝗼𝗿 𝗯𝗲𝗶𝗻𝗴 𝗽𝗮𝗿𝘁 𝗼𝗳 𝗼𝘂𝗿 𝗷𝗼𝘂𝗿𝗻𝗲𝘆 🎉 We’re thrilled to share that ActiveFence has hit a major milestone: 25,000 followers here on LinkedIn! From insightful research to cutting-edge solutions, we strive to lead the conversation around Trust and Safety- and it’s your engagement, support, and feedback that fuel our mission. 𝗘𝘃𝗲𝗿𝘆 𝗳𝗼𝗹𝗹𝗼𝘄 𝗿𝗲𝗽𝗿𝗲𝘀𝗲𝗻𝘁𝘀 𝗮 𝘀𝗵𝗮𝗿𝗲𝗱 𝗯𝗲𝗹𝗶𝗲𝗳 𝗶𝗻 𝗺𝗮𝗸𝗶𝗻𝗴 𝘁𝗵𝗲 𝗼𝗻𝗹𝗶𝗻𝗲 𝘄𝗼𝗿𝗹𝗱 𝘀𝗮𝗳𝗲𝗿. Here’s to more milestones ahead, 𝘁𝗵𝗮𝗻𝗸 𝘆𝗼𝘂 for being on this journey with us. 💙 #TrustandSafety #Community #OnlineSafety
Like Comment Share

Browse jobs

Funding

ActiveFence 2 total rounds

Last Round

Series B Aug 27, 2021

US$ 100.0M

Investors

Highland Europe CRV + 3 Other investors

See more info on crunchbase

ActiveFence

Software Development

Protect your users. Protect your platform.

About us

ActiveFence

Content Moderation Software

ActiveOS: The Operating System for Trust & Safety

Content Moderation Software

ActiveScore: Automated AI content detection fueled by intelligence

Content Moderation Software

Threat Intelligence: Tailor-Made Intelligence Solutions to Keep Ahead of Bad Actors

Threat Intelligence Platforms

Locations

Employees at ActiveFence

Dror Nahumi

Itai Brezis

Gali Kedar

Patrick Fitzpatrick MSc

Trust & Safety Researcher | Threat Investigator | Linguistic Analysis | Behavioural Scientist

Updates

Join now to see what you are missing

Similar pages

Spectrum Labs (An ActiveFence Company)

Cyabra

Wiz

Cognyte

Active Fence Company

monday.com

Paragon

Buildots

Palo Alto Networks

Check Point Software

Browse jobs

Analyst jobs

Human Resources Specialist jobs

Office Manager jobs

Data Analyst jobs

Student jobs

Content Specialist jobs

Engineer jobs

Project Manager jobs

Head jobs

Cyber Security Specialist jobs

Manager jobs

Business Development Specialist jobs

Security Professional jobs

Specialist jobs

Clinical Specialist jobs

Director jobs

Developer jobs

Talent Acquisition Specialist jobs

Quality Assurance Specialist jobs

Tester jobs

Funding