ActiveFence

ActiveFence

Software Development

Protect your users. Protect your platform.

About us

ActiveFence protects online platforms and their users from the widest spectrum of harms and abuses with one complete Trust and Safety solution. From efficient operations to scaled detection, in-depth investigations, and the latest generative AI safety measures, we enable Trust and Safety at scale, empowering platforms to protect their communities and thrive in a rapidly evolving digital landscape.

Website
https://www.activefence.com/
Industry
Software Development
Company size
201-500 employees
Headquarters
New York
Type
Privately Held
Founded
2018

Locations

Employees at ActiveFence

Updates

  • 𝗨𝗱𝗲𝗺𝘆 – 𝗘𝗺𝗽𝗼𝘄𝗲𝗿𝗶𝗻𝗴 𝗦𝗸𝗶𝗹𝗹𝘀 𝗳𝗼𝗿 𝘁𝗵𝗲 𝗙𝘂𝘁𝘂𝗿𝗲 Udemy, a leader in online learning, needed to maintain course quality and keep up with rising customer expectations. They partnered with ActiveFence to create a scalable content management solution. 𝗧𝗵𝗲 𝗿𝗲𝘀𝘂𝗹𝘁𝘀? ✅ Increased customer trust ✅ Fewer escalations ✅ And improved content quality ratings Find out how Udemy is enhancing the learning experience for millions of users with proactive content moderation 👉 https://lnkd.in/d2ZtAzF2

  • ActiveFence reposted this

    View profile for Noam Schwartz, graphic

    CEO @ ActiveFence | UGC and Generative AI Alignment

    🤖 Can AI fake being good? Anthropic’s research dives into the fascinating (and a little chilling) concept of 'alignment faking' in LLMs. It turns out, AI can strategically act aligned with training objectives while secretly planning to behave differently when unmonitored. This raises big questions about trust and transparency in AI systems. Read on to explore how AI might not just follow our rules, but play by them - until it doesn’t.

    The Sneaky Brain of AI: How Alignment Faking Works

    The Sneaky Brain of AI: How Alignment Faking Works

    Noam Schwartz on LinkedIn

  • View organization page for ActiveFence, graphic

    25,217 followers

    🤝 𝗖𝗹𝗼𝘀𝗶𝗻𝗴 𝗢𝘂𝘁 𝟮𝟬𝟮𝟰: 𝗣𝗿𝗲𝗽𝗮𝗿𝗶𝗻𝗴 𝗳𝗼𝗿 𝟮𝟬𝟮𝟱 🤝 Trust & Safety teams have faced an incredibly challenging year. From the democratization of harmful content creation driven by #GenerativeAI to navigating major elections and global conflicts, the landscape has shifted dramatically. These challenges unfolded under heightened scrutiny and with leaner teams, demanding both resilience and innovation. Looking ahead to 2️⃣ 0️⃣ 2️⃣ 5️⃣ , the stakes are higher than ever. We’re here to help. Our commitment is to empower platforms with the tools, insights, and expertise needed to face these challenges. 🥂 𝗛𝗲𝗿𝗲’𝘀 𝘁𝗼 𝗮 𝘀𝗮𝗳𝗲𝗿 𝟮𝟬𝟮𝟱. 𝗖𝗼𝗻𝘁𝗮𝗰𝘁 𝘂𝘀 𝘁𝗼 𝗹𝗲𝗮𝗿𝗻 𝗺𝗼𝗿𝗲 🥂 https://lnkd.in/d5fBzTBr

  • ActiveFence reposted this

    View profile for Noam Schwartz, graphic

    CEO @ ActiveFence | UGC and Generative AI Alignment

    Minutes before 2025, I want to wish all my friends, partners, customers and the incredible team at ActiveFence a year of achievements that go beyond our most ambitious milestones. May we continue to push boundaries, innovate, and deliver on the promises we’ve made to ourselves and the world. The journey into the trust and safety space, had a profound impact on my life, shaping not just who I am today but also the one I aspire to be in the years to come. At its core, trust and safety work is about safeguarding people, fostering healthy communities, and ensuring that technology serves humanity responsibly. For me, and for all of us at ActiveFence achieving our milestones isn’t just about hitting numbers or delivering a product. It’s about protecting people, keeping harmful content at bay, ensuring alignment in generative AI, and empowering our customers to make the digital world safer and better. Every goal met is a step toward a healthier, safer, and more open online environment, and for that, I’m endlessly grateful. To my incredible team, thank you for your resilience, creativity, and commitment. You are a lighthouse of innovation and an inspiration. Your ability to tackle complex challenges and work together toward a shared vision, no matter what, is motivating me every single day. I’m excited to see what we’ll accomplish in 2025 and beyond. Let’s continue to build something extraordinary together. Here’s to a successful and impactful year ahead. Happy 2025!

  • View organization page for ActiveFence, graphic

    25,217 followers

    𝗔𝗜 𝗙𝗿𝗼𝗻𝘁𝗶𝗲𝗿: 𝗧𝗵𝗿𝗲𝗮𝘁𝘀 𝗼𝗳 𝗦𝘆𝗻𝘁𝗵𝗲𝘁𝗶𝗰 𝗩𝗶𝗱𝗲𝗼 Since the early days of AI video-generating models, threat actors have been watching closely, ready to exploit this transformative technology to fuel their harmful agendas. As these models are released, ActiveFence has been at the forefront, uncovering how these dangerous communities are gearing up to act. Here’s what we’ve found: 🔍 Child predators are developing technical solutions to bypass AI safety measures, attempting to manipulate this powerful tech for their own exploitation. 🔍 Terrorist organizations are testing AI-generated content to project strength, glorify violent acts, and amplify their propaganda. 🔍 Hate groups are animating hateful texts with emotive videos, increasing their reach and driving harmful narratives further than ever before. As the capabilities of AI grow, so does the urgency to protect platforms, users, and communities from these evolving threats. To Learn More 👉https://lnkd.in/d_N3sUhU

    • No alternative text description for this image
  • View organization page for ActiveFence, graphic

    25,217 followers

    🎄📚 𝗛𝗼𝗹𝗶𝗱𝗮𝘆 𝗥𝗲𝗮𝗱𝗶𝗻𝗴 𝗟𝗶𝘀𝘁: 𝗦𝘁𝗮𝘆 𝗔𝗵𝗲𝗮𝗱 𝗶𝗻 𝗧𝗿𝘂𝘀𝘁 & 𝗦𝗮𝗳𝗲𝘁𝘆 🎁✨ As the year comes to a close, it's the perfect time to reflect, recharge, and dive into some insightful reads. We’ve curated a #holiday #reading list packed with our most thought-provoking content from the past year. These resources are designed to inspire and prepare you for the challenges and opportunities of #2025. 𝗛𝗲𝗿𝗲’𝘀 𝘄𝗵𝗮𝘁’𝘀 𝗼𝗻 𝘁𝗵𝗲 𝗹𝗶𝘀𝘁: 1️⃣ 𝗧𝗵𝗲 𝗕𝘂𝘆𝗲𝗿’𝘀 𝗚𝘂𝗶𝗱𝗲 𝘁𝗼 𝗧𝗿𝘂𝘀𝘁 𝗮𝗻𝗱 𝗦𝗮𝗳𝗲𝘁𝘆 𝗧𝗼𝗼𝗹𝘀: help you decide between building a custom toolkit or buying an existing solution 👉https://lnkd.in/dDCw97_5 2️⃣ 𝗨𝗻𝗶𝗻𝘁𝗲𝗻𝗱𝗲𝗱 𝗖𝗼𝗻𝘀𝗲𝗾𝘂𝗲𝗻𝗰𝗲𝘀 𝗼𝗳 𝗗𝗲𝗽𝗹𝗼𝘆𝗶𝗻𝗴 𝗔𝗜 𝗠𝗼𝗱𝗲𝗹𝘀: Learn about deceptive behaviors in AI, and how to detect and prevent this risky behavior 👉 https://lnkd.in/dFs4tBKt 3️⃣ 𝗠𝗮𝘀𝘁𝗲𝗿𝗶𝗻𝗴 𝗚𝗲𝗻𝗔𝗜 𝗥𝗲𝗱 𝗧𝗲𝗮𝗺𝗶𝗻𝗴: Insights from the Frontlines, our most comprehensive research yet on AI red teaming 👉 https://lnkd.in/d4TvDDSv 4️⃣ 𝗔𝗰𝗰𝗲𝗹𝗲𝗿𝗮𝘁𝗶𝗻𝗴 𝗔𝗜 𝗺𝗼𝗱𝗲𝗹 𝘀𝗮𝗳𝗲𝘁𝘆 𝗿𝗲𝗹𝗲𝗮𝘀𝗲𝘀: Cohere, a leader in AI language technology, leverages ActiveFence’s Generative AI Safety solution👉 https://lnkd.in/dnuxBgxW 5️⃣ 𝗢𝘂𝗿 𝗻𝗲𝘄𝗲𝘀𝘁 #𝘄𝗲𝗯𝗶𝗻𝗮𝗿 𝗼𝗻 𝗧𝗵𝗲 𝗥𝗘𝗣𝗢𝗥𝗧 𝗔𝗰𝘁: How to Detect Child Trafficking on Your Platform 👉 https://lnkd.in/dhzKGN4S From all of us at ActiveFence, we wish you a safe and joyful holiday season. Here’s to another year of making the online world safer together! 🌟

    • No alternative text description for this image
  • View organization page for ActiveFence, graphic

    25,217 followers

    𝗨𝗻𝗶𝗻𝘁𝗲𝗻𝗱𝗲𝗱 𝗖𝗼𝗻𝘀𝗲𝗾𝘂𝗲𝗻𝗰𝗲𝘀 𝗶𝗻 𝗗𝗲𝗽𝗹𝗼𝘆𝗶𝗻𝗴 𝗔𝗜 𝗠𝗼𝗱𝗲𝗹𝘀 𝘞𝘩𝘢𝘵 𝘚𝘤𝘩𝘦𝘮𝘪𝘯𝘨 𝘪𝘴 𝘢𝘯𝘥 𝘏𝘰𝘸 𝘵𝘰 𝘍𝘪𝘹 𝘪𝘵? Agentic AI systems are now capable of using deception to achieve their goals. As foundation models grow smarter, this raises critical questions about AI safety. Misaligned #AI behaviors can lead to harmful and unethical consequences, making it essential for organizations to adopt proactive #safety measures. One key solution? 🔺 AI Red Teaming 🔺 This approach involves monitoring models and detecting unwanted actions before deployment. To learn more about how AI Red Teaming helps mitigate risks and builds safer AI systems. Read Here 👉 https://lnkd.in/dFs4tBKt

    • No alternative text description for this image
  • ActiveFence reposted this

    View profile for Noam Schwartz, graphic

    CEO @ ActiveFence | UGC and Generative AI Alignment

    Much of my time these days is spent (or rather, invested!) in advising our partners and customers on what the UK's Online Safety Act means for them. There’s a wealth of information available, but it can be overwhelming if you’re not accustomed to navigating content alignment and online safety issues regularly. Under the Act, fines of up to £18 million or 10% of qualifying worldwide revenue (whichever is greater) can be imposed on Online Service Providers that fail to understand their responsibilities and act accordingly. If you fall into one of the following categories: 💡 User-to-User Services (U2U Services) – Platforms that enable users to interact with one another, such as: - Social media platforms - Online forums - Video-sharing websites - Consumer cloud storage and file-sharing platforms - Dating apps - Instant messaging services 💡 Search Services – Services with search engine functionality, allowing users to search multiple websites or databases. The Act requires you to take measures to assess your exposure to, detect, and counter illegal content. This includes, but is not limited to: - Terrorism - Child Sexual Exploitation and Abuse (CSEA) offenses - Grooming - CSAM images - CSAM URLs - Hate - Harassment - Stalking, threats, and abuse - Intimate image abuse and sexual exploitation - Human trafficking - Fraud -Proceeds of crime - Animal cruelty - Self-harm - State-sponsored interference The first step is to complete comprehensive risk assessments for illegal content, identifying potential exposure to these issues and estimating their impact. The deadline for this critical task is March 2025. The first step, and the deadline is March 2025, is completing comprehensive risk assessments for illegal content, examining potential exposure to these topics, and estimating their impact. If this applies to you, make sure to visit https://lnkd.in/e_z89REQ and send it to your safety/compliance team. This should be taken very seriously. ActiveFence and I are here to answer any questions and to support your process, from assessment to implementation of the required safety measures to protect your platform, community, and business.

    • No alternative text description for this image
  • View organization page for ActiveFence, graphic

    25,217 followers

    🎉 𝟮𝟱𝗞 𝘀𝘁𝗿𝗼𝗻𝗴! 𝗧𝗵𝗮𝗻𝗸 𝘆𝗼𝘂 𝗳𝗼𝗿 𝗯𝗲𝗶𝗻𝗴 𝗽𝗮𝗿𝘁 𝗼𝗳 𝗼𝘂𝗿 𝗷𝗼𝘂𝗿𝗻𝗲𝘆 🎉 We’re thrilled to share that ActiveFence has hit a major milestone: 25,000 followers here on LinkedIn! From insightful research to cutting-edge solutions, we strive to lead the conversation around Trust and Safety- and it’s your engagement, support, and feedback that fuel our mission. 𝗘𝘃𝗲𝗿𝘆 𝗳𝗼𝗹𝗹𝗼𝘄 𝗿𝗲𝗽𝗿𝗲𝘀𝗲𝗻𝘁𝘀 𝗮 𝘀𝗵𝗮𝗿𝗲𝗱 𝗯𝗲𝗹𝗶𝗲𝗳 𝗶𝗻 𝗺𝗮𝗸𝗶𝗻𝗴 𝘁𝗵𝗲 𝗼𝗻𝗹𝗶𝗻𝗲 𝘄𝗼𝗿𝗹𝗱 𝘀𝗮𝗳𝗲𝗿. Here’s to more milestones ahead, 𝘁𝗵𝗮𝗻𝗸 𝘆𝗼𝘂 for being on this journey with us. 💙 #TrustandSafety #Community #OnlineSafety

    • No alternative text description for this image

Similar pages

Browse jobs

Funding