Thank you to the organizers of the Trust & Safety Hackathon for another wonderful event! Our team was tasked with the prompt “How can Safety by Design evolve to cover emerging threats and tech like AI?”. After reviewing current SbD resources, we wanted to address the lack of specific safety guidance at the earliest stages of LLM development, which led to our proposed T&S evaluation framework below. We believe that if a stakeholder hub of leading investors and tech companies committed to adopting this framework during their due diligence and LLM development processes, it would serve as a mechanism to mitigate T&S risks before new models are deployed. Thank you to the judges for selecting this as a top idea. Now, which responsible tech stakeholder org wants to champion this? 😁 The role of T&S is expanding in our AI-enabled future. There’s growing investment in startups developing innovative T&S solutions to help developers, enterprises, and consumers engage with AI safely. There are also new impact investments that can reinforce safety priorities, such as philanthropic foundations recently purchasing shares of Anthropic. And after listening to the creative solutions presented at the hackathon, I’m encouraged by the passion and talent dedicated to solving future T&S challenges across disciplines. 🌟
Ex Google TrustSafety | Sampling Strategies | Metrics Design | Automation & ML Evaluation Workflow Design | Stakeholder Management
Story of the Week: From Hackathon Success to Personal Impact! 🚀 The first two days of this week were nothing short of exhilarating as I immersed myself in my first Trust & Safety Hackathon. Proud to be a part of Team 23 (Vivian Chong, Marcelo Davila Gonzalez, Chandana M.S.), we put our heads together and secured 3rd place. Our idea on Standardizing Trust and Safety Evaluation for Large Language Models was a standout, earning us some rave reviews from Jonathan Bellack. Thank you so much for the reviews! The best part of the hackathon wasn't just the success, but the incredible ideas and collaborative spirit that filled the room. The discussions and ideas exchanged during the hackathon resonated deeply with me, reinforcing my respect and passion for the role of trust and safety in our lives. Working in the trust and safety domain has shown me the profound impact our work can have on people's lives. From protecting user information to combating identity fraud and payment fraud, every effort in this field contributes to a safer and more secure online environment for all. For anyone interested in the space, have a look at this repo: https://lnkd.in/g7eWcMwi based on Andrew Ng's course.
Healthy Digital Spaces | Yale MBA
Examples of T&S evaluation metrics (taken from Andrew Ng's course): https://github.com/praveenhosdrug123/Quality_and_Safety_for_LLM_Applications
Rising investment in T&S and AI governance startups: https://ducoexperts.com/tsreport, https://www.public.io/report-post/the-international-state-of-safety-tech-2023
Impact investment to promote responsible innovation: https://impactalpha.com/impact-investments-in-anthropic/