We're in the news! Read about us in Breakit (Swedish): https://lnkd.in/dwBZ6G6A
About us
Custom capability evaluations for foundation models and LLM-agents to benchmark safety, risk and performance
- Website: https://vectorview.ai/
- Industry: Information Services
- Company size: 2-10 employees
- Type: Privately Held
Updates
-
Andon Labs (YC w24) reposted this
🚀 Vectorview launched! Custom capability evaluations for foundation models and LLM-agents to benchmark safety, risk, and performance.
"Evaluating the capabilities of AI 🤖"
🌐 www.vectorview.ai
⭐ Makes it easy to evaluate the capabilities of foundation models and LLM agents.
📊 Offers a suite of custom evaluation tools designed to benchmark AI applications against specific, real-world scenarios they are likely to encounter.
🎯 Targeted approach ensures that AI behaves as intended, mitigating the risk of unintended behaviors.
🗓 Book a demo here! 👉 https://lnkd.in/dNS2nW8e
Congrats on the launch, Emil Fröberg & Lukas Petersson! https://lnkd.in/dbW8V7-N
The Journal by Fondo | Vectorview launches: evaluating the capabilities of AI 🤖
tryfondo.com
-
We’re hiring for multiple engineering positions, including a Founding Engineer. Come join us and work on some of the most exciting problems in AI 🤖 Read more here ⬇️
Work at Vectorview | Open Roles
thirty-seven.notion.site
-
Andon Labs (YC w24) reposted this
Just delivered the Y Combinator Demo Day pitch for Vectorview, and I'm super happy that we were featured in TechCrunch as a staff favorite! This is the behind-the-scenes view; it actually looks much better from the opposite angle. TechCrunch article: https://lnkd.in/d6K5Bazn
-
Andon Labs (YC w24) reposted this
Sometimes, LLMs act in ways we didn't intend. To solve this problem, Vectorview (YC W24) is providing custom evaluation tasks for AI.
It’s difficult to prevent unwanted behaviors in LLMs due to their non-deterministic nature. Testing them against every possible scenario is hard, making it tough to catch all unintended behaviors. Additionally, most evaluation benchmarks are too general, missing the specific issues that can arise in real-world use.
Vectorview’s platform offers a suite of custom evaluation tools designed to benchmark AI applications against specific, real-world scenarios they are likely to encounter. This targeted approach ensures that AI behaves as intended, mitigating the risk of unintended behaviors that generic benchmarks often miss.
The founders, Emil Fröberg and Lukas Petersson, believe that enabling access to custom evaluations at scale is the way to realize the full potential of AI.
Congrats to the team on the launch!
Launch YC: Vectorview: Evaluating the capabilities of AI 🤖 | Y Combinator
ycombinator.com
-
Andon Labs (YC w24) reposted this
AI: Enormous potential with even higher risks? Klarna recently announced that it has replaced 700 full-time employees with AI, which showcases the huge potential of this technology. However, we have also seen major failures at companies that rolled out AI features without fully safety-testing them. For example, a chatbot's false promises landed one company in court. Not sure about the safety of your AI system? We would love to help!
-
We're happy to have been named one of the top generative AI startups of 2024 by Y Combinator 🚀
-
We ran 1400 RAG experiments so you don't have to! Read our blog (👇) to find out how you can improve your RAG pipeline. https://lnkd.in/dBCQnVTv #RAG #llm #ai #llamaindex
optimizing-rag
vectorview.ai