Where Tech Meets Bio (Substack Newsletter) reposted this
Looking forward to the report by Andrii Buvailo, Ph.D. . But I do not necessarily agree that data is most important. The ability to rapidly validate is most important.Second most important success factor is community feedback and validation. Your tools need to work in the hands of the other companies. And with 22 PCCs and 10 clinical-stage assets in 4 years, I can make certain claims from time to time. We now have over $3 Trillion dollars worth of data at Insilico (based on grants we monitor), probably the biggest data repository in the world. SE of this data is no longer available publicly so if you were not collecting it in 2015, you don't have it. And we have a robolab producing data. 24/7 - also one of the biggest repositories of data. And the most valuable data is the data from 40+ programs connecting Insilico, invitro, preclinical and clinical experiments. But even though we are probably the king of data, I still believe that the algorithm + ability to test quickly and in the right models is the main reason for our success to date.
My entire view of AI in drug discovery (for now) is this: we already have sufficiently capable algo-s to improve drug discovery. But... ... we do not have enough domain specific data to reach that. So, the race now is not for novel super AI, but for data acquisition capabilities (via building internal infrastructure for doing so, partnering, etc). It is costly. Hence, huge AI in DD company valuations. The 2024 and 2025 AI in drug discovery race is basically not AI race, it is data race. To be the first "OpenAI" of biotech. No single company on the market is still there.... but some are closer than others. If you want to know which are closer, stay tuned for the upcoming report next week (may be later, but hopefully it goes out before JP Morgan Health anyway). Image credit: myself (created it before gen AI era if anything...)