Bruce Burke’s Post

OctoAI (formerly known as OctoML) today announced the launch of OctoStack, its new end-to-end solution for deploying generative AI models in a company’s private cloud, be that on-premises or in a virtual private cloud from one of the major vendors, including AWS, Google and Microsoft Azure, as well as CoreWeave, Lambda Labs, Snowflake and others.

In its early days, OctoAI focused almost exclusively on optimizing models to run more effectively. Based on the Apache TVM machine learning compiler framework, the company then launched its TVM-as-a-Service platform and, over time, expanded that into a fully fledged model-serving offering that combined its optimization chops with a DevOps platform. With the rise of generative AI, the team then launched the fully managed OctoAI platform to help its users serve and fine-tune existing models. OctoStack, at its core, is that OctoAI platform, but for private deployments.

Today, OctoAI CEO and co-founder Luis Ceze told me, the company has over 25,000 developers on the platform and hundreds of paying customers in production. A lot of these companies, Ceze said, are GenAI-native companies. The market of traditional enterprises wanting to adopt generative AI is significantly larger, though, so it’s maybe …

OctoAI wants to make private AI model deployments easier with OctoStack | TechCrunch

https://techcrunch.com
