Just Launched: Deci’s Gen AI Development Platform and Deci-Nano
Our Gen AI Development Platform features a new series of proprietary, fine-tunable large language models (LLMs), an inference engine, and an AI cluster management solution.
Designed to balance quality, speed, and cost-effectiveness, our models are complemented by flexible deployment options. Customers can access them through our platform’s API or opt for deployment on their own infrastructure, whether through a Virtual Private Cloud (VPC) or directly within their data centers. Our goal is to equip our customers with access to high performance models, offering the needed flexibility and control over data privacy, applications, and costs.
The first LLM in our series to be made available is Deci-Nano. Here are some highlights:
Deci-Nano exhibits advanced language and reasoning capabilities, making it ideal for a broad spectrum of applications, such as financial and legal analysis, copywriting assistance, chatbots, summarization, and brainstorming.
The model achieves superior scores on MT Bench compared to both Mistral-7b-instruct-v0.2 and Gemma-7b-it.
Deci-Nano is significantly faster than other models with similar capabilities, such as Mistral-7b-instruct-v0.2 and Google’s Gemma 7b-it, making it an excellent choice for real-time applications. When benchmarked on NVIDIA A100 GPUs, Deci-Nano’s end-to-end latency for generating 256 tokens is 38% faster than Mistral-7b-instruct-v0.2 and 33% faster than Gemma 7b-it.*
Deci-Nano provides the best price in comparison to the same group of models, at only $0.1 per 1M tokens.
Featuring an 8k context window, Deci-Nano was trained on a mix of proprietary and public datasets and was preference-tuned using DPO.
In sum, Deci-Nano provides the best balance of quality, speed, and price, making it optimal for production.
Deci-Nano embodies our production-oriented approach which includes a dedication not only to model quality but also to efficiency and cost-effectiveness. Looking ahead, we anticipate the release of additional LLMs that promise to elevate this standard of excellence even higher. To make our line of production-oriented LLMs available to as many businesses and developers as possible, we developed a platform that delivers on three critical fronts: high performance, exceptional control, and unmatched cost efficiency.
Read our blog to learn more.
You can also:
💻 Try Deci-Nano in our playground or get your API trial token > https://auth.deci.ai/oauth/account/sign-up
📘 Check out the QuickStart notebook > bit.ly/Deci-Nanonotebook