Just Launched: Deci’s Gen AI Development Platform and Deci-Nano

Deci AI (Acquired by NVIDIA)

Deci enables deep learning to live up to its true potential by using AI to build better AI.

Published Mar 15, 2024

Our Gen AI Development Platform features a new series of proprietary, fine-tunable large language models (LLMs), an inference engine, and an AI cluster management solution.

Designed to balance quality, speed, and cost-effectiveness, our models are complemented by flexible deployment options. Customers can access them through our platform’s API or deploy them on their own infrastructure, whether in a Virtual Private Cloud (VPC) or directly within their data centers. Our goal is to give customers access to high-performance models along with the flexibility and control they need over data privacy, applications, and costs.

The first LLM in our series to be made available is Deci-Nano. Here are some highlights:

Deci-Nano exhibits advanced language and reasoning capabilities, making it ideal for a broad spectrum of applications, such as financial and legal analysis, copywriting assistance, chatbots, summarization, and brainstorming.
The model achieves superior scores on MT Bench compared to both Mistral-7b-instruct-v0.2 and Gemma-7b-it.
Deci-Nano is significantly faster than other models with similar capabilities, such as Mistral-7b-instruct-v0.2 and Google’s Gemma-7b-it, making it an excellent choice for real-time applications. When benchmarked on NVIDIA A100 GPUs, Deci-Nano generates 256 tokens end to end 38% faster than Mistral-7b-instruct-v0.2 and 33% faster than Gemma-7b-it.*
Deci-Nano offers the lowest price among the same group of models, at only $0.10 per 1M tokens.
Featuring an 8k context window, Deci-Nano was trained on a mix of proprietary and public datasets and was preference-tuned using DPO.
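
To put the pricing highlight in concrete terms, here is a minimal sketch that turns the quoted $0.10 per 1M tokens into a monthly cost estimate. The function name, the workload figures, and the assumption that the same rate applies to every token (input and output alike) are illustrative and not taken from the announcement.

```python
# Price quoted in the announcement: $0.10 per 1M tokens.
PRICE_PER_MILLION_TOKENS_USD = 0.10


def estimate_cost_usd(total_tokens: int,
                      price_per_million: float = PRICE_PER_MILLION_TOKENS_USD) -> float:
    """Return the estimated cost in USD for a given number of tokens.

    Assumption: one flat rate covers all tokens (input and output).
    """
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return total_tokens / 1_000_000 * price_per_million


# Hypothetical workload, for illustration only.
requests_per_day = 50_000
tokens_per_request = 1_000
days_per_month = 30

monthly_tokens = requests_per_day * tokens_per_request * days_per_month
monthly_cost = estimate_cost_usd(monthly_tokens)

print(f"Monthly tokens: {monthly_tokens:,}")
print(f"Estimated monthly cost: ${monthly_cost:,.2f}")
```

Under these assumed numbers, 1.5 billion tokens per month works out to about $150. Actual billing may differ if input and output tokens are priced separately.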

In sum, Deci-Nano provides the best balance of quality, speed, and price, making it optimal for production.

Deci-Nano embodies our production-oriented approach, which combines a dedication to model quality with a focus on efficiency and cost-effectiveness. Looking ahead, we plan to release additional LLMs that raise this standard further. To make our line of production-oriented LLMs available to as many businesses and developers as possible, we developed a platform that delivers on three critical fronts: high performance, exceptional control, and unmatched cost efficiency.

Read our blog to learn more.

You can also:

💻 Try Deci-Nano in our playground or get your API trial token > https://auth.deci.ai/oauth/account/sign-up
