Google releases its own 'reasoning' AI model

Google has released what it’s calling a new “reasoning” AI model — but it’s in the experimental stages, and from our brief testing, there’s certainly room for improvement.

The new model, called Gemini 2.0 Flash Thinking Experimental (a mouthful, to be sure), is available in AI Studio, Google’s AI prototyping platform. A model card describes it as “best for multimodal understanding, reasoning, and coding,” with the ability to “reason over the most complex problems” in fields such as programming, math, and physics.

In a post on X, Logan Kilpatrick, who leads product for AI Studio, called Gemini 2.0 Flash Thinking Experimental “the first step in [Google’s] reasoning journey.” Jeff Dean, chief scientist for Google DeepMind, Google’s AI research division, said in his own post that Gemini 2.0 Flash Thinking Experimental is “trained to use thoughts to strengthen its reasoning.”

“We see promising results when we increase inference time computation,” Dean said, referring to the amount of computing used to “run” the model as it considers a question.

It’s still an early version, but check out how the model handles a challenging puzzle involving both visual and textual clues: (2/3) pic.twitter.com/JltHeK7Fo7

— Logan Kilpatrick (@OfficialLoganK) December 19, 2024

Built on Google’s recently announced Gemini 2.0 Flash model, Gemini 2.0 Flash Thinking Experimental appears to be similar in design to OpenAI’s o1 and other so-called reasoning models. Unlike most AI, reasoning models effectively fact-check themselves, which helps them avoid some of the pitfalls that normally trip up AI models.

As a drawback, reasoning models often take longer — usually seconds to minutes longer — to arrive at solutions.

Given a prompt, Gemini 2.0 Flash Thinking Experimental pauses before responding, considering a number of related prompts and “explaining” its reasoning along the way. After a while, the model summarizes what it considers to be the most accurate answer.

Well — that’s what’s supposed to happen. When I asked Gemini 2.0 Flash Thinking Experimental how many R’s were in the word “strawberry,” it said “two.”

Google reasoning model — Google’s new reasoning model struggles with counting the letters in words, SOMETIMES.Image Credits:Google

Your mileage may vary.

In the wake of the release of o1, there’s been an explosion of reasoning models from rival AI labs — not just Google. In early November, DeepSeek, an AI research company funded by quant traders, launched a preview of its first reasoning model, DeepSeek-R1. That same month, Alibaba’s Qwen team unveiled what it claimed was the first “open” challenger to o1.

Bloomberg reported in October that Google had several teams developing reasoning models. Subsequent reporting by The Information in November revealed that the company has at least 200 researchers focusing on the technology.

What opened the reasoning model floodgates? Well, for one, the search for novel approaches to refine generative AI. As my colleague Max Zeff recently reported, “brute force” techniques to scale up models are no longer yielding the improvements they once did.

Not everyone’s convinced that reasoning models are the best path forward. They tend to be expensive, for one, thanks to the large amount of computing power required to run them. And while they’ve performed well on benchmarks so far, it’s not clear whether reasoning models can maintain this rate of progress.

TechCrunch has an AI-focused newsletter! Sign up here to get it in your inbox every Wednesday.

Topics

AI, gemini, gemini 2.0, gemini 2.0 flash, gemini 2.0 flash thinking experimental, Generative AI, Google, google deepmind, reasoning, reasoning model

Kyle Wiggers

Senior Reporter, Enterprise

Kyle Wiggers is a senior reporter at TechCrunch with a special interest in artificial intelligence. His writing has appeared in VentureBeat and Digital Trends, as well as a range of gadget blogs including Android Police, Android Authority, Droid-Life, and XDA-Developers. He lives in Brooklyn with his partner, a piano educator, and dabbles in piano himself. occasionally — if mostly unsuccessfully.

View Bio

In Brief

Palantir and Anduril reportedly building a tech consortium to bid on defense contracts
Anthony Ha

33 mins ago
AI

OpenAI trained o1 and o3 to ‘think’ about its safety policy
Maxwell Zeff

4 hours ago
Startups

Y Combinator alum Nowadays, founded by sisters, raises $2M to automate event planning
Julie Bort

Dec 6, 2024

Latest in AI

AI

Sriram Krishnan named Trump’s senior policy advisor for AI
Kyle Wiggers

9 mins ago
In Brief

Palantir and Anduril reportedly building a tech consortium to bid on defense contracts
Anthony Ha

33 mins ago
AI

OpenAI trained o1 and o3 to ‘think’ about its safety policy
Maxwell Zeff

4 hours ago

Topics

More from TechCrunch

Google releases its own ‘reasoning’ AI model

Apple might be working on a smart doorbell

OpenAI’s GPT-5 reportedly falling short of expectations

EV startup Canoo places remaining employees on a ‘mandatory unpaid break’

After causing outrage on the first day of Y Combinator, AI code editor PearAI lands $1M seed

OpenAI announces new o3 models

Ransomware attack on health giant Ascension hits 5.6 million patients

Related

Palantir and Anduril reportedly building a tech consortium to bid on defense contracts

OpenAI trained o1 and o3 to ‘think’ about its safety policy

Y Combinator alum Nowadays, founded by sisters, raises $2M to automate event planning

Latest in AI

Sriram Krishnan named Trump’s senior policy advisor for AI

Palantir and Anduril reportedly building a tech consortium to bid on defense contracts

OpenAI trained o1 and o3 to ‘think’ about its safety policy

Topics

More from TechCrunch

Google releases its own ‘reasoning’ AI model

Most Popular

Newsletters

TechCrunch Daily News

TechCrunch AI

TechCrunch Space

Startups Weekly

Related

Latest in AI