Google DeepMind Introduces Agentic Vision to Gemini 3 Flash

Share This Post

Google DeepMind added this week agentic vision capabilities to its Gemini 3 Flash model, turning image analysis an active rather than passive task.

While typical multimodal models process images in a single “glance,” by introducing agentic capabilities, Google allows its model to actively study a picture and home in on specific details, such as street signs or a serial number on a microchip. 

The new feature works by generating and running Python code that zooms, manipulates and inspects images methodically.

“By combining visual reasoning with code execution, one of the first tools supported by Agentic Vision, the model formulates plans to zoom in, inspect and manipulate images step-by-step, grounding answers in visual evidence,” Rohan Doshi, product manager at Google DeepMind, wrote in a blog post about the announcement.

The feature uses a Think-Act-Observe loop, whereby Gemini 3 Flash will study a user query and image and formulate a plan, use Python code to actively conduct an image analysis, and then inspect the results before generating its final response. 

According to Google, the update saw a quality improvement of between 5% to 10% across vision benchmarks.

A range of new agentic behaviors have, Google said, already been demonstrated from the update via Google AI Studio, such as iterative zooming, direct image annotation and visual plotting. The latter is said to reduce hallucinations ­— a common problem with visual math tasks. 

Looking ahead, the company said it plans to add more implicit code-driven behaviors into the model, meaning certain capabilities that currently require a specific prompt will become an autonomous feature.

More features, such as web and reverse image search, as well as a greater range of model sizes, are also expected to be rolled out in the future. 

Related Posts

Strive stacks more bitcoin as ASST surges 133% in three months

Strive (ASST) purchased 1,109 bitcoin at an average price...

Trump praises prediction markets, defends CFTC as court cases compound

U.S. President Donald Trump said it was "critically important"...

bitcoin on sidelines as markets surge on Iran peace hopes

A big block sale of BlackRock's bitcoin exchange-traded IBIT...

The Clarity Act won’t lead to adoption without crypto tax reform

A growing number of people see the Clarity Act,...

Cardano Treasury Vote Ratifies Developer Experience Initiative With 67.9% Support

Cardano's IO-backed Developer Experience Initiative has been approved through...

Base Launches MCP Agent Gateway for Onchain Portfolio Management

Base introduces Model Context Protocol integration enabling AI agents...