Close Menu
Versa AI hub
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

What's Hot

Baidu ERNIE multimodal AI outperforms GPT and Gemini on benchmarks

November 12, 2025

EU plans to relax AI laws in response to technology backlash

November 12, 2025

Towards encrypted large-scale language models with FHE

November 12, 2025
Facebook X (Twitter) Instagram
Versa AI hubVersa AI hub
Wednesday, November 12
Facebook X (Twitter) Instagram
Login
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources
Versa AI hub
Home»Tools»Introducing the Gemini 2.5 computer usage model
Tools

Introducing the Gemini 2.5 computer usage model

versatileaiBy versatileaiOctober 8, 2025No Comments2 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
#image_title
Share
Facebook Twitter LinkedIn Pinterest Email

Earlier this year, he said it was providing computer usage capabilities to developers through the Gemini API. Today we are releasing a Gemini 2.5 computer-use model. This enhances the agent that can interact with the user interface (UIS) with a new specialized model built on the visual understanding and inference capabilities of Gemini 2.5 Pro. It is better than the major alternatives on multiple web and mobile control benchmarks, all with delays. Developers can access these features through Google AI Studio and Vertex AI’s Gemini API.

AI models can interface with software via structured APIs, but many digital tasks require direct interaction with the graphical user interface, such as filling and submitting forms. To complete these tasks, agents must navigate web pages and applications, as humans do. By clicking, typing, scrolling. Fill in the form natively, manipulate interactive elements such as dropdowns and filters, and the functionality behind login is an important next step in building a powerful, generic agent.

How it works

The core functionality of the model is exposed through the new `Computer_use` tool in the Gemini API and must be manipulated within a loop. Inputs to the tool are user requests, screenshots of the environment, and history of recent actions. The input can also specify whether to exclude functions from the complete list of supported UI actions or to specify additional custom functions to include.

author avatar
versatileai
See Full Bio
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleHow AI will change the way we travel
Next Article 3D Gaussian Splatting Overview
versatileai

Related Posts

Tools

Baidu ERNIE multimodal AI outperforms GPT and Gemini on benchmarks

November 12, 2025
Tools

Towards encrypted large-scale language models with FHE

November 12, 2025
Tools

How Moonshot AI beat GPT-5 and Claude at a fraction of the cost

November 11, 2025
Add A Comment

Comments are closed.

Top Posts

Latamdate addresses the rising risk of AI with online romance and strengthens its commitment to security

March 12, 20255 Views

AI Security 2025: Why you need to build data protection is why it’s not bolted

March 12, 20255 Views

New bill introduced in the Senate would require US companies to report AI layoffs

November 6, 20254 Views
Stay In Touch
  • YouTube
  • TikTok
  • Twitter
  • Instagram
  • Threads
Latest Reviews

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Most Popular

Latamdate addresses the rising risk of AI with online romance and strengthens its commitment to security

March 12, 20255 Views

AI Security 2025: Why you need to build data protection is why it’s not bolted

March 12, 20255 Views

New bill introduced in the Senate would require US companies to report AI layoffs

November 6, 20254 Views
Don't Miss

Baidu ERNIE multimodal AI outperforms GPT and Gemini on benchmarks

November 12, 2025

EU plans to relax AI laws in response to technology backlash

November 12, 2025

Towards encrypted large-scale language models with FHE

November 12, 2025
Service Area
X (Twitter) Instagram YouTube TikTok Threads RSS
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
© 2025 Versa AI Hub. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?