Close Menu
Versa AI hub
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

What's Hot

The most cost-effective AI model ever

March 4, 2026

Google’s industrial robot AI Play makes physical AI a priority

March 4, 2026

PRX Part 3 — Train a Text-to-Image Model in 24 Hours!

March 3, 2026
Facebook X (Twitter) Instagram
Versa AI hubVersa AI hub
Thursday, March 5
Facebook X (Twitter) Instagram
Login
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources
Versa AI hub
Home»Tools»Introduction to Gemini 2.5 Computer Usage Model
Tools

Introduction to Gemini 2.5 Computer Usage Model

versatileaiBy versatileaiFebruary 8, 2026No Comments2 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
#image_title
Share
Facebook Twitter LinkedIn Pinterest Email

Earlier this year, we said we would provide computer usage capabilities to developers through the Gemini API. Today we are releasing the Gemini 2.5 computer usage model. This is a new specialized model built on Gemini 2.5 Pro’s visual understanding and reasoning capabilities that power agents that can interact with the user interface (UI). Outperforms leading alternatives on multiple web and mobile control benchmarks, all with lower latency. Developers can access these capabilities through Google AI Studio and Vertex AI’s Gemini API.

Although AI models can interact with software through structured APIs, many digital tasks still require direct interaction with graphical user interfaces, such as filling out and submitting forms. To complete these tasks, agents must interact with web pages and applications like humans by clicking, typing, and scrolling. The ability to natively fill out forms, interact with interactive elements like dropdowns and filters, and operate behind a login is an important next step in building powerful general-purpose agents.

structure

The core functionality of the model is exposed through the new `computer_use` tool in the Gemini API and must be manipulated within a loop. Inputs to the tool are user requests, screenshots of the environment, and a history of recent actions. In the input, you can also specify whether to exclude the function from the full list of supported UI actions or specify additional custom functions to include.

author avatar
versatileai
See Full Bio
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleHow Google, social media, and AI are reshaping the open web — and what it means for independent websites – Azat TV
Next Article Business News | IIM Raipur launches AI-integrated advanced general management program as AI reshapes Indian business
versatileai

Related Posts

Tools

The most cost-effective AI model ever

March 4, 2026
Tools

Google’s industrial robot AI Play makes physical AI a priority

March 4, 2026
Tools

PRX Part 3 — Train a Text-to-Image Model in 24 Hours!

March 3, 2026
Add A Comment

Comments are closed.

Top Posts

Open Source DeepResearch – Unlocking Search Agents

February 7, 20259 Views

Improving the accuracy of multimodal search and visual document retrieval using the Llama Nemotron RAG model

January 7, 20267 Views

Google’s industrial robot AI Play makes physical AI a priority

March 4, 20264 Views
Stay In Touch
  • YouTube
  • TikTok
  • Twitter
  • Instagram
  • Threads
Latest Reviews

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Most Popular

Open Source DeepResearch – Unlocking Search Agents

February 7, 20259 Views

Improving the accuracy of multimodal search and visual document retrieval using the Llama Nemotron RAG model

January 7, 20267 Views

Google’s industrial robot AI Play makes physical AI a priority

March 4, 20264 Views
Don't Miss

The most cost-effective AI model ever

March 4, 2026

Google’s industrial robot AI Play makes physical AI a priority

March 4, 2026

PRX Part 3 — Train a Text-to-Image Model in 24 Hours!

March 3, 2026
Service Area
X (Twitter) Instagram YouTube TikTok Threads RSS
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
© 2026 Versa AI Hub. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?