Close Menu
Versa AI hub
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

What's Hot

California Building aims to protect children from AI chatbots

July 12, 2025

AI is rewriting the rules of the insurance industry

July 12, 2025

Data and AI Status: Security and Privacy

July 12, 2025
Facebook X (Twitter) Instagram
Versa AI hubVersa AI hub
Sunday, July 13
Facebook X (Twitter) Instagram
Login
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
Versa AI hub
Home»Tools»Gemini 2.5 update from Google Deepmind
Tools

Gemini 2.5 update from Google Deepmind

versatileaiBy versatileaiMay 27, 2025No Comments1 Min Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
#image_title
Share
Facebook Twitter LinkedIn Pinterest Email

New Gemini 2.5 Features

Native audio output and live API improvements

Today, Live APIs introduce preview versions of audiovisual input and native audio out dialogs, allowing you to directly build conversational experiences with more natural and expressive Gemini.

It also allows users to manipulate tones, accents and speech styles. For example, you can instruct your model to use dramatic voices when telling stories. It also supports the use of the tool and allows you to search for it on your behalf.

You can try out a set of early features including:

An emotional dialogue in which the model detects and responds appropriately to the user’s voice emotions. In ProActual Audio, models can ignore background conversations and know when to respond. The idea in the live API utilizes Gemini’s thinking capabilities to help the model support more complex tasks.

We are also releasing new previews of text-to-speech in 2.5 Pro and 2.5 Flash. These have initial support for multiple speakers, allowing speech from text using two voices via native audio out.

Like native audio dialogs, text-to-speech is expressive and can capture very subtle nuances such as whispers. Works in over 24 languages ​​and seamlessly switch between them.

author avatar
versatileai
See Full Bio
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleThe US policy movement reflects major technology issues with state AI laws
Next Article Gusts of Social Media AI Content reveal cultural change – Scott Scoop News
versatileai

Related Posts

Tools

AI is rewriting the rules of the insurance industry

July 12, 2025
Tools

Deploy the Full Stack Desktop Agent

July 11, 2025
Tools

Google’s open Medgemma AI model could transform healthcare

July 11, 2025
Add A Comment
Leave A Reply Cancel Reply

Top Posts

Leading the Korean LLM evaluation ecosystem

July 8, 20251 Views

Introducing the Red Team Resistance Leaderboard

July 6, 20251 Views

Will AI apps help carry the mental load of moms?

May 8, 20251 Views
Stay In Touch
  • YouTube
  • TikTok
  • Twitter
  • Instagram
  • Threads
Latest Reviews

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Most Popular

Leading the Korean LLM evaluation ecosystem

July 8, 20251 Views

Introducing the Red Team Resistance Leaderboard

July 6, 20251 Views

Will AI apps help carry the mental load of moms?

May 8, 20251 Views
Don't Miss

California Building aims to protect children from AI chatbots

July 12, 2025

AI is rewriting the rules of the insurance industry

July 12, 2025

Data and AI Status: Security and Privacy

July 12, 2025
Service Area
X (Twitter) Instagram YouTube TikTok Threads RSS
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
© 2025 Versa AI Hub. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?