Close Menu
Versa AI hub
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

What's Hot

JBS Dev: About incomplete data and the last mile of AI – from model capabilities to cost sustainability

May 13, 2026

AI automates HR compliance except where tech companies need it

May 12, 2026

Pre-training a mix of experts to achieve new modularity

May 11, 2026
Facebook X (Twitter) Instagram
Versa AI hubVersa AI hub
Wednesday, May 13
Facebook X (Twitter) Instagram
Login
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources
Versa AI hub
Home»Tools»Gemini 2.5 Native Audio upgrade and text-to-speech model update
Tools

Gemini 2.5 Native Audio upgrade and text-to-speech model update

versatileaiBy versatileaiDecember 13, 2025No Comments2 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
#image_title
Share
Facebook Twitter LinkedIn Pinterest Email

Customer testimonials

Google Cloud customers are already using Gemini’s native audio capabilities to drive real business outcomes, from processing mortgages to calling customers.

“Users often forget they’re talking to an AI within a minute of using Sidekick, and in some cases, they even thank the bot after a long chat…The new Live API AI capabilities delivered through Gemini (2.5 Flash Native Audio) allow sellers to win.” – David Wurtz, VP of Products, Shopify Since its launch in May, Mia’s capabilities have been significantly enhanced. This powerful combination has enabled us to generate over 14,000 loans for our broker partners.” – Jason Bressler, Chief Technology Officer, United Wholesale Mortgage (UWM) “By working with the Gemini 2.5 Flash native audio model through Vertex AI, Receptionists can achieve unparalleled conversational intelligence: identify the main speaker in noisy environments, switch languages mid-conversation, and sound incredibly natural and expressive.” – David Yang, Co-Founder, Newo.ai.

live voice translation

Gemini now natively supports a new live voice-to-speech translation feature designed to handle both continuous listening and two-way conversation.

With continuous listening, Gemini automatically translates audio spoken in multiple languages ​​into a single target language. This allows you to put on your headphones and hear the world around you in your own language.

For two-way conversations, Gemini’s Live Voice Translator processes translations between two languages ​​in real-time and automatically switches the output language based on who is speaking. For example, if you speak English and want to chat with someone who speaks Hindi, you’ll hear the English translation in real time through your headphones, and when you’re done speaking, your phone will broadcast the Hindi.

Gemini’s live voice translation has many important features that are useful in the real world.

Language coverage: Gemini models’ world knowledge and multilingual capabilities, combined with native audio capabilities, translate audio in over 70 languages ​​and over 2,000 language pairs. Style Transfer: Captures the nuances of human speech and preserves the speaker’s intonation, pace, and pitch, making translations sound natural. Multilingual input: Understand multiple languages ​​simultaneously in one session and help you follow multilingual conversations without having to fiddle with language settings. Auto-detection: Starts by identifying the language being spoken. Noise Resistant: Eliminates ambient noise so you can have a comfortable conversation even in noisy outdoor environments.

author avatar
versatileai
See Full Bio
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleTurn ideas into artwork: Use CapCut PC as an AI image generator – APN News
Next Article Marine veteran launches podcast for marketing and business leaders, ‘The AI ​​Briefing Room’
versatileai

Related Posts

Tools

JBS Dev: About incomplete data and the last mile of AI – from model capabilities to cost sustainability

May 13, 2026
Tools

AI automates HR compliance except where tech companies need it

May 12, 2026
Tools

Pre-training a mix of experts to achieve new modularity

May 11, 2026
Add A Comment

Comments are closed.

Top Posts

OpenAI blocks Sora from creating MLK video after Estate object

November 23, 200521 Views

SNS Network Project Increases GPUAAS Business and Server Sales, Expanding AI Adoption

May 6, 202518 Views

How Prezi leverages hubs and expert support programs to accelerate your ML roadmap

April 22, 202516 Views
Stay In Touch
  • YouTube
  • TikTok
  • Twitter
  • Instagram
  • Threads
Latest Reviews

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Most Popular

OpenAI blocks Sora from creating MLK video after Estate object

November 23, 200521 Views

SNS Network Project Increases GPUAAS Business and Server Sales, Expanding AI Adoption

May 6, 202518 Views

How Prezi leverages hubs and expert support programs to accelerate your ML roadmap

April 22, 202516 Views
Don't Miss

JBS Dev: About incomplete data and the last mile of AI – from model capabilities to cost sustainability

May 13, 2026

AI automates HR compliance except where tech companies need it

May 12, 2026

Pre-training a mix of experts to achieve new modularity

May 11, 2026
Service Area
X (Twitter) Instagram YouTube TikTok Threads RSS
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
© 2026 Versa AI Hub. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?