safety and responsibility
We have actively evaluated potential risks at every stage of the development process for these native audio features and leveraged what we learned to inform mitigation strategies. We validate these measures through rigorous internal and external safety assessments, including comprehensive red teams for responsible implementation. Additionally, all audio output from our models is embedded with our watermarking technology, SynthID, making AI-generated audio identifiable and transparent.
Native audio features for developers
Gemini 2.5 introduces native audio output to models, giving developers new capabilities to build richer, more interactive applications through the Gemini API in Google AI Studio or Vertex AI.
To start exploring, developers can try out native audio dialog using the Gemini 2.5 Flash preview in the Streams tab in Google AI Studio. Controllable Speech Generation (TTS) is available in preview in both Gemini 2.5 Pro and Flash by selecting Speech Generation in the (Media Generation) tab within Google AI Studio.

