Safety and responsibility
We use what we have learned to actively assess potential risks and inform mitigation strategies at every stage of the development process for these native audio features. We examine these measures through rigorous internal and external safety assessments, including comprehensive red teaming for responsible deployment. Additionally, all audio outputs from the model are embedded with SynthID, a transparent technology, to ensure transparency by identifiable audio generated by AI.
Native audio features for developers
It offers native audio output for Gemini 2.5 models, providing developers with new capabilities to develop richer, interactive applications via Google AI Studio or Vertex AI’s Gemini API.
To begin exploring, developers can try out the native audio dialog in the Streams tab in Google AI Studio with Gemini 2.5 Flash Preview. Controlable Speech Generation (TTS) is available in both Gemini 2.5 Pro and Flash previews by selecting voice generation in Generate Media Tab within Google AI Studio.