It gives you the transcript. You can find the solution in this thread: "Will it be possible to receive text and audio data in the multimodal API?"