Gemini Audio Understanding
2026-04-08, successful Crazyrouter and local :4000 retests show:
gemini-2.5-procan readaudio/wav- the currently verified primary path is
inlineData - short audio classification, transcription, language hints, and summaries can be requested directly in text
Verified Minimal Request
Request Notes
- Prefer
inlineDatafor audio understanding - Keep
mimeTypealigned with the actual format, such asaudio/wavoraudio/mpeg - Put raw Base64 into
datawithout a Data URL prefix - If you need strict JSON instead of JSON-looking text, combine this route with Structured Outputs