So smaller Gemma can read audio file, which is cool...
But when I tried it with LMStudio, it's not actually feeding Gemma my audio, it's using Whisper to transcribe THEN feed the text output.
Which, I can definitely see why that's a feature, but I just want my model to read the audio.
Is this planned feature or do I have to figure out ollama?