Model card
Vicuna-13B + Whisper Q-Former.
Academicopen-source15B paramsWhisper encoder + Q-Former + LLM
Connecting Speech Encoder and LLM for ASR. arXiv:2309.13963.
§ 01 · Benchmarks
No recorded benchmark results yet.
This model is in the registry but doesn’t have any benchmark_results rows yet. If you have a score, submit it →
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 04 · Related models