tts-hardtext-v2:gradium-audrey:2026-05-17
This packet is meant to answer one narrow question: did the model keep hard text intact enough for downstream use? It is not trying to prove that the voice is pleasant or expressive.
The quarterly revenue increased by 17.8 percent to 4.2 million dollars.
The quarterly revenue increased by 17.8% to $4.2 million.
meaning preserved; percent and dollar amount normalized by ASR
Can the voice preserve numbers, dates, names, URLs, acronyms, and business-critical wording after ASR transcription?
73.3% critical entity accuracy across 30 prompts; 13.4% WER and 6.7% CER.
meaning preserved; percent and dollar amount normalized by ASR
p95 time-to-first-byte was 299 ms on api-eu.
This is not a naturalness, acting, emotion, cloning, audiobook, or preference score.
Prompt set, transcript outputs, latency log, model config, and run config are linked below.
sha256:gradium-audrey-v1-manifest
sha256:hardtext-v2-en-30-prompts