Automatic Speech Recognition · benchmark dataset · EN

CosyVoice3 Cross-Lingual Test Set (zh→en).

A cross-lingual evaluation set used in the CosyVoice 3 paper to assess cross-lingual speech generation from Chinese to English (zh→en). It is one of several language-pair rows reported in the paper's evaluation. No standalone dataset page or Hugging Face dataset named exactly "CosyVoice3 Cross-Lingual Test Set (zh→en)" was found in web or Hugging Face searches. CosyVoice-related datasets exist on Hugging Face, but this test set appears to be an evaluation split reported within the CosyVoice 3 paper rather than a separately released public dataset.

§ 01 · Leaderboard

Best published scores.

No results indexed yet — be the first to submit a score.

§ 06 · Contribute

Have a score that beats this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.

What a submission needs
  • 01 A public checkpoint or API endpoint
  • 02 A reproduction script with frozen commit + seed
  • 03 Declared evaluation environment (Python, deps)
  • 04 One row per metric declared by this dataset
  • 05 A contact so we can follow up on discrepancies
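
One way to gather items 02 to 04 is to have the reproduction script write a small manifest next to its scores. The sketch below is only an illustration: the function name `build_manifest`, the JSON field names, and the placeholder values are assumptions, not a format required by the submission guide.

```python
import json
import platform
import random
import sys
from importlib import metadata


def build_manifest(commit, seed, packages, metrics):
    """Collect reproducibility details for one submission (hypothetical layout)."""
    # Seed the standard-library RNG; any ML framework in use needs its own seeding call.
    random.seed(seed)

    deps = {}
    for name in packages:
        try:
            deps[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            deps[name] = None  # package is not installed in this environment

    return {
        "commit": commit,                      # frozen commit of the reproduction script
        "seed": seed,
        "python": platform.python_version(),   # declared evaluation environment
        "dependencies": deps,
        # One row per metric, as the checklist asks.
        "metrics": [{"metric": m, "value": v} for m, v in metrics.items()],
    }


if __name__ == "__main__":
    manifest = build_manifest(
        commit="<frozen commit hash>",    # placeholder, fill in your own
        seed=1234,                        # arbitrary example seed
        packages=["pip"],                 # list the packages your evaluation imports
        metrics={"<metric name>": None},  # placeholder, use the metrics this dataset declares
    )
    json.dump(manifest, sys.stdout, indent=2)
```

Recording the installed versions at run time, rather than copying them from a requirements file, keeps the declared environment in step with what actually produced the scores.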