A cross-lingual evaluation test set used in the CosyVoice 3 paper to assess Chinese → English (zh→en) speech generation. It is one of several language-pair rows reported in the paper's cross-lingual evaluation. Web and Hugging Face searches found no standalone public dataset page or Hugging Face dataset named exactly "CosyVoice3 Cross-Lingual Test Set (zh→en)". CosyVoice-related datasets exist on Hugging Face, but this test set appears to be an evaluation split reported within the CosyVoice 3 paper rather than a separately released public dataset.
No results indexed yet — be the first to submit a score.
Submit a checkpoint and a reproduction script. We will run the script, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.