-
Notifications
You must be signed in to change notification settings - Fork 72
Open
Description
Hello! Thanks for your wonderful work. Trying to reproduce your results on the TTS task, I'm wondering if you could provide more details about the evaluation of the TTS task, especially:
- How many / Which samples are used in VCTK dataset
- Which ASR model is used to convert the generated speech into text
- How the WER is calculated; What kind of text normalization is applied before the calculation
Thanks!
Metadata
Metadata
Assignees
Labels
No labels