NTT123
diff --git a/assets/infore/clip.wav b/assets/infore/clip.wav
-57.5 KB
diff --git a/README.md b/README.md
Lines changed: 4 additions & 3 deletions
@@ -1,7 +1,7 @@
 A Vietnamese TTS
 ================
 
-Tacotron + HiFiGAN vocoder for vietnamese datasets.
+Duration model + acoustic model + HiFiGAN vocoder for Vietnamese text-to-speech applications.
 
 Online demo at https://huggingface.co/spaces/ntt123/vietTTS.
 
@@ -32,12 +32,13 @@ Download InfoRe dataset
 -----------------------
 
 ```sh
-bash ./scripts/download_aligned_infore_dataset.sh
+python ./scripts/download_aligned_infore_dataset.py
 ```
 
 **Note**: this is a denoised and aligned version of the original dataset which is donated by the InfoRe Technology company (see [here](https://www.facebook.com/groups/j2team.community/permalink/1010834009248719/)). You can download the original dataset (**InfoRe Technology 1**) at [here](https://github.com/TensorSpeech/TensorFlowASR/blob/main/README.md#vietnamese).
 
-The Montreal Forced Aligner (MFA) is used to align transcript and speech (textgrid files). [Here](https://colab.research.google.com/gist/NTT123/c99b5a391af56e0cb8f7b190d3d7f0ee/infore-mfa-example.ipynb) is a Colab notebook to align InfoRe dataset. Visit [MFA](https://montreal-forced-aligner.readthedocs.io/en/latest/) for more information on how to create textgrid files.
+See `notebooks/denoise_infore_dataset.ipynb` for instructions on how to denoise the dataset. We use the Montreal Forced Aligner (MFA) to align transcripts with speech, which produces the textgrid files.
+See `notebooks/align_text_audio_infore_mfa.ipynb` for instructions on how to create textgrid files.
 
 Train duration model
 --------------------