Commit a93daad

ntt123 committed
v0.1
1 parent f28188f commit a93daad

18 files changed: 5223 additions & 8 deletions

.gitignore

Lines changed: 1 addition & 0 deletions
@@ -1,3 +1,4 @@
+test_data/
 assets/reinfo/
 # IDE files
 .idea

README.md

Lines changed: 25 additions & 5 deletions
@@ -1,7 +1,9 @@
 A Vietnamese TTS
 ================
 
-Tacotron + WaveRNN for vietnamese dataset.
+Tacotron + WaveRNN for vietnamese datasets.
+
+A synthesized audio clip is at [assets/reinfo/clip.wav](assets/reinfo/clip.wav).
 
 Install
 -------
@@ -13,20 +15,38 @@ cd vietTTS
 pip3 install -e .
 ```
 
+
+Quick start using pretrained models
+----------------------------------
+```sh
+bash ./scripts/quick_start.sh
+```
+
+
 Download reinfo dataset
 -----------------------
 
 ```sh
 bash ./scripts/download_reinfo_dataset.sh
 ```
 
-Train Tacotron
---------------
+
+Train duration model
+--------------------
 
 ```sh
-python3 -m vietTTS.tacotron.trainer
+python3 -m vietTTS.nat.duration_trainer
 ```
 
+
+Train acoustic model
+--------------------
+```sh
+python3 -m vietTTS.nat.acoustic_trainer
+```
+
+
+
 Train waveRNN
 -------------
 

@@ -39,5 +59,5 @@ Synthesize speech
 -----------------
 
 ```sh
-python3 -m vietTTS.synthesizer --text="####### hôm qua em tới trường #######" --output=clip.wav
+python3 -m vietTTS.synthesizer --use-nat --text="hôm qua em tới trường" --output=clip.wav
 ```

assets/reinfo/clip.wav

1.65 MB
Binary file not shown.
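
The updated README invokes the synthesizer with `--use-nat`, `--text` and `--output`. As a minimal sketch of how that command line could be assembled from a script, the helper below builds the argument list without running anything. The function name `build_synthesize_command` and its defaults are hypothetical and are not part of vietTTS; only the module name and the three flags come from the README diff.

```python
import shlex


def build_synthesize_command(text, output="clip.wav", use_nat=True):
    """Build the argv list for the README's synthesizer command.

    Hypothetical helper for illustration only; it does not import or
    call vietTTS, it only assembles the command line shown in the README.
    """
    cmd = ["python3", "-m", "vietTTS.synthesizer"]
    if use_nat:
        # Flag added to the README example in this commit.
        cmd.append("--use-nat")
    cmd.append(f"--text={text}")
    cmd.append(f"--output={output}")
    return cmd


if __name__ == "__main__":
    # Print a shell-quoted version of the command for copy and paste.
    print(shlex.join(build_synthesize_command("hôm qua em tới trường")))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` once the package and its models are installed. Keeping it as a list avoids shell-quoting problems with the spaces and diacritics in the Vietnamese text.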
