hifisampler

A new UTAU resampler based on pc-nsf-hifigan for virtual singer.

For Jinriki please use our Hachimisampler.

Why is it called hifisampler?

Hifisampler was modified from straycatresampler, replacing the original WORLD with pc-nsf-hifigan.

What makes pc-nsf-hifigan different from traditional vocoders?

pc-nsf-hifigan employs neural networks to upsample the input features, offering clearer audio quality than traditional vocoders. It is an improvement over the traditional nsf-hifigan, supporting f0 inputs that do not match mel, making it suitable for UTAU resampling.

How to use?

Three installation methods are provided; choose the one that best suits your needs and preferences.

Using Integrated Environment Package (Recommended for NVIDIA GPU)

Download the latest release package and extract it. Run start.bat to start the rendering service.
If you're using the experimental server auto-start feature (Optional, but not recommended), keep config.default.yaml, hifiserver.py, hifisampler.exe, and launch_server.py in the same directory. It's best to keep the original file structure after extracting the release. For OpenUTAU, you can create a symbolic link to place hifisampler.exe in the Resamplers folder.
```
mklink "C:\[OpenUTAU Path]\Resamplers\hifisampler.exe" "C:\[Project Path]\hifisampler.exe"
```
Set the UTAU resampler to hifisampler.exe and ensure the rendering service is running.

Manual Installation using uv

Install uv following the instructions in the uv documentation.
Download and extract the source code from the latest release. Then, navigate into the extracted folder.
Download model files from release assets. Unzip and place it in the project folder.
Fill in the configuration details in config.yaml. If this is your first time using the software, modify config.default.yaml instead. The config.yaml file will be automatically generated upon the first run.
Depending on your hardware, you can select a suitable CUDA version for acceleration. To do this, modify the tool.uv.sources section in pyproject.toml. For example, to enable CUDA acceleration:
```
[tool.uv.sources]
torch = [
   { index = "pytorch-cu128" },
]
```
If you're using the CPU version, set it as follows:
```
[tool.uv.sources]
 torch = [
     { index = "pytorch-cpu" },
 ]
```
If you're using the experimental server auto-start feature (Optional, but not recommended), keep config.default.yaml, hifiserver.py, hifisampler.exe, and launch_server.py in the same directory. It's best to keep the original file structure after extracting the release. For OpenUTAU, you can create a symbolic link to place hifisampler.exe in the Resamplers folder.
```
mklink "C:\[OpenUTAU Path]\Resamplers\hifisampler.exe" "C:\[Project Path]\hifisampler.exe"
```
Before each use, run hifiserver.py to start the rendering service. If you're using the experimental server auto-start feature, you can skip this step. Enter the following command in your terminal:
```
uv run hifiserver.py
```
Set the resampler in UTAU to hifisampler.exe and ensure the rendering service is running.

Manual Installation using conda/pip

Install Python 3.10 and run the following commands (it's strongly recommended to use conda for easier environment management):
```
pip install -r requirements.txt
```
Download the CUDA version of PyTorch from the Torch website (If you're certain about only using the ONNX version, then downloading the CPU version of PyTorch is fine).
Download model files from release assets. Unzip and place it in the project folder.
If you're using the experimental server auto-start feature (Optional, but not recommended), keep config.default.yaml, hifiserver.py, hifisampler.exe, and launch_server.py in the same directory. It's best to keep the original file structure after extracting the release. For OpenUTAU, you can create a symbolic link to place hifisampler.exe in the Resamplers folder.
```
mklink "C:\[OpenUTAU Path]\Resamplers\hifisampler.exe" "C:\[Project Path]\hifisampler.exe"
```
Download the release, unzip it, and run 'hifiserver.py'.
Set UTAU's resampler to hifisampler.exe.

Implemented flags

g: Adjust gender/formants.
- Range: -600 to 600 | Default: 0
Hb: Adjust breath/noise.
- Range: 0 to 500 | Default: 100
Hv: Adjust voice/harmonic.
- Range: 0 to 150 | Default: 100
HG: Vocal fry/growl.
- Range: 0 to 100 | Default: 0
P: Normalize loudness at the note level, targeting -16 LUFS. Enable this by setting wave_norm to true in your config.yaml file.
- Range: 0 to 100 | Default: 100
t: Shift the pitch by a specific amount, in cents. 1 cent = 1/100 of a semitone.
- Range: -1200 to 1200 | Default: 0
Ht: Adjust tension.
- Range: -100 to 100 | Default: 0
A: Modulating the amplitude based on pitch variations, which helps creating a more realistic vibrato.
- Range: -100 to 100 | Default: 0
G: Force to regenerate feature cache (Ignoring existed cache).
- No value needed
He: Enable Mel spectrum loop mode.
- No value needed

Note: The flags B and V were renamed to Hb and Hv respectively because they conflict with other UTAU flags but have different definitions.

Other Notes

If using server-side auto-start (Experimental), closing the terminal window or rendering process during server startup may cause the server to freeze. You can try manually releasing the file lock on hifisampler.exe. We recommend manually starting the rendering service using ./start.bat to avoid issues.

Name		Name	Last commit message	Last commit date
Latest commit History 139 Commits
backend		backend
client		client
hnsep		hnsep
util		util
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
README_zh_cn.md		README_zh_cn.md
config.default.yaml		config.default.yaml
config.py		config.py
hifisampler.exe		hifisampler.exe
hifiserver.py		hifiserver.py
launch_server.py		launch_server.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
start.bat		start.bat
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

hifisampler

Why is it called hifisampler?

What makes pc-nsf-hifigan different from traditional vocoders?

How to use?

Using Integrated Environment Package (Recommended for NVIDIA GPU)

Manual Installation using uv

Manual Installation using conda/pip

Implemented flags

Other Notes

Acknowledgments

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors 7

Languages

License

openhachimi/hifisampler

Folders and files

Latest commit

History

Repository files navigation

hifisampler

Why is it called hifisampler?

What makes pc-nsf-hifigan different from traditional vocoders?

How to use?

Using Integrated Environment Package (Recommended for NVIDIA GPU)

Manual Installation using uv

Manual Installation using conda/pip

Implemented flags

Other Notes

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors 7

Languages

Packages