The first full release of Hanami's diffsinger voicebank. Hope you enjoy!
I changed the name of the voicebank, since it's how part of Project AI❤dol; you'll hear more details about that at a later date.
Download on MediaFire
(For organizational purposes)
Voice description
- General voice type: Natural "pop soprano", suitable for most popular genres of music.
- Vocal modes:
- Root (Core/Normal);
- Fragrance (Power);
- Nectar (Soft);
- Other voicebank features:
- Duration;
- Velocity;
- Gender;
- Auto-pitch;
- Custom vocoder (AI❤dolGAN).
Officially supported languages
These are languages for which actual data has been recorded.
- English (approx. 2 hours of data);
- Japanese (approx. 40 min. of data);
- German (approx. 17 min. of data);
- Korean (approx. 11 min. of data);
- Spanish (approx. 7 min. of data);
- Mandarin Chinese (approx. 6 min. of data);
- Latin (Ecclesiastical) (approx. 5 min. of data; no dictionary and/or phonemizer included yet);
- French (approx. 3 min. of data; phonemizer included).
Total data count: 3 hours, 53 minutes.
No external data has been used for training. Due to this, some of the data might have a bit of an accent. Apologies in advance!
Unofficially supported languages (dictionaries included)
- Cantonese;
- Vietnamese (phonemizer included).
Note that there are likely many more unofficially supported languages; they simply haven't been tested yet. End users are encouraged to experiment.
Relevant credits
Millefeuille phonemizer by imsupposedto @ UTAUFrance.