News

You can now train models on this dataset with SpeechBrain.

Aishell1Mix

Aishell1Mix is a Mandarin speech separation dataset in the style of WSJMix and LibriMix. It mixes 2 or 3 speaker sources from the open-source Mandarin speech corpus Aishell1 with noise from the WHAM dataset. The scripts are adapted from LibriMix; please refer to it for more details.

How to generate

First, make sure that SoX is installed on your machine.

For Windows:

conda install -c groakat sox

For Linux or macOS:

conda install -c conda-forge sox
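
Before going further, it can help to confirm that the sox executable is actually visible on your PATH. The snippet below is a minimal sketch using only the Python standard library; find_executable is a hypothetical helper written for this example and is not part of the Aishell1Mix scripts.

```python
import shutil
from typing import Optional


def find_executable(name: str) -> Optional[str]:
    """Return the full path of `name` if it is on PATH, else None.

    Hypothetical helper for illustration; not part of the Aishell1Mix repo.
    """
    return shutil.which(name)


sox_path = find_executable("sox")
if sox_path is None:
    print("SoX was not found on PATH; install it before generating the dataset.")
else:
    print(f"SoX found at {sox_path}")
```

If the check reports that SoX is missing right after a conda install, make sure the same conda environment is active in the shell you use to run the generation script.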

Then, to generate Aishell1Mix, clone the repo and run the main script, generate_aishell1mix.sh:

git clone https://github.com/huangzj421/Aishell1Mix.git
cd Aishell1Mix
pip install -r requirements.txt
./generate_aishell1mix.sh storage_dir

Features

In Aishell1Mix you can choose:

The number of sources in the mixtures.
The sample rate of the dataset: 16 kHz or any lower frequency.
The mode of the mixtures: min (the mixture ends when the shortest source ends) or max (the mixture ends when the longest source ends).
The type of mixture: mix_clean (utterances only), mix_both (utterances + noise), or mix_single (1 utterance + noise).
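
To make the min and max modes and the three mixture types concrete, here is a small pure-Python sketch. It is not the code used by the generation scripts: mix_sources is a hypothetical helper, signals are plain lists of samples, and zero-padding shorter signals (including noise) in max mode is an assumption made for this illustration.

```python
from typing import List, Optional, Sequence


def mix_sources(
    sources: Sequence[Sequence[float]],
    mode: str = "min",
    noise: Optional[Sequence[float]] = None,
) -> List[float]:
    """Sum sources sample by sample (hypothetical helper, not from the repo).

    mode="min": truncate every signal to the shortest source.
    mode="max": zero-pad every source to the longest one (assumption).
    noise=None gives a mix_clean-style mixture; passing noise gives a
    mix_both-style (several sources) or mix_single-style (one source) mixture.
    """
    if mode not in ("min", "max"):
        raise ValueError(f"unknown mode: {mode!r}")
    lengths = [len(s) for s in sources]
    target = min(lengths) if mode == "min" else max(lengths)

    def fit(signal: Sequence[float]) -> List[float]:
        # Truncate or zero-pad so the signal has exactly `target` samples.
        clipped = list(signal[:target])
        return clipped + [0.0] * (target - len(clipped))

    signals = [fit(s) for s in sources]
    if noise is not None:
        signals.append(fit(noise))
    return [sum(samples) for samples in zip(*signals)]


s1 = [1.0, 1.0, 1.0, 1.0]
s2 = [2.0, 2.0]
print(mix_sources([s1, s2], mode="min"))          # [3.0, 3.0]
print(mix_sources([s1, s2], mode="max"))          # [3.0, 3.0, 1.0, 1.0]
print(mix_sources([s2], noise=[0.5, 0.5, 0.5]))   # [2.5, 2.5]
```

The last call corresponds to a mix_single-style mixture: one utterance plus noise, with the noise truncated to the utterance length.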

You can customize the generation by editing generate_aishell1mix.sh.
