Sharing Our Common Voice — Mozilla Releases Second Largest Public Voice Data Set
Michael Henretty

Hi Michael, this public dataset is absolutely amazing! It is a boon to researchers who cannot afford those expensive datasets.

I was wondering whether there are plans to collect Mandarin Chinese or Cantonese speech data. I am not aware of many publicly available datasets in these languages. If there is such a plan, as a phonetician I can help prepare phonetically balanced sentences and evaluate some of the recordings.

Thanks!

Luke
