Just use the Chinese frequency dictionary from https://github.com/hermitdave/FrequencyWords/blob/master/content/2018/zh_cn/zh_cn_50k.txt with LoadDictionary.
For other languages you can find frequency dictionaries here: https://github.com/hermitdave/FrequencyWords/tree/master/content/2018
You should be able to load a Chinese word frequency list with symSpell.LoadDictionary() without problems.
symSpell.CreateDictionary(string corpus) will not work for Chinese, because no Chinese Word Segmentation is executed automatically (Chinese words are usually not separated by white space).
Are you using the Python port of SymSpell ?https://github.com/mammothb/symspellpy
By using CreateDictionaryEntry(“NewWord”,1) you may add any new word to the existing frequency dictionary. The defaultCountThreshold is 1. That means every word with a count≥1 in CreateDictionaryEntry is added as new word (if the word is not…
The better the quality of the text corpus, the better the quality of the dictionary, the better the quality of the spelling correction. That said, the spelling correction suggestions provided by SymSpell are sorted by edit distance and (for equal edit distance) word occurrence frequency.
>> What do I need to build my own freq. dictionary using SymSpell.CreateDictionary?
>>Do I need MS Visual Studio to be able to generate this?
The CreateDictionary(string corpus) method expects a path/filename to large text file (corpus). SymSpell will automatically generate a frequency dictionary…
SymSpell is language independent and works also with non-Latin languages like Russian (Cyrillic alphabet), for Chinese and even Georgian. So it should also work with Arabic, though I can’t test it as I’m not familiar with Arabic.
With SymSpell.LoadDictionary you can use any word frequency list.
SymSpell is an algorithm (implementations are available in many programming languages) to be used by developers to add fast approximate string search or spelling correction to their products, rather than being a consumer-ready turnkey product itself.