Which encoding mechanism is best for Chinese, English, Japanese, and Korean?

CDS’s founding director Yann LeCun and Ph.D. student Xiang Zhang produce the first systematic study of 473 encoding models for text classification on 14 multilingual data sets

Center for Data Science

This is the official research blog of the NYU Center for Data Science (CDS). Established in 2013, we are a leading data science training and research facility, offering an MS in Data Science and, as of 2017, one of the nation’s first Ph.D. programs in Data Science.

Written by NYU Center for Data Science

Official account of the Center for Data Science at NYU, home of the Master’s and Ph.D. in Data Science.
