Translated by Hu Yanjun, Shen Jiali, Dong Wenwen, Jia Chuan Starting with BERT in 2018, large models sprung up one after another, including GPT-3 and ViT, whose parameters are counted in billions. Explosive growths in model size happen so frequently that they can hardly impress AI developers. …