Vladimir MalinovskiiinYandexThe Evolution of Extreme LLM Compression: From QuIP to AQLM with PV-TuningWe live in the era of Large Language Models (LLMs), with companies increasingly deploying models with billions of parameters. These…Aug 5Aug 5