Published inPyTorchColossal-LLaMA-2: Low Cost and High-quality Domain-specific LLM Solution Using LLaMA and…The most prominent distinction between LLaMA-1 and LLaMA-2 lies in the incorporation of higher-quality corpora, a pivotal factor…Jan 29Jan 29
Published inPyTorchColossalChat: An Open-Source Solution for Cloning ChatGPT With a Complete RLHF PipelineLarge AI models and applications like ChatGPT and GPT-4 have become extremely popular worldwide, serving as a foundation for the…Mar 29, 20236Mar 29, 20236
Published inPyTorchLatest Colossal-AI boasts novel automatic parallelism and offers savings up to 46x for Stable…This post is authored by Prof. Yang You, founder of HPC-AI Tech, the company developing Colossal-AI. Yang received his Ph.D. in Computer…Jan 31, 20231Jan 31, 20231
Diffusion Pretraining and Hardware Fine-Tuning Can Be Almost 7X Cheaper!Author: Yang You, Presidential Young Professor at the National University of SingaporeNov 8, 20222Nov 8, 20222
Use a Laptop to Analyze 90% of Proteins, With a Single-GPU Inference Sequence Exceeding 10,000!Proteins are the basis of almost all functions of life. Evaluating the shape a protein folds — the “protein folding problem” — has been a…Oct 27, 2022Oct 27, 2022
Embedding Training With 1% GPU Memory and 100 Times Less Budget, an Open Source Solution for…Deep recommendation models (DLRMs) have become critical for deep learning applications in IT companies. DLRMs can be used to improve user…Oct 18, 20221Oct 18, 20221
Colossal-AI Seamlessly Accelerates Large Models at Low Costs with Hugging FaceForbes News, the world’s leading voice, recently declared large AI models as one of six AI trends to watch for in 2022. As large-scale AI…Jul 12, 2022Jul 12, 2022