Bowen Li

Make choices on model parameters based on the Llama 3.1 and Mistral Large 2 news
Model size isn't everything. (Jul 31)
Quantization tech of LLMs: GGUF
We can use GGUF to offload any layer of the LLM to the CPU, which lets us use both the CPU and GPU when we don't have enough VRAM. (Jul 29)
Quantization tech of LLMs
Learning notes and practice with LLM quantization techniques. (Jul 28)
The three-pass approach to reading a paper
How to efficiently find the papers most related to your work. (Jul 13)