Homepage
Open in app
Sign in
Get started
SqueezeBits Team Blog
SqueezeBits Team Blog
Team
Product
Tech Insight
Career
Follow
Latest
[EN] FP8 Quantization with OwLite
[EN] FP8 Quantization with OwLite
Introducing FP8 Quantization applied to OwLite.
Changjun Lee
Aug 4
[KR] OwLite와 함께하는 FP8 Quantization
[KR] OwLite와 함께하는 FP8 Quantization
OwLite에 적용된 FP8 Quantization을 소개해드립니다.
Changjun Lee
Jul 28
취업 준비 필수 가이드: 채용 행사 활용하기
취업 준비 필수 가이드: 채용 행사 활용하기
다양한 채용 행사의 종류와 유용한 질문 리스트, 스퀴즈비츠 채용 FAQ까지 한 번에 정리해 드립니다.
NAEUN KIM
Jul 21
How much can we save through compression?
How much can we save through compression?
Estimating the cost savings from model compression.
Semin Cheon
Jun 25
현장에서 체험해보는 AI 경량화, IT 전시회 참여기
현장에서 체험해보는 AI 경량화, IT 전시회 참여기
IT 전시회에 참여한 스퀴즈비츠 이야기를 소개합니다.
NAEUN KIM
May 26
‘Breaking Down’ tokenizers in LLMs
‘Breaking Down’ tokenizers in LLMs
An introduction to tokenizers and their implications in language models.
Semin Cheon
May 9
SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
Accelerating LLM inference by pruning redundant transformer blocks
Semin Cheon
May 7
About SqueezeBits Team Blog
Latest Stories
Archive
About Medium
Terms
Privacy
Teams