Homepage
Open in app
Sign in
Get started
Intel Analytics Software
Machine Learning
Graph Processing
Intel Neural Compressor
Tagged in
Quantization
Intel Analytics Software
Better Insights Faster: Big Data Driving AI
More information
Followers
374
Elsewhere
More, on Medium
Quantization
Intel(R) Neural Compressor
in
Intel Analytics Software
Apr 2
The AutoRound Quantization Algorithm
Weight-Only Quantization for LLMs Across Hardware Platforms
Read more…
10
Intel(R) Neural Compressor
in
Intel Analytics Software
Oct 23, 2023
Quantizing Large Language Models on Your Laptop
Layer-Wise Low-Bit Weight-Only Quantization
Read more…
10
Intel(R) Neural Compressor
in
Intel Analytics Software
Aug 13, 2023
Diagnosing Quantization Accuracy Loss with Neural Insights
Easily Identify the Operators Causing…
Read more…
Intel(R) Neural Compressor
in
Intel Analytics Software
Jul 27, 2023
Faster Stable Diffusion Inference with Intel Extension for Transformers
Faster, High-Quality Stable…
Read more…
2
Intel(R) Neural Compressor
in
Intel Analytics Software
Jul 5, 2023
Model Quantization Diagnosis with Neural Insights
A New Tool for Analyzing Neural Network Quantization
Read more…
Intel(R) Neural Compressor
in
Intel Analytics Software
Dec 6, 2022
Accelerate Stable Diffusion with Intel Neural Compressor
Faster Inference through 8-Bit Post-Training…
Read more…
124
Benjamin Consolvo
in
Intel Analytics Software
Dec 12, 2022
Quantizing a DistilBERT Humor NLP Model
Going from FP32 to INT8 for Faster Inference with Optimum…
Read more…
12
1 response
Intel(R) Neural Compressor
in
Intel Analytics Software
Sep 23, 2022
Efficient Text Classification with Intel Neural Compressor
Better Performance and Smaller Model Size…
Read more…
15
Intel(R) Neural Compressor
in
Intel Analytics Software
Sep 22, 2022
Easy Quantization in PyTorch Using Fine-Grained FX
Improve Quantization Productivity with Intel Neural…
Read more…
11