Sign in Get started

Tagged in

Quantization

Intel Analytics Software

Better Insights Faster: Big Data Driving AI

More information

Followers

374

Elsewhere

More, on Medium

Quantization

Intel(R) Neural Compressor in Intel Analytics Software

The AutoRound Quantization Algorithm

Weight-Only Quantization for LLMs Across Hardware Platforms

Intel(R) Neural Compressor in Intel Analytics Software

Quantizing Large Language Models on Your Laptop

Layer-Wise Low-Bit Weight-Only Quantization

Intel(R) Neural Compressor in Intel Analytics Software

Diagnosing Quantization Accuracy Loss with Neural Insights

Easily Identify the Operators Causing…

Intel(R) Neural Compressor in Intel Analytics Software

Faster Stable Diffusion Inference with Intel Extension for Transformers

Faster, High-Quality Stable…

Intel(R) Neural Compressor in Intel Analytics Software

Model Quantization Diagnosis with Neural Insights

A New Tool for Analyzing Neural Network Quantization

Intel(R) Neural Compressor in Intel Analytics Software

Accelerate Stable Diffusion with Intel Neural Compressor

Faster Inference through 8-Bit Post-Training…

Benjamin Consolvo in Intel Analytics Software

Quantizing a DistilBERT Humor NLP Model

Going from FP32 to INT8 for Faster Inference with Optimum…

Intel(R) Neural Compressor in Intel Analytics Software

Efficient Text Classification with Intel Neural Compressor

Better Performance and Smaller Model Size…

Intel(R) Neural Compressor in Intel Analytics Software

Easy Quantization in PyTorch Using Fine-Grained FX

Improve Quantization Productivity with Intel Neural…