Quantization in Pytorch

About 103,000 results

Open links in new tab

Any time

wikipedia.org
https://en.wikipedia.org › wiki › Quantization_(signal_processing)
Quantization (signal processing) - Wikipedia
In mathematics and digital signal processing, quantization is the process of mapping input values from a large set (often a continuous set) to output values in a (countable) smaller set, often …
geeksforgeeks.org
https://www.geeksforgeeks.org › deep-learning › ...
What is Quantization - GeeksforGeeks
Nov 6, 2025 · Quantization is a model optimization technique that reduces the precision of numerical values such as weights and activations in models to make them faster and more …
nvidia.com
https://developer.nvidia.com › blog › model...
Model Quantization: Concepts, Methods, and Why It Matters
Nov 24, 2025 · Quantization has emerged as a crucial technique to address this challenge, enabling resource-intensive models to run on constrained hardware. The NVIDIA TensorRT …
medium.com
https://medium.com › @isanghao › what-is-quantization...
What is Quantization and Why It Matters for AI Inference?
Jul 20, 2025 · Among many optimization techniques to improve AI inference performance, quantization has become an essential method when deploying modern AI models into real …
ibm.com
https://www.ibm.com › think › topics › quantization
What is quantization? - IBM
Quantization is the process of reducing the precision of a digital signal, typically from a higher-precision format to a lower-precision format. This technique is widely used in various fields, …
mathworks.com
https://www.mathworks.com › discovery › quantization.html
What Is Quantization? | How It Works & Applications
Quantization is the process of mapping continuous infinite values to a smaller set of discrete finite values. In the context of simulation and embedded computing, it is about approximating real …
maartengrootendorst.com
https://www.maartengrootendorst.com › blog › quantization
A Visual Guide to Quantization - Maarten Grootendorst
One of the most popular quantization techniques is post-training quantization (PTQ). It involves quantizing a model’s parameters (both weights and activations) after training the model.
cloudflare.com
https://www.cloudflare.com › learning › ai › what-is-quantization
What is quantization in machine learning? - Cloudflare
What is quantization in machine learning? Quantization is a technique for lightening the load of executing machine learning and artificial intelligence (AI) models. It aims to reduce the …
arxiv.org
https://arxiv.org › html
A Comprehensive Study on Quantization Techniques for Large …
Oct 30, 2024 · With the advancement of computer science and machine learning, quantization research fields expands significantly. One of the most common quantization techniques is 8-bit …
huggingface.co
https://huggingface.co › ... › concept_guides › quantization
Quantization - Hugging Face
Try post-training static quantization which can be faster than dynamic quantization but often with a drop in terms of accuracy. Apply observers to your models in places where you want to quantize.

Some results have been removed
Pagination
- 1
- 2
- 3
- Next

Quantization (signal processing) - Wikipedia

What is Quantization - GeeksforGeeks

Model Quantization: Concepts, Methods, and Why It Matters

What is Quantization and Why It Matters for AI Inference?

What is quantization? - IBM

What Is Quantization? | How It Works & Applications

A Visual Guide to Quantization - Maarten Grootendorst

What is quantization in machine learning? - Cloudflare

A Comprehensive Study on Quantization Techniques for Large …

Quantization - Hugging Face