quantization-aware-training
Here are 58 public repositories matching this topic...
Neural Network Compression Framework for enhanced OpenVINO™ inference
-
Updated
Jun 7, 2024 - Python
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
-
Updated
Jun 9, 2024 - Python
Tutorial notebooks for hls4ml
-
Updated
Jun 6, 2024 - Jupyter Notebook
ECQx: Explainability-Driven Quantization for Low-Bit and Sparse DNNs
-
Updated
Jun 6, 2024 - Python
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
-
Updated
Jun 6, 2024 - Python
Quantization of Models : Post-Training Quantization(PTQ) and Quantize Aware Training(QAT)
-
Updated
May 21, 2024 - Jupyter Notebook
A Tutorial Notebook to Quantization in Machine Learning
-
Updated
May 19, 2024 - Jupyter Notebook
Implementation of MedQ: Lossless ultra-low-bit neural network quantization for medical image segmentation
-
Updated
May 17, 2024
Quantization notebooks (adapted from and for Mobile Apps w/ Machine Learning, By Dara Varam and Lujain Khalil)
-
Updated
May 9, 2024 - Jupyter Notebook
EfficientNetV2 (Efficientnetv2-b2) and quantization int8 and fp32 (QAT and PTQ) on CK+ dataset . fine-tuning, augmentation, solving imbalanced dataset, etc.
-
Updated
May 4, 2024 - Jupyter Notebook
A lightweight Convolutional Autoencoder for recognizing Bangla font styles along with quantization for deploying resource-constrained IoT devices.
-
Updated
Apr 30, 2024 - Jupyter Notebook
Training neural nets with quantized weights on arbitrarily specified bit-depth
-
Updated
Mar 29, 2024 - Python
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture
-
Updated
Mar 17, 2024 - Python
Quantization simulation of neural networks with PyTorch
-
Updated
Feb 8, 2024 - Python
Quantization Aware Training
-
Updated
Jan 13, 2024 - Python
Quantization Aware Training
-
Updated
Jan 13, 2024 - Python
A model compression and acceleration toolbox based on pytorch.
-
Updated
Jan 12, 2024 - Python
Classify alcohols and its snacks
-
Updated
Dec 31, 2023 - Python
Notes on quantization in neural networks
-
Updated
Dec 14, 2023 - Jupyter Notebook
Improve this page
Add a description, image, and links to the quantization-aware-training topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the quantization-aware-training topic, visit your repo's landing page and select "manage topics."