AQLM: Extreme Compression of Large Language Models via Additive Quantization #5984
joseph777111 started this conversation in Ideas
Official PyTorch repository for "Extreme Compression of Large Language Models via Additive Quantization" (AQLM):
https://arxiv.org/pdf/2401.06118.pdf
https://github.com/Vahe1994/AQLM
Thoughts?
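For anyone skimming before reading the paper: the core idea of additive quantization is to represent each small group of weights as the *sum* of one codevector from each of several learned codebooks, so storage cost is just the code indices. Below is a minimal toy sketch of that decoding scheme; the group size, codebook sizes, and the greedy encoder are illustrative stand-ins, not the paper's actual settings or its beam-search optimization.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy parameters (hypothetical, not the paper's configuration):
# weight groups of g=8 values, M=2 codebooks, K=256 codevectors each.
g, M, K = 8, 2, 256
codebooks = rng.standard_normal((M, K, g))

def encode(group, codebooks):
    """Greedy residual encoding: for each codebook in turn, pick the
    codevector closest to the remaining residual. (A crude stand-in for
    the joint optimization AQLM actually performs.)"""
    residual = group.copy()
    codes = []
    for cb in codebooks:  # cb has shape (K, g)
        idx = int(np.argmin(((cb - residual) ** 2).sum(axis=1)))
        codes.append(idx)
        residual = residual - cb[idx]
    return codes

def decode(codes, codebooks):
    """Additive reconstruction: the group is the SUM of one chosen
    codevector per codebook."""
    return sum(cb[c] for cb, c in zip(codebooks, codes))

group = rng.standard_normal(g)
codes = encode(group, codebooks)
approx = decode(codes, codebooks)

# Storage for this toy setup: M * log2(K) = 16 bits per 8 weights,
# i.e. 2 bits per weight (before counting the shared codebooks).
```

The "extreme compression" headline comes from exactly this accounting: only the per-group indices scale with model size, while the codebooks are shared across many groups.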