Add quantization examples using torchao and quanto

#2 opened by a-r-r-o-w (HF staff)

Hey, I'm Aryan from the Diffusers team πŸ‘‹

Congratulations on the release of CogVideoX-5B!

It would be great to showcase some examples of how quantized inference (int8 and other dtypes) can be run with TorchAO and Quanto to lower memory requirements, especially since we mention it in the model card table. Feel free to modify the code/wording/URLs however you see fit. Could we also do this for the Chinese README, CogVideoX-2B, and the CogVideo GitHub repo? Thanks!

zRzRzRzRzRzRzR changed pull request status to merged
