Custom 4-bit Finetuning: 5-7 times faster inference than QLoRA

#13
by rmihaylov - opened