The method get_max_length of 'DynamicCache' is deprecated and has been removed in transformers 4.49
#10 opened 4 days ago
by
login256
Fix for missing blank space at the end of chat template.
#9 opened 27 days ago
by
ShaneTian

OOM with int4 quant
#8 opened 2 months ago
by
chungimungi

I know this is insane but is it possible?
#7 opened 3 months ago
by
Assbang
MMLU benchmark performance on math domain
#6 opened 5 months ago
by
Fighoture
Use try-except for flash_attn
#5 opened 6 months ago
by
LiangliangMa
How do I fine-tune the deepseek-v2-lite model?
#2 opened 10 months ago
by
guowl