Attention matrix
1
#12 opened about 1 year ago
by
stolosa
Lower precision
#11 opened over 1 year ago
by
pipparichter
PEFT LoRA and QLoRA
#10 opened over 1 year ago
by
AmelieSchreiber
accessing to embedding layer and generate embeddings step by step
#9 opened over 1 year ago
by
francescopatane
Understanding vocabulary size
#8 opened over 1 year ago
by
dannyLCG
how visualize attention matrix
2
#7 opened over 1 year ago
by
francescopatane
TorchScript export failed. Maybe related to sequence length cache.
#5 opened almost 2 years ago
by
chenchaozhao
inferring device map for model
#4 opened almost 2 years ago
by
mahdi-b
passing parameters to the underlying model's forward
4
#3 opened about 2 years ago
by
mahdi-b