yulan-team/YuLan-Mini-Before-Annealing
Tags: Safetensors, optimizer_states
arXiv: 2412.17743
License: mit
Branch: main
Path: YuLan-Mini-Before-Annealing/global_step243198_universal/zero/model.layers.0.self_attn.q_proj.weight
1 contributor
History: 1 commit
Latest commit: IvanHU, "Upload correct optimizer states" (13226fe), 25 days ago
exp_avg.pt
  Size: 14.7 MB (LFS)
  Last commit: Upload correct optimizer states, 25 days ago

exp_avg_sq.pt
  Size: 14.7 MB (LFS)
  Pickle scan: Safe
  Detected pickle imports (3): "torch.FloatStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
  Last commit: Upload correct optimizer states, 25 days ago

fp32.pt
  Size: 14.7 MB (LFS)
  Last commit: Upload correct optimizer states, 25 days ago

step.pt
  Size: 852 Bytes (LFS)
  Pickle scan: Safe
  Pickle imports: no problematic imports detected
  Last commit: Upload correct optimizer states, 25 days ago
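
The pickle imports reported above for exp_avg_sq.pt are the module-level names that a pickle stream asks Python to import when it is loaded. The following is a minimal sketch of how such a list could be produced with the standard-library pickletools module. It is not the Hub's scanner: the function name list_pickle_imports is hypothetical, and the handling of STACK_GLOBAL is a simple heuristic that assumes the module and attribute names are the two most recent string opcodes.

```python
import collections
import pickle
import pickletools

# Opcodes that push a text string onto the pickle stack.
_STRING_OPCODES = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data):
    """Return a sorted list of 'module.name' strings referenced by a pickle.

    The stream is only disassembled, never loaded, so no code from the
    pickle is executed. (Hypothetical helper, for illustration only.)
    """
    imports = set()
    recent_strings = []
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument arrives as "module name".
            module, name = arg.split(" ", 1)
            imports.add(module + "." + name)
        elif opcode.name in _STRING_OPCODES:
            recent_strings.append(arg)
        elif opcode.name == "STACK_GLOBAL" and len(recent_strings) >= 2:
            # Protocol 4+: heuristic, assumes the last two strings pushed
            # are the module and the attribute name.
            imports.add(recent_strings[-2] + "." + recent_strings[-1])
    return sorted(imports)


# Example on a small in-memory pickle (not one of the files listed above).
example = pickle.dumps(
    collections.OrderedDict(a=1), protocol=2, fix_imports=False
)
print(list_pickle_imports(example))
```

To apply this to a .pt file, the pickle bytes would first have to be extracted. Checkpoints written in PyTorch's zip-based format usually keep them in an archive member named data.pkl, but that layout is an assumption here and has not been checked against these files.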