Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
yulan-team
/
YuLan-Mini-Before-Annealing
like
6
Follow
RUC-GSAI-YuLan
29
Safetensors
optimizer_states
arxiv:
2412.17743
License:
mit
Model card
Files
Files and versions
Community
2
cb90a53
YuLan-Mini-Before-Annealing
/
global_step262772_universal
/
zero
/
model.layers.0.self_attn.v_proj.bias
1 contributor
History:
1 commit
IvanHU
Upload deepspeed checkpoint
85d4dac
15 days ago
exp_avg.pt
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
2.72 kB
LFS
Upload deepspeed checkpoint
15 days ago
exp_avg_sq.pt
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
2.73 kB
LFS
Upload deepspeed checkpoint
15 days ago
fp32.pt
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
2.64 kB
LFS
Upload deepspeed checkpoint
15 days ago
step.pt
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
852 Bytes
LFS
Upload deepspeed checkpoint
15 days ago