Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
yulan-team
/
YuLan-Mini-Before-Annealing
like
6
Follow
RUC-GSAI-YuLan
28
Safetensors
optimizer_states
arxiv:
2412.17743
License:
mit
Model card
Files
Files and versions
Community
2
cb90a53
YuLan-Mini-Before-Annealing
/
global_step262772_universal
1 contributor
History:
1 commit
IvanHU
Upload deepspeed checkpoint
85d4dac
14 days ago
zero
Upload deepspeed checkpoint
14 days ago
mp_rank_00_model_states.pt
pickle
Detected Pickle imports (5)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.Size"
,
"torch.BFloat16Storage"
,
"__builtin__.set"
How to fix it?
4.47 GB
LFS
Upload deepspeed checkpoint
14 days ago