Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nanotron
/
doremi-llama-2.5b-reference
like
0
Follow
Nanotron Research
25
License:
mit
Model card
Files
Files and versions
Community
4a65de7
doremi-llama-2.5b-reference
/
70000
/
optimizer
1 contributor
History:
1 commit
neuralink
HF staff
add the 70k checkpoint
4a65de7
10 months ago
optimizer_config.json
Safe
124 Bytes
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-0-of-8.pt
Safe
3.78 GB
LFS
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-1-of-8.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
3.78 GB
LFS
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-2-of-8.pt
Safe
3.78 GB
LFS
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-3-of-8.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
3.78 GB
LFS
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-4-of-8.pt
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
3.78 GB
LFS
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-5-of-8.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
3.78 GB
LFS
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-6-of-8.pt
Safe
3.78 GB
LFS
add the 70k checkpoint
10 months ago
optimizer_pp-0-of-1_tp-7-of-8.pt
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
3.78 GB
LFS
add the 70k checkpoint
10 months ago