Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nanotron
/
doremi-llama-2.5b-reference
like
0
Follow
Nanotron Research
25
License:
mit
Model card
Files
Files and versions
Community
4a65de7
doremi-llama-2.5b-reference
/
70000
/
optimizer
/
optimizer_config.json
neuralink
HF staff
add the 70k checkpoint
4a65de7
10 months ago
raw
Copy download link
history
blame
Safe
124 Bytes
{
"type"
:
"OptimizerFromGradientAccumulator"
,
"parallelism"
:
{
"tp_size"
:
"8"
,
"dp_size"
:
"8"
,
"pp_size"
:
"1"
}
,
"configs"
:
{
}
}