Dipto084/llama31-8b-gdpo-v7-step35

16.1 GB

1 contributor

History: 2 commits

Initial: Llama-3.1-8B GDPO v7 step 35 (merged) — best safety/helpfulness balance per PHTest+val_20

ff5a1bb verified 23 days ago

File                     Size            Updated       Last commit
.gitattributes           1.52 kB         23 days ago   initial commit
chat_template.jinja      4.61 kB         23 days ago   Initial: Llama-3.1-8B GDPO v7 step 35 (merged) — best safety/helpfulness balance per PHTest+val_20
config.json              1.12 kB         23 days ago   Initial: Llama-3.1-8B GDPO v7 step 35 (merged) — best safety/helpfulness balance per PHTest+val_20
generation_config.json   183 Bytes       23 days ago   Initial: Llama-3.1-8B GDPO v7 step 35 (merged) — best safety/helpfulness balance per PHTest+val_20
model.safetensors        16.1 GB (xet)   23 days ago   Initial: Llama-3.1-8B GDPO v7 step 35 (merged) — best safety/helpfulness balance per PHTest+val_20
tokenizer.json           9.09 MB         23 days ago   Initial: Llama-3.1-8B GDPO v7 step 35 (merged) — best safety/helpfulness balance per PHTest+val_20
tokenizer_config.json    55.4 kB         23 days ago   Initial: Llama-3.1-8B GDPO v7 step 35 (merged) — best safety/helpfulness balance per PHTest+val_20