Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Dipto084
/
llama31-8b-gdpo-v7-step35

Safetensors
llama
Model card Files Files and versions
xet
Community
llama31-8b-gdpo-v7-step35
16.1 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
Dipto084's picture
Dipto084
Initial: Llama-3.1-8B GDPO v7 step 35 (merged) β€” best safety/helpfulness balance per PHTest+val_20
ff5a1bb verified 23 days ago
  • .gitattributes
    1.52 kB
    initial commit 23 days ago
  • chat_template.jinja
    4.61 kB
    Initial: Llama-3.1-8B GDPO v7 step 35 (merged) β€” best safety/helpfulness balance per PHTest+val_20 23 days ago
  • config.json
    1.12 kB
    Initial: Llama-3.1-8B GDPO v7 step 35 (merged) β€” best safety/helpfulness balance per PHTest+val_20 23 days ago
  • generation_config.json
    183 Bytes
    Initial: Llama-3.1-8B GDPO v7 step 35 (merged) β€” best safety/helpfulness balance per PHTest+val_20 23 days ago
  • model.safetensors
    16.1 GB
    xet
    Initial: Llama-3.1-8B GDPO v7 step 35 (merged) β€” best safety/helpfulness balance per PHTest+val_20 23 days ago
  • tokenizer.json
    9.09 MB
    Initial: Llama-3.1-8B GDPO v7 step 35 (merged) β€” best safety/helpfulness balance per PHTest+val_20 23 days ago
  • tokenizer_config.json
    55.4 kB
    Initial: Llama-3.1-8B GDPO v7 step 35 (merged) β€” best safety/helpfulness balance per PHTest+val_20 23 days ago