Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Crystalcareai
/
Quiet-Star-Custom
like
11
Text Generation
Transformers
Safetensors
open-web-math/open-web-math
quiet
custom_code
arxiv:
2403.09629
Model card
Files
Files and versions
Community
2
Train
Use this model
866d907
Quiet-Star-Custom
1 contributor
History:
587 commits
Crystalcareai
Update configuration_quiet.py
866d907
verified
8 months ago
.gitattributes
Safe
1.52 kB
initial commit
8 months ago
README.md
Safe
196 Bytes
Upload folder using huggingface_hub
8 months ago
added_tokens.json
Safe
59 Bytes
Upload folder using huggingface_hub
8 months ago
config.json
Safe
1.25 kB
Update config.json
8 months ago
configuration_quiet.py
Safe
8.21 kB
Update configuration_quiet.py
8 months ago
generate.py
Safe
7.32 kB
Update generate.py
8 months ago
generation_config.json
Safe
116 Bytes
Upload folder using huggingface_hub
8 months ago
inference.py
Safe
4.84 kB
Update inference.py
8 months ago
model-00001-of-00003.safetensors
Safe
4.94 GB
LFS
Upload folder using huggingface_hub
8 months ago
model-00002-of-00003.safetensors
Safe
5 GB
LFS
Upload folder using huggingface_hub
8 months ago
model-00003-of-00003.safetensors
Safe
4.64 GB
LFS
Upload folder using huggingface_hub
8 months ago
model.safetensors.index.json
Safe
24.4 kB
Upload folder using huggingface_hub
8 months ago
modeling_quiet.py
Safe
102 kB
Update modeling_quiet.py
8 months ago
optuna.py
Safe
5.03 kB
Create optuna.py
8 months ago
sft-dora-alpaca.py
Safe
5.44 kB
Rename train-dora-alpaca.py to sft-dora-alpaca.py
8 months ago
special_tokens_map.json
Safe
772 Bytes
Upload folder using huggingface_hub
8 months ago
tokenization_quiet.py
Safe
19.9 kB
Upload tokenization_quiet.py
8 months ago
tokenizer.json
Safe
1.8 MB
Upload folder using huggingface_hub
8 months ago
tokenizer.model
Safe
493 kB
LFS
Upload folder using huggingface_hub
8 months ago
tokenizer_config.json
Safe
1.38 kB
Update tokenizer_config.json
8 months ago
train-h100-sharegpt-sft.py
Safe
6.56 kB
Update train-h100-sharegpt-sft.py
8 months ago