Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fla-hub
/
gsa-1.3B-100B
like
0
Follow
fla-hub
32
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv:
2409.07146
License:
mit
Model card
Files
Files and versions
Community
1
main
gsa-1.3B-100B
2 contributors
History:
8 commits
yzhangcs
Upload GSAForCausalLM
1e4ffda
verified
7 days ago
.gitattributes
1.52 kB
initial commit
8 months ago
README.md
250 Bytes
Upload GSAForCausalLM
7 days ago
config.json
1.02 kB
Upload GSAForCausalLM
7 days ago
generation_config.json
132 Bytes
Remove the `norm_first` option
10 days ago
model.safetensors
2.75 GB
LFS
Upload GSAForCausalLM
7 days ago
special_tokens_map.json
551 Bytes
Upload GSAForCausalLM
8 months ago
tokenizer.json
1.8 MB
Upload GSAForCausalLM
8 months ago
tokenizer.model
493 kB
LFS
Upload GSAForCausalLM
8 months ago
tokenizer_config.json
995 Bytes
Update tokenizer_config.json
6 months ago