Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
cbecf3e
qwerrwe
/
examples
100 contributors
History:
107 commits
jrc
Add shifted sparse attention (#973) [skip-ci]
1d70f24
unverified
10 months ago
cerebras
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
code-llama
Add shifted sparse attention (#973) [skip-ci]
10 months ago
falcon
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
gptj
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
jeopardy-bot
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
llama-2
Add shifted sparse attention (#973) [skip-ci]
10 months ago
mamba
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
mistral
Set eval_sample_packing to false in mistral config.yaml (#1003)
11 months ago
mpt-7b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
openllama-3b
Add shifted sparse attention (#973) [skip-ci]
10 months ago
phi
pin model_revision for phi2 (#1123)
10 months ago
pythia-12b
Feat(wandb): Refactor to be more flexible (#767)
12 months ago
pythia
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
qwen
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
redpajama
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
replit-3b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
tiny-llama
streaming multipack for pretraining dataset (#959)
11 months ago
xgen-7b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
12 months ago
yi-34B-chat
Add an example config for finetuning a 34B model on a 24GB GPU (#1000)
11 months ago