Spaces:

Dovakiins
/

qwerrwe

Build error

App Files Files Community

qwerrwe / examples

52.8 kB

100 contributors

History: 107 commits

jrc's picture

jrc

Add shifted sparse attention (#973) [skip-ci]

1d70f24 unverified almost 2 years ago

cerebras
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
code-llama
Add shifted sparse attention (#973) [skip-ci] almost 2 years ago
falcon
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
gptj
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
jeopardy-bot
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
llama-2
Add shifted sparse attention (#973) [skip-ci] almost 2 years ago
mamba
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
mistral
Set eval_sample_packing to false in mistral config.yaml (#1003) about 2 years ago
mpt-7b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
openllama-3b
Add shifted sparse attention (#973) [skip-ci] almost 2 years ago
phi
pin model_revision for phi2 (#1123) almost 2 years ago
pythia-12b
Feat(wandb): Refactor to be more flexible (#767) about 2 years ago
pythia
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
qwen
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
redpajama
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
replit-3b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
tiny-llama
streaming multipack for pretraining dataset (#959) almost 2 years ago
xgen-7b
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) about 2 years ago
yi-34B-chat
Add an example config for finetuning a 34B model on a 24GB GPU (#1000) about 2 years ago