Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
476a205
qwerrwe
100 contributors
History:
1084 commits
Haoxiang-Wang
Remove learning rate scheduler in deepspeed config to avoid conflict (#909)
476a205
unverified
12 months ago
.github
try #2: pin hf transformers and accelerate to latest release, don't reinstall pytorch (#867)
about 1 year ago
deepspeed
Remove learning rate scheduler in deepspeed config to avoid conflict (#909)
12 months ago
docker
don't compile deepspeed or bitsandbytes from source (#837)
about 1 year ago
docs
feat(doc): add dummyoptim faq fix (#802)
about 1 year ago
examples
fix: remove FA for qwen examples (#900)
about 1 year ago
image
badge (#739)
about 1 year ago
scripts
Create preprocess CLI (#785)
about 1 year ago
src
fix for qwen w lora (#906)
12 months ago
tests
Feat: Add warmup_ratio (#893)
about 1 year ago
.bandit
Safe
38 Bytes
Add bandit
over 1 year ago
.editorconfig
Safe
186 Bytes
WIP for axolotl trainer
over 1 year ago
.flake8
Safe
88 Bytes
Update ignores
over 1 year ago
.gitattributes
Safe
49 Bytes
make it work with pythia in the cloud
over 1 year ago
.gitignore
Safe
3.18 kB
ignore wandb to resolve isort headaches (#619)
about 1 year ago
.isort.cfg
Safe
49 Bytes
ignore wandb to resolve isort headaches (#619)
about 1 year ago
.mypy.ini
Safe
710 Bytes
Support Sample packing for phi arch (#586)
about 1 year ago
.pre-commit-config.yaml
Safe
896 Bytes
better py3 support w pre-commit
over 1 year ago
.pylintrc
Safe
603 Bytes
Ignore too-many-instance-attributes
over 1 year ago
FAQS.md
Safe
648 Bytes
Update FAQS.md
over 1 year ago
LICENSE
Safe
11.4 kB
add apache 2.0 license
over 1 year ago
README.md
37.4 kB
Feat: Add Qwen (#894)
about 1 year ago
TODO.md
Safe
262 Bytes
fdsp config dict fix, todo list, add torchdistx support
over 1 year ago
docker-compose.yaml
Safe
701 Bytes
add git environment variables to compose: avoid checkout failure error 128 on build (#534)
about 1 year ago
requirements-dev.txt
Safe
22 Bytes
Add mypy
over 1 year ago
requirements-tests.txt
Safe
7 Bytes
use requirements file for tests
over 1 year ago
requirements.txt
552 Bytes
update datasets version to cut down the warnings due to pyarrow arg change (#897)
about 1 year ago
setup.py
1.74 kB
Mistral: Sliding Window Attention with Flash Attention and Sample Packing (#732)
about 1 year ago