qwerrwe / docker

Commit History

jupyter lab fixes (#1139) [skip ci]
eaaeefc
unverified

winglian committed on

Dockerfile cloud ports (#1148)
729740d
unverified

winglian committed on

Agnostic cloud gpu docker image and Jupyter lab (#1097)
ece0211
unverified

winglian committed on

misc fixes from #943 (#1086) [skip ci]
23495a8
unverified

winglian committed on

fix: warn user to install mamba_ssm package (#1019)
d69ba2b
unverified

Nanobit committed on

attempt to also run e2e tests that needs gpus (#1070)
788649f
unverified

winglian committed on

Add tests to Docker (#993)
2e61dc3
unverified

hamel committed on

Dockerfile torch fix (#987)
161bcb6
unverified

winglian committed on

fix for build for nccl in dockerfile (#970)
85de004
unverified

winglian committed on

update to latest nccl in docker image (#965)
80ec7af
unverified

winglian committed on

Mixtral multipack (#928)
68b227a
unverified

winglian committed on

don't compile deepspeed or bitsandbytes from source (#837)
f544ab2
unverified

winglian committed on

add deepspeed-kernels dependency for deepspeed>=0.12.0 (#827)
8056ecd
unverified

fpreiss committed on

fix pytorch 2.1.0 build, add multipack docs (#722)
2aa1f71
unverified

winglian committed on

apex not needed as amp is part of pytorch (#696)
aca0398
unverified

winglian committed on

fix multiline for docker (#694)
de87ea6
unverified

winglian committed on

Feat: Set WORKDIR to /workspace/axolotl (#679)
133e676
unverified

Nanobit committed on

tweak: improve base builder for smaller layers (#500)
923eb91
unverified

Maxime committed on

let MAX_JOBS use the default since we're not resource constrained on our self-hosted runners (#427)
e85d2eb
unverified

winglian committed on

update dockerfile to not build evoformer since it fails the build (#607)
b53e777
unverified

winglian committed on

update readme to point to direct link to runpod template, cleanup install instructions (#532)
34c0a86
unverified

winglian committed on

Add support for GPTQ using native transformers/peft (#468)
3355706
unverified

winglian committed on

remove --force-reinstall from Dockerfile to ensure correct pytorch version (#492)
e356b29
unverified

tmm1 committed on

flash attn pip install (#426)
cf66547
unverified

mhenrichsen, Ubuntu, Mads Henrichsen, winglian committed on

bump flash-attn to 2.0.4 for the base docker image (#382)
ffac902
unverified

winglian committed on

add peft install back since it doesn't get installed by setup.py (#331)
db2a358
unverified

winglian committed on

pin accelerate so it works with llama2 (#330)
6c9a87c
unverified

winglian committed on

Prune cuda117 (#327)
2c37bf6
unverified

winglian committed on

add runpod envs to .bashrc, fix bnb env (#316)
cf62cfd
unverified

winglian committed on

pin flash attention 2 to the fix for backwards pass
cdf85fd

winglian committed on

flash attention 2
9b790d3

winglian committed on

explicitly pin flash attention 1 to v1.0.9
b06d3e3

winglian committed on

misc fixes
d75adb9

winglian committed on

set transformers cache env var in docker image
f162f3c

winglian committed on

git fetch fix for docker
eca3531

winglian committed on

pin pydantic so deepspeed isn't broken
7145695

winglian committed on

update pip install command for apex
530809f

winglian committed on

shallow clone
5cd2126

winglian committed on

clone in docker
12620f3

winglian committed on

py310, fix cuda arg in deepspeed
c43c5c8

winglian committed on

Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq
bbc5bc5
unverified

winglian committed on

Lint and format
392dfd9

Nanobit committed on

cleanup from pr feedback
48612f8

winglian committed on

default to qlora support, make gptq specific image
6ef96f5

winglian committed on

move CUDA_VERSION_BNB arg inside of stage build scope
e43bcc6

winglian committed on

fix CUDA_VERSION_BNB env var
00323f0

winglian committed on

bnb fixes
21f17cc

winglian committed on

use python setup install, bdist wheel is unreliable in installing extension
809cceb

winglian committed on

ensure libbitsandbytes*.so gets included with wheel
a798ba1

winglian committed on

fix missing run continuation
cf37980

winglian committed on