new lr, sample pack
4c8ddf2
-
cerebras
prepared dataset caching, other misc fixes (#665)
-
code-llama
prepared dataset caching, other misc fixes (#665)
-
falcon
prepared dataset caching, other misc fixes (#665)
-
gptj
prepared dataset caching, other misc fixes (#665)
-
jeopardy-bot
prepared dataset caching, other misc fixes (#665)
-
llama-2
prepared dataset caching, other misc fixes (#665)
-
mistral
new lr, sample pack
-
mpt-7b
prepared dataset caching, other misc fixes (#665)
-
openllama-3b
prepared dataset caching, other misc fixes (#665)
-
phi
prepared dataset caching, other misc fixes (#665)
-
pythia-12b
prepared dataset caching, other misc fixes (#665)
-
pythia
prepared dataset caching, other misc fixes (#665)
-
redpajama
prepared dataset caching, other misc fixes (#665)
-
replit-3b
prepared dataset caching, other misc fixes (#665)
-
xgen-7b
prepared dataset caching, other misc fixes (#665)