Zephyr 7B Beta Llamafiles

See here for a guide on how to use llamafiles!

Both the server and the CLI are based on TheBloke's Zephyr 7B Beta GGUF Q4_K_M model.

Usage

NOTE: Because the executable is larger than 4GB, it is not currently compatible with Windows, which cannot run executables above that size. I will post a Windows-friendly version of Zephyr 7B Beta when I can.

# downloads the server build; substitute the CLI llamafile's URL and filename if you prefer the CLI
wget https://huggingface.co/TimeSurgeLabs/zephyr-7b-beta-llamafile/resolve/main/zephyr-beta-server.llamafile
chmod +x zephyr-beta-server.llamafile
./zephyr-beta-server.llamafile
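
Once the server is running, it can be queried over HTTP from a second terminal. The following is a minimal sketch, not an official client: it assumes the server is listening on http://localhost:8080 and exposes the llama.cpp-style /completion endpoint with "prompt" and "n_predict" fields, so check the server's startup output if your setup differs. The helper names build_payload and ask_zephyr are hypothetical, and the prompt is sent raw, without Zephyr's chat template.

```shell
# Hypothetical helper: build the JSON body for a completion request.
# Assumes the prompt contains no double quotes, backslashes, or newlines,
# since no JSON escaping is performed here.
build_payload() {
  printf '{"prompt": "%s", "n_predict": %d}' "$1" "${2:-128}"
}

# Hypothetical helper: send a prompt to the locally running server.
# Assumes the address http://localhost:8080; adjust if yours differs.
ask_zephyr() {
  curl -s http://localhost:8080/completion \
    -H 'Content-Type: application/json' \
    -d "$(build_payload "$1" "${2:-128}")"
}

# Example (requires the server started above to be running):
# ask_zephyr "Explain what a llamafile is in one sentence." 64
```

The second argument caps the number of generated tokens and defaults to 128 when omitted. Splitting payload construction from the HTTP call makes it easy to inspect the request body before sending it.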