llama.cpp has changed its model file format from GGML to GGUF. This is a breaking change: existing GGML model checkpoints/weights no longer work for llama.cpp users. See:
- https://github.com/ggerganov/llama.cpp/pull/2398

This is a temporary upload of Llama-2 models in GGUF format, produced by running llama.cpp/convert-llama-ggmlv3-to-gguf.py on the existing GGML models. It is a stopgap until official, natively produced GGUF model checkpoints are uploaded.
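
If it is unclear whether a downloaded file is in the old or the new format, the first bytes of the file can be checked. The following is a minimal sketch and not part of llama.cpp: the function names `classify_header` and `detect_format` are hypothetical. It assumes GGUF files start with the ASCII bytes `GGUF` followed by a little-endian 32-bit version, and that the older formats store their magic as a little-endian integer, so the bytes appear reversed on disk.

```python
import struct

# Magic bytes as they appear at the start of the file on disk.
GGUF_MAGIC = b"GGUF"

# Assumption: legacy magics are written as little-endian uint32 values,
# so 'ggjt' (0x67676a74) shows up on disk as b"tjgg", and so on.
LEGACY_MAGICS = {
    b"lmgg": "legacy GGML (unversioned 'ggml' magic)",
    b"fmgg": "legacy GGML ('ggmf' magic)",
    b"tjgg": "legacy GGML ('ggjt' magic, used by GGML v3 files)",
}


def classify_header(header: bytes) -> str:
    """Classify a model file from its first 8 bytes."""
    magic = header[:4]
    if magic == GGUF_MAGIC:
        if len(header) < 8:
            return "GGUF (truncated header)"
        # The GGUF version follows the magic as a little-endian uint32.
        (version,) = struct.unpack("<I", header[4:8])
        return f"GGUF v{version}"
    return LEGACY_MAGICS.get(magic, "unknown")


def detect_format(path: str) -> str:
    """Read the first 8 bytes of a file and classify them."""
    with open(path, "rb") as f:
        return classify_header(f.read(8))
```

For example, `detect_format("model.bin")` (a hypothetical path) returns a string beginning with `GGUF` for files that current llama.cpp builds can load, and a string beginning with `legacy GGML` for files that would first need conversion.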
