This doesn't work for Llama3 models
I tried converting a Llama-3 model that I merged to GGUF, but it didn't work:
https://huggingface.co/birgermoell/Llama-3-dare_ties
Error converting to fp16:

```
Traceback (most recent call last):
  File "/home/user/app/llama.cpp/convert.py", line 1548, in <module>
    main()
  File "/home/user/app/llama.cpp/convert.py", line 1515, in main
    vocab, special_vocab = vocab_factory.load_vocab(vocab_types, model_parent_path)
  File "/home/user/app/llama.cpp/convert.py", line 1417, in load_vocab
    vocab = self._create_vocab_by_path(vocab_types)
  File "/home/user/app/llama.cpp/convert.py", line 1407, in _create_vocab_by_path
    raise FileNotFoundError(f"Could not find a tokenizer matching any of {vocab_types}")
FileNotFoundError: Could not find a tokenizer matching any of ['spm', 'hfft']
```
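For context, the traceback shows convert.py giving up after looking for a SentencePiece tokenizer ('spm') and a Hugging Face fast tokenizer ('hfft'). A quick way to see which tokenizer files a repo actually ships is to list its files; here is a minimal sketch using huggingface_hub (my own addition, not something the Space runs), with the repo ID taken from the link above:

```python
# Minimal sketch: list the repo's files to see which tokenizer
# artifacts it contains (requires `pip install huggingface_hub`).
from huggingface_hub import list_repo_files

files = list_repo_files("birgermoell/Llama-3-dare_ties")

# convert.py's 'spm' path expects a SentencePiece tokenizer.model;
# Llama 3 checkpoints ship a BPE tokenizer.json instead.
print([f for f in files if "token" in f])
```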
I get the same error; it would be cool if the developers fixed it.
Llama 3 is not supported at the moment; support is being added in this PR:
https://github.com/ggerganov/llama.cpp/pull/6745
I still don't understand: is this temporary, or is it not supported at all?
https://github.com/ggerganov/llama.cpp/pull/6745
This PR has been merged, so Llama 3 is now supported.
Please update this app.
Just rebuild it. This will work.
Just restarted the app to pull the latest llama.cpp; running some quick tests on it now.
Alright, made a small patch for Llama models to go through the hf-convert script, and it works now: https://huggingface.co/reach-vb/llama-3-8b-Q8_0-GGUF 🤗
https://github.com/ggerganov/llama.cpp/blob/master/docs/HOWTO-add-model.md
Convert the model to GGUF
This step is done in Python with a convert script using the gguf library. Depending on the model architecture, you can use either convert.py or convert-hf-to-gguf.py.
Looks like convert-hf-to-gguf.py can convert any of the supported HF model architectures.
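For reference, here is a minimal sketch of that conversion step, assuming llama.cpp is checked out locally and the merged model has been downloaded into a local directory (all paths and file names below are illustrative, not taken from the thread):

```python
# Minimal sketch: invoke llama.cpp's HF-aware convert script on a local
# Hugging Face checkpoint directory (paths are illustrative assumptions).
import subprocess

subprocess.run(
    [
        "python", "llama.cpp/convert-hf-to-gguf.py",
        "models/Llama-3-dare_ties",          # local HF checkpoint directory
        "--outfile", "llama-3-dare_ties-f16.gguf",
        "--outtype", "f16",                  # write an fp16 GGUF first
    ],
    check=True,
)
```

The fp16 GGUF can then be quantized (e.g. to Q8_0) with llama.cpp's quantize tool, which matches the "converting to fp16" step in the error above.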
Cool, so I think this can now be closed!
Please confirm, @birgermoell.
It worked as intended for me.
(closing since it is fixed)