Text Generation
Transformers
English
gpt_neox
red_pajama
Inference Endpoints

Failed when running with redpajama.cpp

#3
by carlosbdw - opened

❯ ./main -m ./models/RedPajama-INCITE-Chat-3B-v1-GGML/rp-chat-3b-v1-ggml-model-q4_0.bin
main: build = 544 (b9e0389)
main: seed = 1686306653
llama.cpp: loading model from ./models/RedPajama-INCITE-Chat-3B-v1-GGML/rp-chat-3b-v1-ggml-model-q4_0.bin
libc++abi: terminating due to uncaught exception of type std::runtime_error: unexpectedly reached end of file
[4] 34922 abort ./main -m
❯ ./main -m ./models/RedPajama-INCITE-Chat-3B-v1-GGML/rp-chat-3b-v1-ggml-model-q5_1.bin
main: build = 544 (b9e0389)
main: seed = 1686306676
llama.cpp: loading model from ./models/RedPajama-INCITE-Chat-3B-v1-GGML/rp-chat-3b-v1-ggml-model-q5_1.bin
libc++abi: terminating due to uncaught exception of type std::runtime_error: unexpectedly reached end of file
[4] 35059 abort ./main -m

Sign up or log in to comment