Neo Dim (NeoDim)
AI & ML interests: None yet
Recent Activity
- liked a model 2 days ago: bartowski/Confucius-o1-14B-GGUF
- liked a model 4 days ago: FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-Flash-32B-Preview
- liked a model 4 days ago: bartowski/FuseO1-DeepSeekR1-QwQ-SkyT1-Flash-32B-Preview-GGUF
Organizations: None yet
NeoDim's activity
- What is the prompt format? (13) - #1 opened 11 months ago by siddhesh22
- how did you convert `transformers.PreTrainedTokenizer` to ggml format? (1) - #2 opened over 1 year ago by keunwoochoi
- demo space (2) - #4 opened over 1 year ago by matthoffner
- Looks like the starchat-alpha-ggml-q4_1.bin is broken (8) - #3 opened over 1 year ago by xhyi
- missing tok_embeddings.weight error when trying to run with llama.cpp (2) - #1 opened over 1 year ago by ultra2mh
- Cannot run on llama.cpp and koboldcpp (3) - #1 opened over 1 year ago by FenixInDarkSolo
- Which inference repo is this quantized for? (3) - #2 opened over 1 year ago by xhyi
- Can the quantized model be loaded in gpu to have faster inference? (6) - #1 opened over 1 year ago by MohamedRashad