John Leimgruber III
ubergarm
AI & ML interests
Open LLMs and astrophotography image processing.
Organizations
None yet
ubergarm's activity
Observation: 4-bit quantization can't answer the Strawberry prompt · 11 · #2 opened 28 days ago by ThePabli
63.17 MMLU-Pro Computer Science with `Q8_0` · #2 opened about 1 month ago by ubergarm
Benchmarks worse than Qwen2.5-7B-Instruct on MMLU-Pro Computer Science in limited testing. · #1 opened about 1 month ago by ubergarm
Promising-looking results for folks with 24GB VRAM! · 9 · #3 opened about 2 months ago by ubergarm
Awesome model · 6 · #5 opened 2 months ago by dillfrescott
VRAM usage of each? · 3 · #1 opened 2 months ago by jasonden
Works well generating Python on my 64GB RAM dev box with a 3090 Ti (24GB VRAM) · 3 · #2 opened 4 months ago by ubergarm
Chat template · 3 · #3 opened 4 months ago by sydneyfong
Can you please provide the command to change the context size? · 5 · #1 opened 4 months ago by yehiaserag (see the context-size sketch after this list)
The first GGUF that works with long context on llama.cpp! · 3 · #1 opened 4 months ago by ubergarm
And where is the GGUF file itself? · 12 · #1 opened 4 months ago by Anonimus12345678902
Got it working in llama.cpp! Thanks! · 1 · #1 opened 4 months ago by ubergarm
Error loading model in llama.cpp? · 8 · #1 opened 4 months ago by ubergarm
Prompt Format · 4 · #6 opened 6 months ago by JamesConley
Quantized model coming? · 8 · #3 opened 7 months ago by dnhkng
Output is empty · 2 · #3 opened 6 months ago by bingw5
The f16 with 32k ctx fits nicely in 24GB VRAM · 5 · #3 opened 7 months ago by ubergarm (see the VRAM estimate sketch after this list)
AttributeError: 'generator' object has no attribute 'image_embeddings' · 1 · #26 opened 9 months ago by MohamedRashad
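
On the context-size question above (#1 by yehiaserag): a minimal sketch of setting the context window via the llama-cpp-python bindings, assuming a locally downloaded GGUF file. The model path and prompt below are placeholders, not files from that thread; with the llama.cpp CLI, the corresponding flag is `-c`/`--ctx-size`.

```python
# Minimal sketch: setting the context window with llama-cpp-python.
# Assumes `pip install llama-cpp-python`; "model.gguf" is a placeholder
# path, not a file from the discussion above.
from llama_cpp import Llama

llm = Llama(
    model_path="model.gguf",  # placeholder GGUF path
    n_ctx=32768,              # context size; llama.cpp CLI equivalent: -c / --ctx-size
)

out = llm("Name one use of a large context window.", max_tokens=64)
print(out["choices"][0]["text"])
```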
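
On the f16/32k-context claim above (#3 by ubergarm), a back-of-the-envelope check of why it can fit: f16 weights plus an f16 KV cache. Every architecture number below (7B parameters, 32 layers, 8 KV heads, head dimension 128) is an assumption for a typical 7B-class GQA model, not a detail from the thread.

```python
# Rough VRAM estimate: f16 weights plus an f16 KV cache at 32k context.
# All architecture numbers are assumed for illustration (typical 7B-class
# GQA model), not taken from the discussion above.
n_params   = 7e9      # assumed parameter count
bytes_f16  = 2        # bytes per f16 value
n_layers   = 32       # assumed transformer layers
n_kv_heads = 8        # assumed KV heads (grouped-query attention)
head_dim   = 128      # assumed head dimension
n_ctx      = 32768    # 32k context

weights_gib = n_params * bytes_f16 / 2**30
# KV cache: 2 tensors (K and V) per layer, each n_ctx x n_kv_heads x head_dim.
kv_gib = 2 * n_layers * n_ctx * n_kv_heads * head_dim * bytes_f16 / 2**30

print(f"weights ~{weights_gib:.1f} GiB + KV cache ~{kv_gib:.1f} GiB "
      f"= ~{weights_gib + kv_gib:.1f} GiB, vs 24 GiB VRAM")
```

Under these assumptions the total comes to roughly 17 GiB, leaving headroom for activations and runtime overhead, which squares with the claim that f16 at 32k context fits in 24GB.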