LlamaEdge compatible quants for Gemma-3-it models.
AI & ML interests
Run open source LLMs across CPU and GPU without changing the binary in Rust and Wasm locally!
Recent Activity
View all activity
Organization Card
Run Open source LLMs and create OpenAI-compatible API services for the Llama2 series of LLMs locally With LlamaEdge!
Give it a try
Run a single command in your command line terminal.
bash <(curl -sSfL 'https://raw.githubusercontent.com/LlamaEdge/LlamaEdge/main/run-llm.sh') --interactive
Follow the on-screen instructions to install the WasmEdge Runtime and download your favorite open-source LLM. Then, choose whether you want to chat with the model via the CLI or via a web UI.
See it in action | GitHub | Docs
Why?
LlamaEdge, powered by Rust and WasmEdge, provides a strong alternative to Python in AI inference.
- Lightweight. The total runtime size is 30MB.
- Fast. Full native speed on GPUs.
- Portable. Single cross-platform binary on different CPUs, GPUs, and OSes.
- Secure. Sandboxed and isolated execution on untrusted devices.
- Container-ready. Supported in Docker, containerd, Podman, and Kubernetes.
Learn more
Please visit the LlamaEdge project to learn more.
Collections
12
LlamaEdge compatible quants for DeepSeek-R1 distilled models.
-
second-state/DeepSeek-R1-Distill-Qwen-1.5B-GGUF
Text Generation • Updated • 2.64k -
second-state/DeepSeek-R1-Distill-Qwen-7B-GGUF
Text Generation • Updated • 1.64k -
second-state/DeepSeek-R1-Distill-Qwen-14B-GGUF
Text Generation • Updated • 3.19k -
second-state/DeepSeek-R1-Distill-Qwen-32B-GGUF
Text Generation • Updated • 3.06k
models
239

second-state/gemma-3-27b-it-GGUF
Image-Text-to-Text
•
Updated
•
877

second-state/gemma-3-4b-it-GGUF
Image-Text-to-Text
•
Updated
•
791

second-state/gemma-3-1b-it-GGUF
Text Generation
•
Updated
•
371

second-state/gemma-3-12b-it-GGUF
Image-Text-to-Text
•
Updated
•
686

second-state/MiniCPM-o-2_6-GGUF
Any-to-Any
•
Updated
•
290

second-state/llama-3.2-Korean-Bllossom-3B-GGUF
Text Generation
•
Updated
•
403

second-state/QwQ-32B-GGUF
Text Generation
•
Updated
•
867

second-state/Phi-4-mini-instruct-GGUF
Text Generation
•
Updated
•
1.12k

second-state/OuteTTS-0.2-500M-GGUF
Text-to-Speech
•
Updated
•
678

second-state/gte-Qwen2-7B-instruct-GGUF
Updated
•
1.5k
datasets
None public yet