Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
jamesburton
/
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-GGUF
like
0
GGUF
English
Inference Endpoints
imatrix
conversational
License:
mit
Model card
Files
Files and versions
Community
Deploy
Use this model
7398b7b
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-GGUF
1 contributor
History:
14 commits
jamesburton
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q8_0-imat.gguf
7398b7b
verified
6 months ago
imatrix
Added GGUF generation script and configuration, please brief note
6 months ago
.gitattributes
2.52 kB
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q8_0-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-IQ3_M-imat.gguf
9.84 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-IQ3_M-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-IQ3_XXS-imat.gguf
8.46 GB
LFS
README, imatrix used, and IQ3_XXS GGUF model
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-IQ4_NL-imat.gguf
12 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-IQ4_NL-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-IQ4_XS-imat.gguf
11.4 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-IQ4_XS-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q4_K_M-imat.gguf
12.9 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q4_K_M-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q4_K_S-imat.gguf
12.1 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q4_K_S-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q5_K_M-imat.gguf
14.9 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q5_K_M-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q5_K_S-imat.gguf
14.5 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q5_K_S-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q6_K-imat.gguf
17.2 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q6_K-imat.gguf
6 months ago
Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q8_0-imat.gguf
22 GB
LFS
Upload Phi-3-mini-4k-instruct-LLaMAfied-8xMoE-raw-Q8_0-imat.gguf
6 months ago
README.md
814 Bytes
Added initial model-card elements to README
6 months ago
gguf-imat.py
7.83 kB
Added script used to generate GGUF files
6 months ago
imatrix.dat
16.7 MB
LFS
README, imatrix used, and IQ3_XXS GGUF model
6 months ago