Transformers
GGUF
English
Inference Endpoints
mradermacher committed on
Commit
8f124a2
1 Parent(s): bb88d5b

auto-patch README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
```diff
@@ -9,7 +9,8 @@ library_name: transformers
 license: other
 license_link: https://falconllm.tii.ae/falcon-mamba-7b-terms-and-conditions.html
 license_name: falcon-mamba-7b-license
-no_imatrix: "llama.cpp/ggml/src/ggml-cuda/norm.cu:212: GGML_ASSERT(ggml_is_contiguous(src0)) failed"
+no_imatrix: 'llama.cpp/ggml/src/ggml-cuda/norm.cu:212: GGML_ASSERT(ggml_is_contiguous(src0))
+  failed'
 quantized_by: mradermacher
 ---
 ## About
@@ -22,7 +23,6 @@ quantized_by: mradermacher
 static quants of https://huggingface.co/tiiuae/falcon-mamba-7b
 
 <!-- provided-files -->
-weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion.
 ## Usage
 
 If you are unsure how to use GGUF files, refer to one of [TheBloke's
```
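The substantive change in the `no_imatrix` field is purely one of quoting: the double-quoted one-line value becomes a single-quoted scalar wrapped across two lines. By YAML's line-folding rule, a line break inside a quoted scalar is read back as a single space, so both forms parse to the same string. A minimal sketch of that rule, assuming a simplified model that ignores blank-line and escape handling (`fold_yaml_scalar` is a hypothetical helper for illustration, not part of any YAML library):

```python
def fold_yaml_scalar(physical_lines):
    """Simplified model of YAML line folding inside a quoted scalar:
    each physical line break is replaced by a single space, and the
    indentation of continuation lines is stripped."""
    return " ".join(line.strip() for line in physical_lines)


# The two physical lines of the re-wrapped single-quoted value
# (quotes removed, as a parser would see the scalar content):
folded = fold_yaml_scalar([
    "llama.cpp/ggml/src/ggml-cuda/norm.cu:212: GGML_ASSERT(ggml_is_contiguous(src0))",
    "failed",
])

# Matches the original one-line double-quoted value exactly.
print(folded)
```

This is why an automated metadata re-dump (such as this auto-patch) can re-wrap a long value at a line-width limit without changing its parsed meaning.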