Update README.md
README.md
## Models available

These are all the available formats. The weight file is named `{model}.gguf` and the matching config file is `config-{base_model}.json`; for example, `model-xl-q4k` pairs the weights `model-xl-q4k.gguf` with the config `config-xl.json` (see the invocation sketch after the table).
| Model | Base model | Quantization | # Params | Size |
| ----- | ---------- | ------------ | -------- | ---- |
| - | [large](https://huggingface.co/grammarly/coedit-large) | None | 770M | 3.13 GB |
| model | large | 6k | 770M | 643 MB |
| model-q4k | large | 4k | 770M | 441 MB |
| model-q4_0 | large | 4_0 | 770M | 441 MB |
| - | [xl](https://huggingface.co/grammarly/coedit-xl) | None | 3B | 11.4 GB |
| model-xl | xl | 6k | 3B | 2.34 GB |
| model-xl-q4k | xl | 4k | 3B | 1.6 GB |
| model-xl-q4_0 | xl | 4_0 | 3B | 1.6 GB |
| - | [xxl](https://huggingface.co/grammarly/coedit-xxl) | None | 11B | 44.5 GB |
| model-xxl | xxl | 6k | 11B | 9.14 GB |
| model-xxl-q4k | xxl | 4k | 11B | 6.27 GB |
| model-xxl-q4_0 | xxl | 4_0 | 11B | 6.27 GB |
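
For concreteness, here is a minimal invocation sketch pairing one table row with its config, modeled on the `cargo run --example quantized-t5` command earlier in this README. The `--model-id`, `--weight-file`, and `--config-file` flags are assumed from candle's quantized-t5 example CLI, and `<your-quantized-repo>` is a placeholder for the Hugging Face repo hosting these files:

```bash
# Sketch: run the xl model with q4k weights (1.6 GB) from the table above.
# Assumes candle's quantized-t5 example exposes --weight-file / --config-file
# for selecting files inside the repo; <your-quantized-repo> is a placeholder.
$ cargo run --example quantized-t5 --release -- \
    --model-id "<your-quantized-repo>" \
    --weight-file "model-xl-q4k.gguf" \
    --config-file "config-xl.json" \
    --prompt "Fix grammatical errors in this sentence: candle are a minimalist ML framework."
```

Note how the file pair follows the naming convention: the weight file carries the full `{model}` name (`model-xl-q4k.gguf`), while the config only depends on the `{base_model}` (`config-xl.json`), so all quantizations of the same base share one config.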

## Model generation