bartowski
/

starcoder2-15b-instruct-v0.1-exl2

Text Generation

Transformers

code

Eval Results

Inference Endpoints

Model card Files Files and versions Community

bartowski commited on Apr 30

Commit

bd7fa7c

•

1 Parent(s): bc9d6a9

Update README.md

Browse files

Files changed (1) hide show

README.md +20 -21

README.md CHANGED Viewed

@@ -93,30 +93,36 @@ Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.20">turb
 Each branch contains an individual bits per weight, with the main one containing only the meaurement.json for further conversions.
-Conversion was done using the default calibration dataset.
-Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
 Original model: https://huggingface.co/bigcode/starcoder2-15b-instruct-v0.1
-<a href="https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/8_0">8.0 bits per weight</a>
-<a href="https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/6_5">6.5 bits per weight</a>
-<a href="https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/5_0">5.0 bits per weight</a>
-<a href="https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/4_25">4.25 bits per weight</a>
-<a href="https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/3_5">3.5 bits per weight</a>
 ## Download instructions
 With git:
 ```shell
-git clone --single-branch --branch 6_5 https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2
 ```
 With huggingface hub (credit to TheBloke for instructions):
@@ -125,25 +131,18 @@ With huggingface hub (credit to TheBloke for instructions):
 pip3 install huggingface-hub
 ```
-To download the `main` (only useful if you only care about measurement.json) branch to a folder called `starcoder2-15b-instruct-v0.1-exl2`:
-```shell
-mkdir starcoder2-15b-instruct-v0.1-exl2
-huggingface-cli download bartowski/starcoder2-15b-instruct-v0.1-exl2 --local-dir starcoder2-15b-instruct-v0.1-exl2 --local-dir-use-symlinks False
-```
-To download from a different branch, add the `--revision` parameter:
 Linux:
 ```shell
-mkdir starcoder2-15b-instruct-v0.1-exl2-6_5
 huggingface-cli download bartowski/starcoder2-15b-instruct-v0.1-exl2 --revision 6_5 --local-dir starcoder2-15b-instruct-v0.1-exl2-6_5 --local-dir-use-symlinks False
 ```
 Windows (which apparently doesn't like _ in folders sometimes?):
 ```shell
-mkdir starcoder2-15b-instruct-v0.1-exl2-6.5
 huggingface-cli download bartowski/starcoder2-15b-instruct-v0.1-exl2 --revision 6_5 --local-dir starcoder2-15b-instruct-v0.1-exl2-6.5 --local-dir-use-symlinks False
 ```

 Each branch contains an individual bits per weight, with the main one containing only the meaurement.json for further conversions.
 Original model: https://huggingface.co/bigcode/starcoder2-15b-instruct-v0.1
+## Prompt format
+```
+<|endoftext|>You are an exceptionally intelligent coding assistant that consistently delivers accurate and reliable responses to user instructions.
+### Instruction
+{prompt}
+### Response
+<|endoftext|>
+```
+## Available sizes
+| Branch | Bits | lm_head bits | VRAM (4k) | VRAM (16k) | VRAM (32k) | Description |
+| ----- | ---- | ------- | ------ | ------ | ------ | ------------ |
+| [8_0](https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/8_0) | 8.0 | 8.0 | 15.8 GB | 16.8 GB | 18.1 GB | Maximum quality that ExLlamaV2 can produce, near unquantized performance. |
+| [6_5](https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/6_5) | 6.5  | 8.0 | 13.9 GB | 14.9 GB | 16.2 GB | Near unquantized performance at vastly reduced size, **recommended**. |
+| [5_0](https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/5_0) | 5.0  | 6.0 | 11.0 GB | 12.0 GB | 13.2 GB | Slightly lower quality vs 6.5. |
+| [4_25](https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/4_25) | 4.25 | 6.0 | 9.5 GB | 10.5 GB | 11.8 GB | GPTQ equivalent bits per weight. |
+| [3_5](https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2/tree/3_5) | 3.5  | 6.0 | 8.1 GB | 9.1 GB | 10.4 GB | Lower quality, not recommended. |
 ## Download instructions
 With git:
 ```shell
+git clone --single-branch --branch 6_5 https://huggingface.co/bartowski/starcoder2-15b-instruct-v0.1-exl2 starcoder2-15b-instruct-v0.1-exl2-6_5
 ```
 With huggingface hub (credit to TheBloke for instructions):
 pip3 install huggingface-hub
 ```
+To download a specific branch, use the `--revision` parameter. For example, to download the 6.5 bpw branch:
 Linux:
 ```shell
 huggingface-cli download bartowski/starcoder2-15b-instruct-v0.1-exl2 --revision 6_5 --local-dir starcoder2-15b-instruct-v0.1-exl2-6_5 --local-dir-use-symlinks False
 ```
 Windows (which apparently doesn't like _ in folders sometimes?):
 ```shell
 huggingface-cli download bartowski/starcoder2-15b-instruct-v0.1-exl2 --revision 6_5 --local-dir starcoder2-15b-instruct-v0.1-exl2-6.5 --local-dir-use-symlinks False
 ```
+Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski