Nan-Do commited on
Commit
544c609
·
1 Parent(s): 4e4a96b

Added template for the model

Browse files
Files changed (1) hide show
  1. README.md +80 -0
README.md CHANGED
@@ -1,3 +1,83 @@
1
  ---
 
 
 
 
2
  license: llama2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ base_model: Nan-Do/LeetCodeWizard_13B_v1.1a
3
+ inference: false
4
+ language:
5
+ - en
6
  license: llama2
7
+ model-index:
8
+ - name: LeetCodeWizard_13B_v1.1a
9
+ results: []
10
+ model_creator: Nan-Do
11
+ model_name: LeetCodeWizard 13B v1.1a
12
+ model_type: codellama
13
+ prompt_template: 'Below is an instruction that describes a task. Write a response that appropriately completes the request.
14
+
15
+ ### Instruction:
16
+ {instruction}
17
+
18
+ ### Response:'
19
+ quantized_by: Nan-Do
20
+ tags:
21
+ - codellama
22
+ - instruct
23
+ - finetune
24
+ - leetcode
25
+ - problem solving
26
  ---
27
+ <!-- markdownlint-disable MD041 -->
28
+
29
+ # LeetCodeWizard 7B V1.1 - GGUF
30
+ - Original model: [LeetCodeWizard 13B v1.1a](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_v1.1a)
31
+
32
+ <!-- description start -->
33
+ ## Description
34
+
35
+ This repo contains GGUF format model files for [LeetCodeWizard 13B v1.1a](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_v1.1a). (model template inspired by [TheBloke](https://huggingface.co/TheBloke))
36
+
37
+
38
+ ## Prompt template: Alpaca
39
+
40
+ ```
41
+ Below is an instruction that describes a task. Write a response that appropriately completes the request.
42
+
43
+ ### Instruction:
44
+ {instruction}
45
+
46
+ ### Response:
47
+ ```
48
+
49
+ <!-- prompt-template end -->
50
+
51
+
52
+ <!-- compatibility_gguf start -->
53
+ ## Explanation of quantisation methods
54
+
55
+ <details>
56
+ <summary>Click to see details</summary>
57
+
58
+ The new methods available are:
59
+
60
+ * GGML_TYPE_Q3_K - "type-0" 3-bit quantization in super-blocks containing 16 blocks, each block having 16 weights. Scales are quantized with 6 bits. This end up using 3.4375 bpw.
61
+ * GGML_TYPE_Q4_K - "type-1" 4-bit quantization in super-blocks containing 8 blocks, each block having 32 weights. Scales and mins are quantized with 6 bits. This ends up using 4.5 bpw.
62
+ * GGML_TYPE_Q5_K - "type-1" 5-bit quantization. Same super-block structure as GGML_TYPE_Q4_K resulting in 5.5 bpw
63
+ * GGML_TYPE_Q6_K - "type-0" 6-bit quantization. Super-blocks with 16 blocks, each block having 16 weights. Scales are quantized with 8 bits. This ends up using 6.5625 bpw
64
+
65
+ Refer to the Provided Files table below to see what files use which methods, and how.
66
+ </details>
67
+ <!-- compatibility_gguf end -->
68
+
69
+ <!-- README_GGUF.md-provided-files start -->
70
+ ## Provided files
71
+
72
+ | Name | Quantisation method | Bits | Size |
73
+ | ---- | :----: | ----: | ----: |
74
+ | [LeetCodeWizard_13B_V1.1a.Q3_K.gguf](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_V1.1a-GGUF/resolve/main/LeetCodeWizard_13B_V1.1a.Q3_K.gguf) | Q3_K | 3 | 6.34 GB|
75
+ | [LeetCodeWizard_13B_V1.1a.Q4_0.gguf](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_V1.1a-GGUF/resolve/main/LeetCodeWizard_13B_V1.1a.Q4_0.gguf) | Q4_0 | 4 | 7.37 GB|
76
+ | [LeetCodeWizard_13B_V1.1a.Q4_1.gguf](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_V1.1a-GGUF/resolve/main/LeetCodeWizard_13B_V1.1a.Q4_1.gguf) | Q4_1 | 4 | 8.17 GB|
77
+ | [LeetCodeWizard_13B_V1.1a.Q5_0.gguf](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_V1.1a-GGUF/resolve/main/LeetCodeWizard_13B_V1.1a.Q5_0.gguf) | Q5_0 | 5 | 8.97 GB|
78
+ | [LeetCodeWizard_13B_V1.1a.Q5_1.gguf](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_V1.1a-GGUF/resolve/main/LeetCodeWizard_13B_V1.1a.Q5_1.gguf) | Q5_1 | 5 | 9.78 GB|
79
+ | [LeetCodeWizard_13B_V1.1a.Q6_K.gguf](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_V1.1a-GGUF/resolve/main/LeetCodeWizard_13B_V1.1a.Q6_K.gguf) | Q6_K | 6 | 10.7 GB|
80
+ | [LeetCodeWizard_13B_V1.1a.Q8_0.gguf](https://huggingface.co/Nan-Do/LeetCodeWizard_13B_V1.1a-GGUF/resolve/main/LeetCodeWizard_13B_V1.1a.Q8_0.gguf) | Q8_0 | 8 | 13.8 GB|
81
+
82
+ <!-- original-model-card end -->
83
+