AIFunOver commited on
Commit
f8b8bac
1 Parent(s): 24252a1

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: Qwen/Qwen2.5-Coder-32B-Instruct
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ license: apache-2.0
7
+ license_link: https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct/blob/main/LICENSE
8
+ pipeline_tag: text-generation
9
+ tags:
10
+ - code
11
+ - codeqwen
12
+ - chat
13
+ - qwen
14
+ - qwen-coder
15
+ - openvino
16
+ - nncf
17
+ - 8-bit
18
+ base_model_relation: quantized
19
+ ---
20
+
21
+ This model is a quantized version of [`Qwen/Qwen2.5-Coder-32B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct) and is converted to the OpenVINO format. This model was obtained via the [nncf-quantization](https://huggingface.co/spaces/echarlaix/nncf-quantization) space with [optimum-intel](https://github.com/huggingface/optimum-intel).
22
+ First make sure you have `optimum-intel` installed:
23
+ ```bash
24
+ pip install optimum[openvino]
25
+ ```
26
+ To load your model you can do as follows:
27
+ ```python
28
+ from optimum.intel import OVModelForCausalLM
29
+ model_id = "AIFunOver/Qwen2.5-Coder-32B-Instruct-openvino-8bit"
30
+ model = OVModelForCausalLM.from_pretrained(model_id)
31
+ ```