npc0/chatglm3-6b-32k-int4

npc0 committed on Nov 23, 2023

Commit 4946e67 (1 parent: 0f71472)

Update README.md

Files changed (1)

README.md +5 -6

@@ -15,11 +15,10 @@ ChatGLM3-6B-32k is the latest-generation open-source model in the ChatGLM series, [THUDM/chatglm3-
 The Q4_0 and Q4_1 weights stored in this repository were generated with [ChatGLM.CPP](https://github.com/li-plus/chatglm.cpp) using GGML quantization.
 ## Performance
-|Model                     |GGML quantize method| HDD size |1 token\*|
-|--------------------------|--------------------|----------|---------|
-|chatglm3-32k-ggml-q4_0.bin|        q4_0        |  ?.?? GB |  ???ms  |
-|chatglm3-32k-ggml-q4_1.bin|        q4_1        |  ?.?? GB |  ???ms  |
-\* ms/token (CPU @ Platinum 8260) from [reference](https://github.com/li-plus/chatglm.cpp#performance)
+|Model                     |GGML quantize method| HDD size |
+|--------------------------|--------------------|----------|
+|chatglm3-32k-ggml-q4_0.bin|        q4_0        |  3.51 GB |
+|chatglm3-32k-ggml-q4_1.bin|        q4_1        |  ?.?? GB |
 ## Getting Started
 1. Install dependency
@@ -29,7 +28,7 @@ ChatGLM3-6B-32k is the latest-generation open-source model in the ChatGLM series, [THUDM/chatglm3-
 2. Download weight
   ```sh
-  wget https://huggingface.co/npc0/chatglm3-6b-fp16/resolve/main/chatglm3-32k-ggml-q4_0.bin
+  wget https://huggingface.co/npc0/chatglm3-6b-32k-int4/resolve/main/chatglm3-32k-ggml-q4_0.bin
   ```
 3. Code
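
The download URL corrected in this commit follows the Hugging Face pattern `https://huggingface.co/<repo_id>/resolve/<revision>/<filename>`, so the old link failed only because it named a different repository (`npc0/chatglm3-6b-fp16`). As a minimal sketch, the URL can be built from its parts so the repository name is stated once. The helper name `hf_resolve_url` is hypothetical and is not part of this repository or of any Hugging Face tool.

```sh
#!/bin/sh
# Hypothetical helper (not from the README): build a Hugging Face
# "resolve" download URL from a repo id, a revision, and a file name.
hf_resolve_url() {
  # $1 = repo id (owner/name), $2 = revision (e.g. main), $3 = file name
  printf 'https://huggingface.co/%s/resolve/%s/%s\n' "$1" "$2" "$3"
}

# Values taken from the diff above.
hf_resolve_url "npc0/chatglm3-6b-32k-int4" "main" "chatglm3-32k-ggml-q4_0.bin"
```

The printed URL can then be passed to `wget`, as in step 2 of the README.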