Felladrin commited on
Commit
93085db
1 Parent(s): e8165d5

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -35,3 +35,7 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  qwen2.5-0.5b-instruct-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
37
  imatrix.dat filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  qwen2.5-0.5b-instruct-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
37
  imatrix.dat filter=lfs diff=lfs merge=lfs -text
38
+ model.shard-00001-of-00004.gguf filter=lfs diff=lfs merge=lfs -text
39
+ model.shard-00002-of-00004.gguf filter=lfs diff=lfs merge=lfs -text
40
+ model.shard-00003-of-00004.gguf filter=lfs diff=lfs merge=lfs -text
41
+ model.shard-00004-of-00004.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,57 +1,5 @@
1
  ---
2
- license: apache-2.0
3
- license_link: https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct/blob/main/LICENSE
4
- language:
5
- - en
6
- pipeline_tag: text-generation
7
- base_model: Qwen/Qwen2.5-0.5B-Instruct
8
- tags:
9
- - chat
10
- - llama-cpp
11
- - gguf-my-repo
12
- library_name: transformers
13
  ---
14
 
15
- # Felladrin/Qwen2.5-0.5B-Instruct-Q8_0-GGUF
16
- This model was converted to GGUF format from [`Qwen/Qwen2.5-0.5B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
17
- Refer to the [original model card](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) for more details on the model.
18
-
19
- ## Use with llama.cpp
20
- Install llama.cpp through brew (works on Mac and Linux)
21
-
22
- ```bash
23
- brew install llama.cpp
24
-
25
- ```
26
- Invoke the llama.cpp server or the CLI.
27
-
28
- ### CLI:
29
- ```bash
30
- llama-cli --hf-repo Felladrin/Qwen2.5-0.5B-Instruct-Q8_0-GGUF --hf-file qwen2.5-0.5b-instruct-q8_0.gguf -p "The meaning to life and the universe is"
31
- ```
32
-
33
- ### Server:
34
- ```bash
35
- llama-server --hf-repo Felladrin/Qwen2.5-0.5B-Instruct-Q8_0-GGUF --hf-file qwen2.5-0.5b-instruct-q8_0.gguf -c 2048
36
- ```
37
-
38
- Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
39
-
40
- Step 1: Clone llama.cpp from GitHub.
41
- ```
42
- git clone https://github.com/ggerganov/llama.cpp
43
- ```
44
-
45
- Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
46
- ```
47
- cd llama.cpp && LLAMA_CURL=1 make
48
- ```
49
-
50
- Step 3: Run inference through the main binary.
51
- ```
52
- ./llama-cli --hf-repo Felladrin/Qwen2.5-0.5B-Instruct-Q8_0-GGUF --hf-file qwen2.5-0.5b-instruct-q8_0.gguf -p "The meaning to life and the universe is"
53
- ```
54
- or
55
- ```
56
- ./llama-server --hf-repo Felladrin/Qwen2.5-0.5B-Instruct-Q8_0-GGUF --hf-file qwen2.5-0.5b-instruct-q8_0.gguf -c 2048
57
- ```
 
1
  ---
2
+ base_model: Felladrin/gguf-Q8_0-Qwen2.5-0.5B-Instruct
 
 
 
 
 
 
 
 
 
 
3
  ---
4
 
5
+ Sharded GGUF version of [Felladrin/gguf-Q8_0-Qwen2.5-0.5B-Instruct](https://huggingface.co/Felladrin/gguf-Q8_0-Qwen2.5-0.5B-Instruct).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
model.shard-00001-of-00004.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4f187cfe639d9512cd4c9955524172feaf6e470f8b16a82482eb6934fca1c540
3
+ size 155209152
model.shard-00002-of-00004.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3e160c92edaf4d835aeeee1eb85cece1b1de54424e634680096a84c86fcf1f08
3
+ size 147314400
model.shard-00003-of-00004.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:913d80c47478f6f03c800aef4951c69cdcf813095cb48953ebe8e3f0aeb84108
3
+ size 149276352
model.shard-00004-of-00004.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:86709450c24a1442d6d841307a0bdaf062aa89be2fa9f3347314d098a75dddc9
3
+ size 79268768