Update README.md
Browse files
README.md
CHANGED
@@ -19,8 +19,6 @@ MicroThinker-3B-Preview, a new model fine-tuned from the [huihui-ai/Llama-3.2-3B
|
|
19 |
|
20 |
This is just a test, but the performance is quite good.
|
21 |
|
22 |
-
The model is still being fine-tuned, but it will be ready very soon.
|
23 |
-
|
24 |
Now, I'll introduce the test environment.
|
25 |
|
26 |
The model was trained using 1 RTX 4090 GPU(24GB) .
|
@@ -58,17 +56,17 @@ swift sft --model huihui-ai/Llama-3.2-3B-Instruct-abliterated --model_type llama
|
|
58 |
```
|
59 |
|
60 |
|
61 |
-
4. Save the fine-tuned model.
|
62 |
Replace the directories below with specific ones.
|
63 |
|
64 |
```
|
65 |
-
swift infer --model huihui-ai/Llama-3.2-3B-Instruct-abliterated --adapters output/Llama-3.2-3B-Instruct-abliterated/lora/sft/v0-20250102-153619/checkpoint-1237 --merge_lora true
|
66 |
```
|
67 |
|
68 |
|
69 |
This should create a new model directory: `checkpoint-1237-merged`, Copy or move this directory to the `huihui` directory.
|
70 |
|
71 |
-
5. Perform inference on the fine-tuned model.
|
72 |
|
73 |
```
|
74 |
swift infer --model huihui/checkpoint-1237-merged --stream true --infer_backend pt --max_new_tokens 8192
|
@@ -82,15 +80,15 @@ swift sft --model huihui-ai/checkpoint-1237-merged --model_type llama3_2 --train
|
|
82 |
```
|
83 |
|
84 |
|
85 |
-
7. Save the final fine-tuned model.
|
86 |
Replace the directories below with specific ones.
|
87 |
|
88 |
```
|
89 |
-
swift infer --model huihui-ai/checkpoint-1237-merged --adapters output/Llama-3.2-3B-Instruct-abliterated/lora/sft2/v0-20250103-121319/checkpoint-
|
90 |
```
|
91 |
|
92 |
|
93 |
-
This should create a new model directory: `checkpoint-
|
94 |
|
95 |
8. Perform inference on the final fine-tuned model.
|
96 |
|
|
|
19 |
|
20 |
This is just a test, but the performance is quite good.
|
21 |
|
|
|
|
|
22 |
Now, I'll introduce the test environment.
|
23 |
|
24 |
The model was trained using 1 RTX 4090 GPU(24GB) .
|
|
|
56 |
```
|
57 |
|
58 |
|
59 |
+
4. Save the fine-tuned model. After you're done, input `exit` to exit.
|
60 |
Replace the directories below with specific ones.
|
61 |
|
62 |
```
|
63 |
+
swift infer --model huihui-ai/Llama-3.2-3B-Instruct-abliterated --adapters output/Llama-3.2-3B-Instruct-abliterated/lora/sft/v0-20250102-153619/checkpoint-1237 --stream true --merge_lora true
|
64 |
```
|
65 |
|
66 |
|
67 |
This should create a new model directory: `checkpoint-1237-merged`, Copy or move this directory to the `huihui` directory.
|
68 |
|
69 |
+
5. Perform inference on the fine-tuned model.
|
70 |
|
71 |
```
|
72 |
swift infer --model huihui/checkpoint-1237-merged --stream true --infer_backend pt --max_new_tokens 8192
|
|
|
80 |
```
|
81 |
|
82 |
|
83 |
+
7. Save the final fine-tuned model. After you're done, input `exit` to exit.
|
84 |
Replace the directories below with specific ones.
|
85 |
|
86 |
```
|
87 |
+
swift infer --model huihui-ai/checkpoint-1237-merged --adapters output/Llama-3.2-3B-Instruct-abliterated/lora/sft2/v0-20250103-121319/checkpoint-2474 --stream true --merge_lora true
|
88 |
```
|
89 |
|
90 |
|
91 |
+
This should create a new model directory: `checkpoint-2474-merged`, Rename the directory to `MicroThinker-3B-Preview`, Copy or move this directory to the `huihui` directory.
|
92 |
|
93 |
8. Perform inference on the final fine-tuned model.
|
94 |
|