Update README.md
Browse files
README.md
CHANGED
@@ -84,7 +84,7 @@ generated_ids = [
|
|
84 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
85 |
```
|
86 |
|
87 |
-
Here is an example of the question "Is 9.9 or 9.10 larger", sampled greedily. It gets the question correct, but the reasoning seems a bit weird. At the beginning its pretty strong but when it tries to solve the problem from another view it's starting to deviate.
|
88 |
```
|
89 |
<|im_start|>system
|
90 |
You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step.<|im_end|>
|
|
|
84 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
85 |
```
|
86 |
|
87 |
+
Here is an example of the question "Is 9.9 or 9.10 larger", sampled greedily. It gets the question correct, but the reasoning seems a bit weird. At the beginning its pretty strong but when it tries to solve the problem from another view it's starting to deviate. Then at the end it gets back on track and outputs the correct answer:
|
88 |
```
|
89 |
<|im_start|>system
|
90 |
You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step.<|im_end|>
|