Update README.md
README.md
CHANGED
@@ -84,7 +84,7 @@ generated_ids = [
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
 
-Here is an example of the question "Is 9.9 or 9.10 larger", sampled greedily. It gets the question correct, but the reasoning seems a bit weird. At the beginning its pretty strong but when it tries to solve the problem from
+Here is an example of the question "Is 9.9 or 9.10 larger", sampled greedily. It gets the question correct, but the reasoning seems a bit weird at places. At the beginning its pretty strong but when it tries to solve the problem from other views it's starting to deviate sometimes and has some incorrect reasoning chains:
 ```
 <|im_start|>system
 You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step.<|im_end|>