micaebe commited on
Commit
9fca79b
1 Parent(s): 54f7bba

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +135 -0
README.md CHANGED
@@ -84,6 +84,141 @@ generated_ids = [
84
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
85
  ```
86
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
87
  For GSM8k performance comparison: the base-instruct model scores 73.2% on the test set.
88
 
89
  Disclaimer: GSM8k scores are currently only from the first 27% of the test set. Scores will be updated in the future, tested on the full dataset.
 
84
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
85
  ```
86
 
87
+ Here is an example of the question "Is 9.9 or 9.10 larger", sampled greedily. It gets the question correct, but the reasoning seems a bit weird. At the beginning its pretty strong but when it tries to solve the problem from another view it's starting to deviate.
88
+ ```
89
+ <|im_start|>system
90
+ You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step.<|im_end|>
91
+ <|im_start|>user
92
+ Is 9.9 or 9.10 larger?<|im_end|>
93
+ <|im_start|>assistant
94
+ So I have this problem here: "Is 9.9 or 9.10 larger?" Okay, let's see what it's asking.
95
+
96
+ First off, both numbers are decimals with the same whole number part, which is 9 in this case. So that means they're both nine units long, right?
97
+
98
+ Now, when comparing two decimal numbers where the whole parts are equal, we just need to compare the digits after the decimal point. That makes sense because if the whole parts are the same, then the difference must be in the tenths place.
99
+
100
+ Let me write down the numbers again:
101
+
102
+ - 9.9
103
+
104
+ - 9.10
105
+
106
+ Okay, so the whole part is 9 for both, but the tenths places differ. In 9.9, the tenth is 9, and in 9.10, the tenth is 1.
107
+
108
+ Since 9 is greater than 1, it follows that 9.9 is greater than 9.10.
109
+
110
+ Wait a minute, does that make sense? Let's think about it another way.
111
+
112
+ Imagine you have 9 apples, and someone gives you an additional 0.9 of an apple. Now, do you have more apples or fewer apples compared to having 9.10 apples?
113
+
114
+ Well, 0.9 is less than 1, so adding it to 9 would give you 9.9, which is still less than 9.10.
115
+
116
+ But wait, maybe there's something else to consider. What if I were to add these numbers together?
117
+
118
+ Let's try adding them up:
119
+
120
+ 9.9 + 9.10 = ?
121
+
122
+ Hmm, that might help clarify things.
123
+
124
+ Adding 9.9 and 9.10:
125
+
126
+ First, align the decimal points:
127
+
128
+ 9.90
129
+ + 9.10
130
+ ------
131
+
132
+ Now, add the hundredths first:
133
+
134
+ 0 + 0 = 0
135
+
136
+ Then, tens:
137
+
138
+ 9 + 1 = 10
139
+
140
+ Bring down the zero from the hundredths place:
141
+
142
+ 10.0
143
+
144
+ Finally, ones:
145
+
146
+ 9 + 9 = 18
147
+
148
+ So, 9.9 + 9.10 = 18.00
149
+
150
+ That seems straightforward. But earlier, I thought that since the whole parts are the same, the comparison should be based on the tenths place only. But now, adding them shows that 9.9 is indeed smaller than 9.10.
151
+
152
+ I'm getting confused here. Maybe I need to rethink my approach.
153
+
154
+ Alternatively, perhaps I can look at the numbers as fractions to get a better understanding.
155
+
156
+ Both 9.9 and 9.10 can be written as fractions:
157
+
158
+ 9.9 = 99/10
159
+
160
+ 9.10 = 91/10
161
+
162
+ Now, comparing 99/10 and 91/10.
163
+
164
+ Clearly, 99 is greater than 91, so 99/10 > 91/10.
165
+
166
+ Therefore, 9.9 > 9.10.
167
+
168
+ But wait, that doesn't seem right either. Earlier, when I added them, 9.9 + 9.10 = 18.00, which suggests that 9.9 is less than 9.10.
169
+
170
+ Maybe I'm missing something fundamental here.
171
+
172
+ Let me think differently.
173
+
174
+ Consider the numbers as lengths on a number line.
175
+
176
+ The number line has negative numbers to the left and positive numbers to the right.
177
+
178
+ If I plot -9.9 and -9.10 on the number line, which one is farther to the left?
179
+
180
+ Well, -9.9 is further to the left than -9.10, because -9.9 is closer to zero than -9.10.
181
+
182
+ Similarly, if I plot 9.9 and 9.10 on the number line, which one is farther to the right?
183
+
184
+ Again, 9.9 is farther to the right than 9.10, because 9.9 is closer to ten than 9.10.
185
+
186
+ This reinforces that 9.9 is greater than 9.10.
187
+
188
+ Wait, but earlier, when I was trying to add them, I got 18.00, which is greater than any other possible sum involving 9.9 and 9.10.
189
+
190
+ So, perhaps the confusion lies in how I interpreted the addition.
191
+
192
+ Let me re-examine the addition:
193
+
194
+ 9.9 + 9.10 = 18.00
195
+
196
+ Yes, that's correct.
197
+
198
+ And since 18.00 is greater than any other possible sum involving 9.9 and 9.10, it confirms that 9.9 is greater than 9.10.
199
+
200
+ But wait, maybe I should consider that 9.9 is actually 9.90, not 9.9.
201
+
202
+ Because sometimes people write decimals without trailing zeros, like 9.9 instead of 9.90.
203
+
204
+ In that case, 9.90 is exactly the same as 9.9, so the comparison remains the same.
205
+
206
+ Therefore, 9.90 is greater than 9.10.
207
+
208
+ But according to standard notation, 9.9 is typically used, even though 9.90 is mathematically equivalent.
209
+
210
+ So, sticking with 9.9, which is 9.90, it's clear that 9.9 is larger than 9.10.
211
+
212
+ Hence, the answer is 9.9 is larger.
213
+
214
+ **Final Answer**
215
+
216
+ \[ \boxed{9.9} \]<|im_end|>
217
+ ```
218
+
219
+
220
+
221
+
222
  For GSM8k performance comparison: the base-instruct model scores 73.2% on the test set.
223
 
224
  Disclaimer: GSM8k scores are currently only from the first 27% of the test set. Scores will be updated in the future, tested on the full dataset.