Highest score yet out of a plain vanilla 7B model!
#17, opened by silvacarl
I just tested this model on the hardest questions we use when evaluating models. It got 85% right, beating larger models on these questions. This is the first time I have ever seen a 7B model do that.
And we have tested everything.
If it can be easily fine-tuned, this could be perfect.