Can someone rate this personally?

by Noxi-V - opened Oct 22

Oct 22

I feel like ever since reflection, I do not trust benchmarks since they are practically can be trained and cheesed
anyone tried this? How good is it in a real use case?

dfurman

Owner Oct 22

Not an answer, but this model's training is completely transparent/traceable.

As per the model card, it was finetuned off a specific base model (which used to be #1 on that leaderboard) on a small sample of a specific dataset. Both are listed in the model card.

Curious to hear how it is working on real use cases as well!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment