Independent evaluation results

#35

by yaronr - opened Sep 26, 2024

yaronr

Sep 26, 2024

Dear Llama team,

I'm pleased to share our independent evaluation of the model using our implementation of the MMLU-Pro benchmark.

I hope you find this useful.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment