Update app.py
Browse files
app.py
CHANGED
@@ -224,7 +224,7 @@ with demo:
|
|
224 |
- Win Rate represents how often a model outperforms other models in each language, averaged across all languages.
|
225 |
- The scores of instruction-tuned models might be significantly higher on humaneval-python than other languages. We use the instruction format of HumanEval. For other languages, we use base MultiPL-E prompts.
|
226 |
- For more details check the ๐ About section.
|
227 |
-
- Models with a ๐ด symbol represent external evaluation
|
228 |
""",
|
229 |
elem_classes="markdown-text",
|
230 |
)
|
|
|
224 |
- Win Rate represents how often a model outperforms other models in each language, averaged across all languages.
|
225 |
- The scores of instruction-tuned models might be significantly higher on humaneval-python than other languages. We use the instruction format of HumanEval. For other languages, we use base MultiPL-E prompts.
|
226 |
- For more details check the ๐ About section.
|
227 |
+
- Models with a ๐ด symbol represent external evaluation submission, this means that we didn't verify the results, you can find the author's submission under `Submission PR` field from `See All Columns` tab.
|
228 |
""",
|
229 |
elem_classes="markdown-text",
|
230 |
)
|