Spaces:
Running
Running
sherzod-hakimov
commited on
Commit
•
35f1405
1
Parent(s):
923aff9
small typo fix
Browse files
src/assets/text_content.py
CHANGED
@@ -13,7 +13,7 @@ The CLEM Leaderboard aims to track, rank and evaluate current cLLMs (chat-optimi
|
|
13 |
|
14 |
The benchmarking approach is described in [Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents](https://aclanthology.org/2023.emnlp-main.689.pdf).
|
15 |
|
16 |
-
The
|
17 |
|
18 |
Source code for benchmarking "clems" is available here: [Clembench](https://github.com/clembench/clembench)
|
19 |
|
|
|
13 |
|
14 |
The benchmarking approach is described in [Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents](https://aclanthology.org/2023.emnlp-main.689.pdf).
|
15 |
|
16 |
+
The multimodal benchmark is described in [Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models](https://arxiv.org/abs/2406.14035)
|
17 |
|
18 |
Source code for benchmarking "clems" is available here: [Clembench](https://github.com/clembench/clembench)
|
19 |
|