Update README.md
Browse files
README.md
CHANGED
@@ -47,7 +47,7 @@ Human Preference Data:
|
|
47 |
|
48 |
## Evaluation
|
49 |
|
50 |
-
Evaluation of the model was conducted using the PoLL (Pool of LLM) technique, assessing performance on 100 French questions with scores aggregated from six evaluations
|
51 |
(two per evaluator). The evaluators included GPT-4o, Gemini-1.5-pro, and Claude3.5-sonnet.
|
52 |
|
53 |
**Performance Scores (on a scale of 5):**
|
|
|
47 |
|
48 |
## Evaluation
|
49 |
|
50 |
+
Evaluation of the model was conducted using the PoLL (Pool of LLM) technique, assessing performance on **100 French questions** with scores aggregated from six evaluations
|
51 |
(two per evaluator). The evaluators included GPT-4o, Gemini-1.5-pro, and Claude3.5-sonnet.
|
52 |
|
53 |
**Performance Scores (on a scale of 5):**
|