Update README.md
Browse files
README.md
CHANGED
@@ -41,14 +41,15 @@ The training process included:
|
|
41 |
## Evaluation
|
42 |
The model was evaluated using the **MathQA test dataset(2985 examples)** with **accuracy** as the primary metric. The following table compares its performance to other models:
|
43 |
|
44 |
-
| Model
|
45 |
-
|
46 |
-
| Gemma-2-9b (base)
|
47 |
-
| Mistral-7B-Instruct | 22.61
|
48 |
-
| Llama-3.1-8b-Instruct | 27.37
|
49 |
-
| Llama-3.2-3b-Instruct | 23.48
|
50 |
-
| Qwen2
|
51 |
-
|
|
|
|
52 |
|
53 |
|
54 |
## How to Get Started with the Model
|
|
|
41 |
## Evaluation
|
42 |
The model was evaluated using the **MathQA test dataset(2985 examples)** with **accuracy** as the primary metric. The following table compares its performance to other models:
|
43 |
|
44 |
+
| Model | Accuracy (%) |
|
45 |
+
|-----------------------|---------------|
|
46 |
+
| Gemma-2-9b (base) | 24.02 |
|
47 |
+
| Mistral-7B-Instruct | 22.61 |
|
48 |
+
| Llama-3.1-8b-Instruct | 27.37 |
|
49 |
+
| Llama-3.2-3b-Instruct | 23.48 |
|
50 |
+
| Qwen2-Math-7B-Instruct| 33.13 |
|
51 |
+
| Qwen2.5-7B-Instruct | 38.69 |
|
52 |
+
| **mathGemma-2-9b** | **48.91** |
|
53 |
|
54 |
|
55 |
## How to Get Started with the Model
|