Update README.md
Browse files
README.md
CHANGED
@@ -45,8 +45,8 @@ model-index:
|
|
45 |
type: MT-bench
|
46 |
name: MT-bench
|
47 |
metrics:
|
48 |
-
- type:
|
49 |
-
value:
|
50 |
verified: false
|
51 |
---
|
52 |
|
@@ -59,14 +59,6 @@ The model is the result of further post-training [mistralai/Mistral-7B-Instruct-
|
|
59 |
|
60 |
The model was post-trained (freeze tuned & DPO) on a proprietary dataset consisting of diverse function calling, chat, and instruct data.
|
61 |
|
62 |
-
## Evaluation
|
63 |
-
|
64 |
-
| Model | Function Calling | MMLU | GPQA | GSM-8K | MATH | MT-bench |
|
65 |
-
|------------------------------------|------------------|-------|-------|--------|-------|----------|
|
66 |
-
| Mistral 7B Instruct v0.2 | - | 59.27 | 27.68 | 43.21 | 10.30 | 7.50 |
|
67 |
-
| Rubra Enhanced Mistral 7B Instruct v0.2 | 69.28% | 58.90 | 29.91 | 34.12 | 8.36 | 7.36 |
|
68 |
-
|
69 |
-
|
70 |
## How to use
|
71 |
|
72 |
You can use the model with the Hugging Face `transformers` and the rubra library [rubra-tools](https://github.com/rubra-ai/rubra-tools) as follows:
|
|
|
45 |
type: MT-bench
|
46 |
name: MT-bench
|
47 |
metrics:
|
48 |
+
- type: GP4-4 as Judge
|
49 |
+
value: 7.36
|
50 |
verified: false
|
51 |
---
|
52 |
|
|
|
59 |
|
60 |
The model was post-trained (freeze tuned & DPO) on a proprietary dataset consisting of diverse function calling, chat, and instruct data.
|
61 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
62 |
## How to use
|
63 |
|
64 |
You can use the model with the Hugging Face `transformers` and the rubra library [rubra-tools](https://github.com/rubra-ai/rubra-tools) as follows:
|