sanjay920 commited on
Commit
3855fa9
1 Parent(s): 9bb8cd2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -48,7 +48,7 @@ model-index:
48
  name: MT-bench
49
  metrics:
50
  - type: GPT-4 as Judge
51
- value: 7.45
52
  verified: false
53
  tags:
54
  - function-calling
@@ -67,10 +67,10 @@ The model is the result of further post-training [microsoft/Phi-3-mini-128k-inst
67
 
68
  | Model | Function Calling | MMLU | GPQA | GSM-8K | MATH | MT-bench | Win | Loss | Tie | Win Rate | Loss Rate | Adjusted Win Rate |
69
  |----------------------------------------------|------------------|-------|-------|--------|-------|----------|-----|------|-----|----------|-----------|-------------------|
70
- | Phi-3 Mini 128k Instruct (June) | - | 69.36 | 27.01 | 83.7 | 32.92 | - | - | - | - | - | - | - |
71
- | Rubra Enhanced Phi-3 Mini 128k Instruct (June)| - | 67.87 | 29.69 | 79.45 | 30.80 | - | - | - | - | - | - | - |
72
- | Phi-3 Mini 128k Instruct (April) | - | 68.17 | 25.90 | 80.44 | 28.12 | 7.92 | 51 | 45 | 64 | 0.31875 | 0.28125 | **0.51875** |
73
- | Rubra Enhanced Phi-3 Mini 128k Instruct (April)| 65.71% | 66.66 | 29.24 | 74.09 | 26.84 | 7.45 | 45 | 51 | 64 | 0.28125 | 0.31875 | 0.48125 |
74
  * Commit `e2ecb24bd9dae689bb30dafcf13cbbc9dbddead5` is the last commit to have the April-based Phi-3 model. The latest in main is built off the June model
75
 
76
  ## Training Data
 
48
  name: MT-bench
49
  metrics:
50
  - type: GPT-4 as Judge
51
+ value: 8.21
52
  verified: false
53
  tags:
54
  - function-calling
 
67
 
68
  | Model | Function Calling | MMLU | GPQA | GSM-8K | MATH | MT-bench | Win | Loss | Tie | Win Rate | Loss Rate | Adjusted Win Rate |
69
  |----------------------------------------------|------------------|-------|-------|--------|-------|----------|-----|------|-----|----------|-----------|-------------------|
70
+ | Phi-3 Mini 128k Instruct (June) | - | 69.36 | 27.01 | 83.7 | 32.92 | 8.02 | 21 | 72 | 67 | 0.13125 | 0.45000 | 0.340625 |
71
+ | Rubra Enhanced Phi-3 Mini 128k Instruct (June)| 75.00% | 67.87 | 29.69 | 79.45 | 30.80 | 8.21 | 72 | 21 | 67 | 0.45000 | 0.13125 | **0.659375** |
72
+ | Phi-3 Mini 128k Instruct (April) | - | 68.17 | 25.90 | 80.44 | 28.12 | 7.92 | 51 | 45 | 64 | 0.31875 | 0.28125 | 0.51875 |
73
+ | Rubra Enhanced Phi-3 Mini 128k Instruct (April)| 65.71% | 66.66 | 29.24 | 74.09 | 26.84 | 7.45 | 45 | 51 | 64 | 0.28125 | 0.31875 | 0.48125 |
74
  * Commit `e2ecb24bd9dae689bb30dafcf13cbbc9dbddead5` is the last commit to have the April-based Phi-3 model. The latest in main is built off the June model
75
 
76
  ## Training Data