sanjay920 commited on
Commit
0251a12
1 Parent(s): aa5079d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +57 -17
README.md CHANGED
@@ -1,20 +1,60 @@
1
- ---
2
- language:
3
- - en
4
- ---
5
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  ---
7
 
8
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
9
- should probably proofread and complete it, then remove this comment. -->
10
-
11
- # run1_short_8000
12
-
13
- This model is a fine-tuned version of [microsoft/Phi-3-mini-128k-instruct](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) on the rubra_train_v1_phi3_short dataset.
14
-
15
- ### Framework versions
16
-
17
- - Transformers 4.41.2
18
- - Pytorch 2.3.0+cu121
19
- - Datasets 2.19.2
20
- - Tokenizers 0.19.1
 
 
 
 
 
1
  ---
2
+ license: mit
3
+ model-index:
4
+ - name: Rubra-Phi-3-mini-128k-instruct
5
+ results:
6
+ - task:
7
+ type: text-generation
8
+ dataset:
9
+ type: MMLU
10
+ name: MMLU
11
+ metrics:
12
+ - type: 5-shot
13
+ value: 66.66
14
+ verified: false
15
+ - task:
16
+ type: text-generation
17
+ dataset:
18
+ type: GPQA
19
+ name: GPQA
20
+ metrics:
21
+ - type: 0-shot
22
+ value: 29.24
23
+ verified: false
24
+ - task:
25
+ type: text-generation
26
+ dataset:
27
+ type: GSM-8K
28
+ name: GSM-8K
29
+ metrics:
30
+ - type: 8-shot, CoT
31
+ value: 74.09
32
+ verified: false
33
+ - task:
34
+ type: text-generation
35
+ dataset:
36
+ type: MATH
37
+ name: MATH
38
+ metrics:
39
+ - type: 4-shot, CoT
40
+ value: 26.84
41
+ verified: false
42
+ - task:
43
+ type: text-generation
44
+ dataset:
45
+ type: MT-bench
46
+ name: MT-bench
47
+ metrics:
48
+ - type: GPT-4 as Judge
49
+ value: 7.45
50
+ verified: false
51
+ tags:
52
+ - function-calling
53
+ - tool-calling
54
+ - agentic
55
+ - rubra
56
+ language:
57
+ - en
58
  ---
59
 
60
+ # Phi-3-mini-128k-instruct