sanjay920 commited on
Commit
636af97
1 Parent(s): bb88042

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +73 -0
README.md ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: gemma
3
+ model-index:
4
+ - name: Rubra-Meta-Llama-3-8B-Instruct
5
+ results:
6
+ - task:
7
+ type: text-generation
8
+ dataset:
9
+ type: MMLU
10
+ name: MMLU
11
+ metrics:
12
+ - type: 5-shot
13
+ value: 38.85
14
+ verified: false
15
+ - task:
16
+ type: text-generation
17
+ dataset:
18
+ type: GPQA
19
+ name: GPQA
20
+ metrics:
21
+ - type: 0-shot
22
+ value: 24.55
23
+ verified: false
24
+ - task:
25
+ type: text-generation
26
+ dataset:
27
+ type: GSM-8K
28
+ name: GSM-8K
29
+ metrics:
30
+ - type: 8-shot, CoT
31
+ value: 6.14
32
+ verified: false
33
+ - task:
34
+ type: text-generation
35
+ dataset:
36
+ type: MATH
37
+ name: MATH
38
+ metrics:
39
+ - type: 4-shot, CoT
40
+ value: 2.38
41
+ verified: false
42
+ - task:
43
+ type: text-generation
44
+ dataset:
45
+ type: MT-bench
46
+ name: MT-bench
47
+ metrics:
48
+ - type: GPT-4 as Judge
49
+ value: 5.75
50
+ verified: false
51
+ tags:
52
+ - function-calling
53
+ - tool-calling
54
+ - agentic
55
+ - rubra
56
+ - gemma
57
+ - conversational
58
+ language:
59
+ - en
60
+ ---
61
+
62
+ # Gemma-1.1 2B Instruct GGUF
63
+
64
+ Original model: [rubra-ai/gemma-1.1-2b-it](https://huggingface.co/rubra-ai/gemma-1.1-2b-it)
65
+
66
+ ## Model Description
67
+ Gemma-1.1-2B-IT is the result of post-training on the base model [google/gemma-1.1-2b-it](https://huggingface.co/google/gemma-1.1-2b-it). This model is designed for high performance in various instruction-following tasks and complex interactions, including multi-turn function calling and detailed conversations.
68
+
69
+ ## Training Data
70
+ The model underwent additional training on a proprietary dataset encompassing diverse instruction-following, chat, and function calling data. This post-training process enhances the model's ability to integrate tools and manage complex interaction scenarios effectively.
71
+
72
+ ## How to Use
73
+ Refer to https://docs.rubra.ai/inference/llamacpp for usage. Feel free to ask/open issues up in our Github repo: https://github.com/rubra-ai/rubra