rubra-ai
/

Qwen2-7B-Instruct-GGUF

function-calling

Inference Endpoints

Model card Files Files and versions Community

sanjay920 commited on Jul 1

Commit

a16e449

•

1 Parent(s): a6a37fd

Create README.md

Files changed (1) hide show

README.md +74 -0

README.md ADDED Viewed

	@@ -0,0 +1,74 @@

+---
+license: apache-2.0
+model-index:
+- name: Rubra-Qwen2-7B-Instruct
+  results:
+  - task:
+      type: text-generation
+    dataset:
+      type: MMLU
+      name: MMLU
+    metrics:
+    - type: 5-shot
+      value: 68.88
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: GPQA
+      name: GPQA
+    metrics:
+    - type: 0-shot
+      value: 30.36
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: GSM-8K
+      name: GSM-8K
+    metrics:
+    - type: 8-shot, CoT
+      value: 75.82
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: MATH
+      name: MATH
+    metrics:
+    - type: 4-shot, CoT
+      value: 28.72
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: MT-bench
+      name: MT-bench
+    metrics:
+    - type: GPT-4 as Judge
+      value: 8.08
+      verified: false
+tags:
+- function-calling
+- tool-calling
+- agentic
+- rubra
+- conversational
+language:
+- en
+- zh
+---
+# Qwen2 7B Instruct GGUF
+Original model: [rubra-ai/Qwen2-7B-Instruct](https://huggingface.co/rubra-ai/Qwen2-7B-Instruct)
+## Model description
+The model is the result of further post-training [Qwen/Qwen2-7B-Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct). It is capable of complex multi-turn tool/function calling.
+## Training
+The model was post-trained (freeze tuned & DPO) on a proprietary dataset consisting of diverse function calling, chat, and instruct data.
+## How to use
+Refer to https://docs.rubra.ai/inference/llamacpp for usage. Feel free to ask/open issues up in our Github repo: https://github.com/rubra-ai/rubra