Jan commited on
Commit
9997224
β€’
1 Parent(s): 2884e80

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +97 -0
README.md ADDED
@@ -0,0 +1,97 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ ---
6
+ <!-- header start -->
7
+ <!-- 200823 -->
8
+
9
+ <div style="width: auto; margin-left: auto; margin-right: auto">
10
+ <img src="https://github.com/janhq/jan/assets/89722390/35daac7d-b895-487c-a6ac-6663daaad78e" alt="Jan banner" style="width: 100%; min-width: 400px; display: block; margin: auto;">
11
+ </div>
12
+
13
+ <p align="center">
14
+ <a href="https://jan.ai/">Jan</a
15
+ >
16
+ - <a href="https://discord.gg/AsJ8krTT3N">Discord</a>
17
+ </p>
18
+ <!-- header end -->
19
+
20
+ # Model Description
21
+ This model uses the `passthrough` merge method from the best 7B models on the [OpenLLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard):
22
+ 1. [viethq188/LeoScorpius-7B-Chat-DPO](https://huggingface.co/viethq188/LeoScorpius-7B-Chat-DPO)
23
+ 2. [GreenNode/GreenNodeLM-7B-v1olet](https://huggingface.co/GreenNode/GreenNodeLM-7B-v1olet)
24
+
25
+ The yaml config file for this model is here:
26
+
27
+ ```yaml
28
+ slices:
29
+ - sources:
30
+ - model: "viethq188/LeoScorpius-7B-Chat-DPO"
31
+ layer_range: [0, 24]
32
+ - sources:
33
+ - model: "GreenNode/GreenNodeLM-7B-v1olet"
34
+ layer_range: [8, 32]
35
+ merge_method: passthrough
36
+ dtype: bfloat16
37
+ ```
38
+
39
+ # Prompt template
40
+
41
+ - **ChatML**
42
+
43
+ ```
44
+ <|im_start|>system
45
+ {system_message}<|im_end|>
46
+ <|im_start|>user
47
+ {prompt}<|im_end|>
48
+ <|im_start|>assistant
49
+ ```
50
+
51
+ # Run this model
52
+
53
+ You can run this model using [Jan](https://jan.ai/) on Mac, Windows, or Linux.
54
+
55
+ **Jan is an open source, ChatGPT alternative that is:**
56
+
57
+ πŸ’» **100% offline on your machine**: Your conversations remain confidential, and visible only to you.
58
+
59
+ πŸ—‚οΈ **An Open File Format**: Conversations and model settings stay on your computer and can be exported or deleted at any time.
60
+
61
+ 🌐 **OpenAI Compatible**: Local server on port `
62
+ 1337` with OpenAI compatible endpoints
63
+
64
+ 🌍 **Open Source & Free**: We build in public; check out our [Github](https://github.com/janhq)
65
+
66
+ - Please use the [Pandora-v1-13B-GGUF](https://huggingface.co/janhq/Pandora-v1-10.7B-GGUF) when using on Jan.
67
+
68
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/r7VmEBLGXpPLTu2MImM7S.png)
69
+
70
+
71
+ # About Jan
72
+ Jan believes in the need for an open-source AI ecosystem and is building the infra and tooling to allow open-source AIs to compete on a level playing field with proprietary ones.
73
+
74
+ Jan's long-term vision is to build a cognitive framework for future robots, who are practical, useful assistants for humans and businesses in everyday life.
75
+
76
+ # Jan Model Merger
77
+ This is a test project for merging models.
78
+
79
+ # Open LLM Leaderboard Evaluation Results
80
+
81
+ Detailed results can be found here.
82
+
83
+ | Metric | Value |
84
+ |-----------------------|---------------------------|
85
+ | Avg. | ?|
86
+ | ARC (25-shot) | ? |
87
+ | HellaSwag (10-shot) | ? |
88
+ | MMLU (5-shot) | ?|
89
+ | TruthfulQA (0-shot) | ? |
90
+ | Winogrande (5-shot) | ? |
91
+ | GSM8K (5-shot) | ? |
92
+
93
+ # Acknowlegement
94
+ - [mergekit](https://github.com/cg123/mergekit)
95
+ - [DARE](https://github.com/yule-BUAA/MergeLM/blob/main/README.md)
96
+ - [SLERP](https://github.com/Digitous/LLM-SLERP-Merge)
97
+ - [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness)