ehristoforu commited on
Commit
e93cf77
1 Parent(s): 4912361

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -55
README.md CHANGED
@@ -1,59 +1,51 @@
1
  ---
2
- base_model:
3
- - NousResearch/Hermes-3-Llama-3.1-8B
4
- - unsloth/Meta-Llama-3.1-8B-Instruct
5
- - nvidia/OpenMath2-Llama3.1-8B
6
- - SimpleBerry/LLaMA-O1-Base-1127
7
- - ruliad/deepthought-8b-llama-v0.01-alpha
8
- - Skywork/Skywork-o1-Open-Llama-3.1-8B
9
- - Magpie-Align/MagpieLM-8B-SFT-v0.1
10
- - arcee-ai/Llama-3.1-SuperNova-Lite
11
- - huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated
12
  library_name: transformers
13
  tags:
14
- - mergekit
15
- - merge
16
-
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  ---
18
- # merge
19
-
20
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
21
-
22
- ## Merge Details
23
- ### Merge Method
24
-
25
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [unsloth/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/unsloth/Meta-Llama-3.1-8B-Instruct) as a base.
26
-
27
- ### Models Merged
28
-
29
- The following models were included in the merge:
30
- * [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)
31
- * [nvidia/OpenMath2-Llama3.1-8B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)
32
- * [SimpleBerry/LLaMA-O1-Base-1127](https://huggingface.co/SimpleBerry/LLaMA-O1-Base-1127)
33
- * [ruliad/deepthought-8b-llama-v0.01-alpha](https://huggingface.co/ruliad/deepthought-8b-llama-v0.01-alpha)
34
- * [Skywork/Skywork-o1-Open-Llama-3.1-8B](https://huggingface.co/Skywork/Skywork-o1-Open-Llama-3.1-8B)
35
- * [Magpie-Align/MagpieLM-8B-SFT-v0.1](https://huggingface.co/Magpie-Align/MagpieLM-8B-SFT-v0.1)
36
- * [arcee-ai/Llama-3.1-SuperNova-Lite](https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite)
37
- * [huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated](https://huggingface.co/huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated)
38
-
39
- ### Configuration
40
-
41
- The following YAML configuration was used to produce this model:
42
-
43
- ```yaml
44
- models:
45
- - model: unsloth/Meta-Llama-3.1-8B-Instruct #base model
46
- - model: ruliad/deepthought-8b-llama-v0.01-alpha #logic
47
- - model: huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated #uncensored
48
- - model: arcee-ai/Llama-3.1-SuperNova-Lite #text generation
49
- - model: Skywork/Skywork-o1-Open-Llama-3.1-8B #balancing
50
- - model: nvidia/OpenMath2-Llama3.1-8B #math
51
- - model: SimpleBerry/LLaMA-O1-Base-1127 #dpo
52
- - model: NousResearch/Hermes-3-Llama-3.1-8B #universal
53
- - model: Magpie-Align/MagpieLM-8B-SFT-v0.1 #dpo & reasoning
54
- merge_method: model_stock
55
- base_model: unsloth/Meta-Llama-3.1-8B-Instruct
56
- normalize: true
57
- int8_mask: true
58
- dtype: bfloat16
59
- ```
 
1
  ---
2
+ base_model: meta-llama/Llama-3.1-8B-Instruct
 
 
 
 
 
 
 
 
 
3
  library_name: transformers
4
  tags:
5
+ - llama
6
+ - llama3
7
+ - llama-ti
8
+ - fluently-lm
9
+ - fluently-merge
10
+ - chat
11
+ - conversational
12
+ - reflection
13
+ - reasoning
14
+ - math
15
+ - writing
16
+ pipeline_tag: text-generation
17
+ language:
18
+ - en
19
+ - code
20
+ license: apache-2.0
21
  ---
22
+
23
+ # Llama3.1 8B **TI** Instruct
24
+
25
+ Llama TI is an improved Llama (from Meta AI), some aspects of the model have been revised and some features have been added.
26
+
27
+ ## Info
28
+
29
+ ### Main
30
+
31
+ The model is based on Meta-Llama-3.1-8B-Instruct, and has the same 8.03B parameters. The Llama3 architecture (LlamaForCausalLM) has been preserved and the model launch methods are the same.
32
+
33
+ ### Differences
34
+
35
+ Thanks to additional training and advanced merging, it was possible to improve mathematical, biological, reasoning and writing skills.
36
+
37
+ Now the model can:
38
+ - Count well and solve mathematical/physical problems
39
+ - Reason/think logically
40
+ - Write creatively (in many languages)
41
+ - Code well
42
+ - Process/analyze large texts
43
+
44
+ ### Where is the base version?
45
+
46
+ It is available [here](https://hf.co/fluently-lm/Llama-TI-8B)!
47
+
48
+ ### Special thanks to:
49
+ Meta AI, NVIDIA, Arcee AI, SkyWork, NousReaserch, Unsloth and Project Fluently.
50
+
51
+ *Developed and uploaded by [ehristoforu](https://t.me/ehristoforu_ai).*