yhavinga committed on
Commit 4baaecb
1 Parent(s): a882068

Update README.md

Files changed (1)
  1. README.md +7 -2
README.md CHANGED
@@ -3,11 +3,16 @@ library_name: transformers
 tags: []
 ---
 
-# Model Card for Model ID
+# boreas-10_7b-v1
 
-<!-- Provide a quick summary of what the model is/does. -->
+This is the result of step 2 of the upscaling of [Boreas-7B](https://huggingface.co/yhavinga/Boreas-7B) with [mergekit](https://github.com/cg123/mergekit).
+It is trying to reproduce the upscaling described in the [SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling](https://arxiv.org/abs/2312.15166)
+paper.
+This model is the result after step 2 from the figure below:
 
+![SOLAR 10.7B Depth up scaling](img_2.png)
 
+The model was continuously pretrained on a mix of Dutch and English for 20B tokens.
 
 ## Model Details
 
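
As a rough illustration of the depth up-scaling the new README text refers to: the SOLAR paper duplicates an n-layer base model, drops the last m layers from one copy and the first m layers from the other, and stacks what remains. The sketch below only computes the resulting layer order. The values n = 32 and m = 8 are the ones reported in the SOLAR paper; that this model's mergekit step used the same values is an assumption, and the function name `depth_upscale_layers` is hypothetical.

```python
def depth_upscale_layers(n_layers: int, m: int) -> list[int]:
    """Return base-model layer indices, in order, for the up-scaled stack.

    Assumption: follows the SOLAR depth up-scaling recipe, not necessarily
    the exact configuration used for this model.
    """
    if not 0 <= m < n_layers:
        raise ValueError("m must satisfy 0 <= m < n_layers")
    first_copy = list(range(0, n_layers - m))  # copy 1: drop the last m layers
    second_copy = list(range(m, n_layers))     # copy 2: drop the first m layers
    return first_copy + second_copy


# Values reported in the SOLAR paper (assumed here, not confirmed for this model).
layers = depth_upscale_layers(n_layers=32, m=8)
print(len(layers))
```

With these values the stack has 2 * (32 - 8) = 48 layers, and base layers 8 through 23 appear twice: once in each copy.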