JanPf commited on
Commit
ab35f28
·
verified ·
1 Parent(s): 2b67d64

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -50
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  library_name: peft
3
- base_model: LSX-UniWue/LLaMmlein_1b
4
  tags:
5
  - trl
6
  - sft
@@ -17,53 +17,7 @@ language:
17
  - de
18
  ---
19
 
20
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
21
- should probably proofread and complete it, then remove this comment. -->
22
 
23
- # LLaMmlein_1b_chat_all
24
-
25
- This model is a fine-tuned version of [LSX-UniWue/LLaMmlein_1b](https://huggingface.co/LSX-UniWue/LLaMmlein_1b) on an unknown dataset.
26
- It achieves the following results on the evaluation set:
27
- - Loss: 1.5824
28
-
29
- ## Model description
30
-
31
- More information needed
32
-
33
- ## Intended uses & limitations
34
-
35
- More information needed
36
-
37
- ## Training and evaluation data
38
-
39
- More information needed
40
-
41
- ## Training procedure
42
-
43
- ### Training hyperparameters
44
-
45
- The following hyperparameters were used during training:
46
- - learning_rate: 5e-05
47
- - train_batch_size: 8
48
- - eval_batch_size: 8
49
- - seed: 42
50
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
- - lr_scheduler_type: linear
52
- - num_epochs: 3.0
53
-
54
- ### Training results
55
-
56
- | Training Loss | Epoch | Step | Validation Loss |
57
- |:-------------:|:-----:|:-----:|:---------------:|
58
- | 1.5341 | 1.0 | 15616 | 1.5940 |
59
- | 1.4226 | 2.0 | 31232 | 1.5851 |
60
- | 1.2294 | 3.0 | 46848 | 1.5824 |
61
-
62
-
63
- ### Framework versions
64
-
65
- - PEFT 0.13.2
66
- - Transformers 4.44.2
67
- - Pytorch 2.4.1+cu121
68
- - Datasets 3.0.1
69
- - Tokenizers 0.19.1
 
1
  ---
2
  library_name: peft
3
+ base_model: LSX-UniWue/LLaMmlein_1B
4
  tags:
5
  - trl
6
  - sft
 
17
  - de
18
  ---
19
 
20
+ # LLäMmlein 1B Chat
 
21
 
22
+ This is a chat adapter for the German Tinyllama 1B language model.
23
+ Find more details on our [page](https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/)!