Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ library_name: transformers
|
|
11 |
550M parameter Transformer-XL style model trained on 100B tokens of The Pile!
|
12 |
|
13 |
This model was originally trained for the "Direct Prefrence Heads" paper, but will also be used as the basis for much of my future research.
|
14 |
-
All code used to train and run these models is available here: https://github.com/Avelina9X/
|
15 |
|
16 |
## Model Architecture
|
17 |
| Name | Value |
|
|
|
11 |
550M parameter Transformer-XL style model trained on 100B tokens of The Pile!
|
12 |
|
13 |
This model was originally trained for the "Direct Prefrence Heads" paper, but will also be used as the basis for much of my future research.
|
14 |
+
All code used to train and run these models is available here: https://github.com/Avelina9X/direct-preference-heads
|
15 |
|
16 |
## Model Architecture
|
17 |
| Name | Value |
|