Update README.md
Browse files
README.md
CHANGED
@@ -8,10 +8,10 @@ library_name: transformers
|
|
8 |
---
|
9 |
# Lovelace Medium Alpha1
|
10 |
|
11 |
-
|
12 |
|
13 |
This model was originally trained for the "Direct Prefrence Heads" paper, but will also be used as the basis for much of my future research.
|
14 |
-
All code used to train and run these models is available here: https://github.com/Avelina9X/direct-preference-heads
|
15 |
|
16 |
## Model Architecture
|
17 |
| Name | Value |
|
|
|
8 |
---
|
9 |
# Lovelace Medium Alpha1
|
10 |
|
11 |
+
551M parameter Transformer-XL style model trained on 100B tokens of The Pile!
|
12 |
|
13 |
This model was originally trained for the "Direct Prefrence Heads" paper, but will also be used as the basis for much of my future research.
|
14 |
+
All code used to train and run these models is available here: https://github.com/Avelina9X/direct-preference-heads and our paper is available here: https://arxiv.org/abs/2405.20053
|
15 |
|
16 |
## Model Architecture
|
17 |
| Name | Value |
|