Update README.md
README.md
CHANGED
@@ -21,7 +21,7 @@ We make two important preprocessing steps:
 
 ### Pretraining Procedures
 We train the Clinical-T5-Large model from scratch using a cased-vocab of 32,000. We train it for 780,000 steps, using a batch size of 12 per TPU pod (8 pods total), and a sequence length of 512.
-This results in a batch size of 49,152. Accounting for the number of steps
+This results in a batch size of 49,152. Accounting for the number of steps, this equates to 38B tokens. We were aiming for 40B, but our Google Cloud instance broke!
 
 # How to use the Model
 You will first need to have credentialed PhysioNet access to use model. Why? There is reasonable evidence that these models contain leakage, especially the larger ones. Releasing a model that leaks these notes would be a data-use agreement violation. To get PhysioNet access, you must pass the CITI training.
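
The token figures in the added README line can be checked with a few multiplications. The sketch below is not part of the commit. It assumes that "a batch size of 49,152" means tokens per step (12 sequences per pod, 8 pods, 512 tokens per sequence) and that every sequence is filled to the full 512 tokens, so the total is an upper bound. The constant and function names are illustrative.

```python
# Values taken from the README text in the diff above.
SEQS_PER_POD = 12   # sequences per TPU pod per step
NUM_PODS = 8        # number of TPU pods
SEQ_LEN = 512       # tokens per sequence (assumed fully packed)
STEPS = 780_000     # training steps


def tokens_per_step(seqs_per_pod: int, num_pods: int, seq_len: int) -> int:
    """Tokens consumed in one optimizer step across all pods."""
    return seqs_per_pod * num_pods * seq_len


def total_tokens(steps: int, per_step: int) -> int:
    """Tokens consumed over the whole run."""
    return steps * per_step


per_step = tokens_per_step(SEQS_PER_POD, NUM_PODS, SEQ_LEN)
total = total_tokens(STEPS, per_step)

print(f"{per_step:,} tokens per step")                  # 49,152 tokens per step
print(f"{total:,} tokens total (~{total / 1e9:.1f}B)")  # 38,338,560,000 tokens total (~38.3B)
```

Under these assumptions the run comes to roughly 38.3B tokens, which matches the rounded 38B stated in the new line.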