Update README.md
README.md
CHANGED
@@ -1 +1,8 @@
This model is a fine-tuned checkpoint of DistilBERT-base-uncased, trained on SST-2. It reaches an accuracy of 91.3 on the dev set (for comparison, the BERT bert-base-uncased version reaches an accuracy of 92.7).
Fine-tuning hyper-parameters
learning_rate = 1e-5
batch_size = 32
warmup = 600
max_seq_length = 128
num_train_epochs = 3.0
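
To make the hyper-parameters above concrete, here is a minimal sketch of how they might translate into a step count and a learning-rate schedule. It rests on several assumptions that this README does not state: that `warmup` counts optimizer steps, that the schedule is linear warmup followed by linear decay, and that there is no gradient accumulation. The helper names `total_training_steps` and `lr_at_step` are hypothetical and are not part of any library.

```python
import math

# Values taken from the hyper-parameter list above.
LEARNING_RATE = 1e-5
BATCH_SIZE = 32
WARMUP_STEPS = 600      # assumed to be a number of optimizer steps
MAX_SEQ_LENGTH = 128    # listed for completeness; unused in this sketch
NUM_TRAIN_EPOCHS = 3.0


def total_training_steps(num_examples, batch_size=BATCH_SIZE, epochs=NUM_TRAIN_EPOCHS):
    """Number of optimizer steps, assuming no gradient accumulation."""
    steps_per_epoch = math.ceil(num_examples / batch_size)
    return int(steps_per_epoch * epochs)


def lr_at_step(step, total_steps, peak_lr=LEARNING_RATE, warmup_steps=WARMUP_STEPS):
    """Learning rate under an ASSUMED linear-warmup, linear-decay schedule."""
    if step < warmup_steps:
        # Ramp linearly from 0 up to peak_lr over the warmup steps.
        return peak_lr * step / warmup_steps
    # Decay linearly from peak_lr down to 0 at the final step.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # 67349 is assumed here to be the size of the SST-2 training split.
    steps = total_training_steps(67349)
    for s in (0, 300, 600, steps // 2, steps):
        print(s, lr_at_step(s, steps))
```

Under these assumptions, warmup would cover roughly the first tenth of training; a different schedule or accumulation setting would change that picture.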