Model based on distilbert-base-uncased model trained on natural question short dataset. | |
Trained for one episode with AdamW optimizer and learning rate of 5e-03 and no warmup steps. | |
We achieved a f1 score of ... and an em score of ... |
Model based on distilbert-base-uncased model trained on natural question short dataset. | |
Trained for one episode with AdamW optimizer and learning rate of 5e-03 and no warmup steps. | |
We achieved a f1 score of ... and an em score of ... |