Model based on distilbert-base-uncased model trained on natural question short dataset. Trained for one episode with AdamW optimizer and learning rate of 5e-03 and no warmup steps. We achieved a f1 score of ... and an em score of ...
Model based on distilbert-base-uncased model trained on natural question short dataset. Trained for one episode with AdamW optimizer and learning rate of 5e-03 and no warmup steps. We achieved a f1 score of ... and an em score of ...