metadata

library_name: transformers
tags:
  - generated_from_trainer
metrics:
  - accuracy
model-index:
  - name: outputs
    results: []

outputs

This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 128
eval_batch_size: 128
seed: 42
gradient_accumulation_steps: 4
total_train_batch_size: 512
optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 1000
num_epochs: 50

Training Loss	Epoch	Step	Accuracy	Validation Loss
2.9726	0.9953	53	0.1321	2.9325
2.5775	1.9906	106	0.2765	2.4443
2.0438	2.9859	159	0.4247	1.9166
1.5681	4.0	213	0.5003	1.5682
1.3107	4.9953	266	0.5581	1.3651
1.1594	5.9906	319	0.6131	1.1995
1.0232	6.9859	372	0.6575	1.0813
0.934	8.0	426	0.7081	0.9652
0.8727	8.9953	479	0.7333	0.8802
0.7644	9.9906	532	0.7378	0.8551
0.7007	10.9859	585	0.7663	0.7584
0.6585	12.0	639	0.7673	0.7550
0.59	12.9953	692	0.7847	0.7072
0.5775	13.9906	745	0.7860	0.7042
0.5487	14.9859	798	0.7981	0.6649
0.5296	16.0	852	0.7958	0.6387
0.4866	16.9953	905	0.8125	0.6029
0.4779	17.9906	958	0.7935	0.6498
0.4418	18.9859	1011	0.8128	0.6004
0.4334	20.0	1065	0.8165	0.5995
0.4097	20.9953	1118	0.8326	0.5508
0.3947	21.9906	1171	0.8315	0.5585
0.3521	22.9859	1224	0.8328	0.5513
0.3298	24.0	1278	0.8319	0.5810
0.3216	24.9953	1331	0.8358	0.5499
0.3086	25.9906	1384	0.8394	0.5383
0.2912	26.9859	1437	0.8349	0.5845
0.2801	28.0	1491	0.8423	0.5717
0.2677	28.9953	1544	0.8434	0.5563
0.263	29.9906	1597	0.8434	0.5684
0.244	30.9859	1650	0.8408	0.5900
0.2449	32.0	1704	0.8330	0.6121
0.2276	32.9953	1757	0.8428	0.5891
0.2407	33.9906	1810	0.8374	0.6033
0.1997	34.9859	1863	0.8459	0.5969
0.2081	36.0	1917	0.8451	0.5824
0.1936	36.9953	1970	0.8470	0.5834
0.1975	37.9906	2023	0.8446	0.6106
0.1938	38.9859	2076	0.8433	0.6166
0.1874	40.0	2130	0.8538	0.5823
0.184	40.9953	2183	0.8434	0.6395
0.1584	41.9906	2236	0.8542	0.6060
0.1608	42.9859	2289	0.8479	0.6289
0.1604	44.0	2343	0.8523	0.6105
0.1398	44.9953	2396	0.8502	0.6340
0.1487	45.9906	2449	0.8489	0.6414
0.137	46.9859	2502	0.8484	0.6285
0.1223	48.0	2556	0.8507	0.6331
0.1339	48.9953	2609	0.8492	0.6295
0.1368	49.7653	2650	0.8503	0.6328