2024-03-28 09:08:20,895 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,897 Model: "SequenceTagger(
  (embeddings): StackedEmbeddings(
    (list_embedding_0): WordEmbeddings(
      'glove'
      (embedding): Embedding(400001, 100)
    )
    (list_embedding_1): FlairEmbeddings(
      (lm): LanguageModel(
        (drop): Dropout(p=0.05, inplace=False)
        (encoder): Embedding(300, 100)
        (rnn): LSTM(100, 2048)
      )
    )
    (list_embedding_2): FlairEmbeddings(
      (lm): LanguageModel(
        (drop): Dropout(p=0.05, inplace=False)
        (encoder): Embedding(300, 100)
        (rnn): LSTM(100, 2048)
      )
    )
  )
  (word_dropout): WordDropout(p=0.05)
  (locked_dropout): LockedDropout(p=0.5)
  (embedding2nn): Linear(in_features=4196, out_features=4196, bias=True)
  (rnn): LSTM(4196, 256, batch_first=True, bidirectional=True)
  (linear): Linear(in_features=512, out_features=27, bias=True)
  (loss_function): ViterbiLoss()
  (crf): CRF()
)"
2024-03-28 09:08:20,899 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,901 Corpus: 50 train + 16 dev + 2 test sentences
2024-03-28 09:08:20,903 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,905 Train:  66 sentences
2024-03-28 09:08:20,906         (train_with_dev=True, train_with_test=False)
2024-03-28 09:08:20,908 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,909 Training Params:
2024-03-28 09:08:20,910  - learning_rate: "0.1" 
2024-03-28 09:08:20,912  - mini_batch_size: "32"
2024-03-28 09:08:20,913  - max_epochs: "150"
2024-03-28 09:08:20,914  - shuffle: "True"
2024-03-28 09:08:20,915 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,917 Plugins:
2024-03-28 09:08:20,918  - AnnealOnPlateau | patience: '3', anneal_factor: '0.5', min_learning_rate: '0.0001'
2024-03-28 09:08:20,919 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,920 Final evaluation on model from best epoch (best-model.pt)
2024-03-28 09:08:20,921  - metric: "('micro avg', 'f1-score')"
2024-03-28 09:08:20,923 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,924 Computation:
2024-03-28 09:08:20,925  - compute on device: cuda:0
2024-03-28 09:08:20,927  - embedding storage: cpu
2024-03-28 09:08:20,928 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,929 Model training base path: "resources/taggers/ner-english"
2024-03-28 09:08:20,930 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:20,931 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:21,191 epoch 1 - iter 1/3 - loss 3.36974860 - time (sec): 0.26 - samples/sec: 1392.44 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,396 epoch 1 - iter 2/3 - loss 3.15954622 - time (sec): 0.46 - samples/sec: 1629.06 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,493 epoch 1 - iter 3/3 - loss 3.14478873 - time (sec): 0.56 - samples/sec: 1391.02 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,495 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:21,498 EPOCH 1 done: loss 3.1448 - lr: 0.100000
2024-03-28 09:08:21,500  - 0 epochs without improvement
2024-03-28 09:08:21,502 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:21,589 epoch 2 - iter 1/3 - loss 2.45131045 - time (sec): 0.08 - samples/sec: 4558.10 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,668 epoch 2 - iter 2/3 - loss 2.39363852 - time (sec): 0.16 - samples/sec: 4622.19 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,696 epoch 2 - iter 3/3 - loss 2.39924183 - time (sec): 0.19 - samples/sec: 4072.04 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,698 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:21,700 EPOCH 2 done: loss 2.3992 - lr: 0.100000
2024-03-28 09:08:21,705  - 0 epochs without improvement
2024-03-28 09:08:21,709 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:21,802 epoch 3 - iter 1/3 - loss 2.23206190 - time (sec): 0.09 - samples/sec: 4145.47 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,881 epoch 3 - iter 2/3 - loss 2.25305821 - time (sec): 0.17 - samples/sec: 4484.56 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,905 epoch 3 - iter 3/3 - loss 2.25758761 - time (sec): 0.19 - samples/sec: 4035.32 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:21,907 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:21,908 EPOCH 3 done: loss 2.2576 - lr: 0.100000
2024-03-28 09:08:21,910  - 0 epochs without improvement
2024-03-28 09:08:21,912 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:21,996 epoch 4 - iter 1/3 - loss 1.98101494 - time (sec): 0.08 - samples/sec: 4904.79 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,068 epoch 4 - iter 2/3 - loss 2.13153052 - time (sec): 0.15 - samples/sec: 4963.13 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,095 epoch 4 - iter 3/3 - loss 2.14371007 - time (sec): 0.18 - samples/sec: 4357.04 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,097 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,099 EPOCH 4 done: loss 2.1437 - lr: 0.100000
2024-03-28 09:08:22,101  - 0 epochs without improvement
2024-03-28 09:08:22,102 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,186 epoch 5 - iter 1/3 - loss 1.97350561 - time (sec): 0.08 - samples/sec: 5013.65 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,263 epoch 5 - iter 2/3 - loss 2.14281019 - time (sec): 0.16 - samples/sec: 4793.23 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,292 epoch 5 - iter 3/3 - loss 2.13209609 - time (sec): 0.19 - samples/sec: 4168.19 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,294 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,297 EPOCH 5 done: loss 2.1321 - lr: 0.100000
2024-03-28 09:08:22,301  - 0 epochs without improvement
2024-03-28 09:08:22,303 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,380 epoch 6 - iter 1/3 - loss 1.95909884 - time (sec): 0.07 - samples/sec: 4953.78 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,472 epoch 6 - iter 2/3 - loss 1.98597567 - time (sec): 0.17 - samples/sec: 4577.26 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,492 epoch 6 - iter 3/3 - loss 1.96322728 - time (sec): 0.19 - samples/sec: 4170.18 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,494 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,496 EPOCH 6 done: loss 1.9632 - lr: 0.100000
2024-03-28 09:08:22,499  - 0 epochs without improvement
2024-03-28 09:08:22,501 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,576 epoch 7 - iter 1/3 - loss 2.11116446 - time (sec): 0.07 - samples/sec: 4956.72 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,651 epoch 7 - iter 2/3 - loss 2.01348722 - time (sec): 0.15 - samples/sec: 5058.67 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,678 epoch 7 - iter 3/3 - loss 2.00619598 - time (sec): 0.17 - samples/sec: 4461.42 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,680 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,683 EPOCH 7 done: loss 2.0062 - lr: 0.100000
2024-03-28 09:08:22,685  - 1 epochs without improvement
2024-03-28 09:08:22,687 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,762 epoch 8 - iter 1/3 - loss 1.82821058 - time (sec): 0.07 - samples/sec: 5176.83 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,837 epoch 8 - iter 2/3 - loss 1.92655447 - time (sec): 0.15 - samples/sec: 5095.66 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,865 epoch 8 - iter 3/3 - loss 1.93318620 - time (sec): 0.18 - samples/sec: 4426.23 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:22,867 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,869 EPOCH 8 done: loss 1.9332 - lr: 0.100000
2024-03-28 09:08:22,873  - 0 epochs without improvement
2024-03-28 09:08:22,876 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:22,966 epoch 9 - iter 1/3 - loss 1.64564751 - time (sec): 0.09 - samples/sec: 4254.01 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,056 epoch 9 - iter 2/3 - loss 1.63239704 - time (sec): 0.18 - samples/sec: 4272.42 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,086 epoch 9 - iter 3/3 - loss 1.62794558 - time (sec): 0.21 - samples/sec: 3783.46 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,088 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,090 EPOCH 9 done: loss 1.6279 - lr: 0.100000
2024-03-28 09:08:23,092  - 0 epochs without improvement
2024-03-28 09:08:23,094 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,176 epoch 10 - iter 1/3 - loss 1.51423518 - time (sec): 0.08 - samples/sec: 4972.85 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,263 epoch 10 - iter 2/3 - loss 1.53373787 - time (sec): 0.16 - samples/sec: 4590.31 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,297 epoch 10 - iter 3/3 - loss 1.52787663 - time (sec): 0.20 - samples/sec: 3938.30 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,301 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,303 EPOCH 10 done: loss 1.5279 - lr: 0.100000
2024-03-28 09:08:23,306  - 0 epochs without improvement
2024-03-28 09:08:23,308 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,394 epoch 11 - iter 1/3 - loss 1.42197424 - time (sec): 0.08 - samples/sec: 4545.88 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,481 epoch 11 - iter 2/3 - loss 1.35518180 - time (sec): 0.17 - samples/sec: 4371.34 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,515 epoch 11 - iter 3/3 - loss 1.35447597 - time (sec): 0.21 - samples/sec: 3785.22 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,517 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,520 EPOCH 11 done: loss 1.3545 - lr: 0.100000
2024-03-28 09:08:23,523  - 0 epochs without improvement
2024-03-28 09:08:23,524 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,626 epoch 12 - iter 1/3 - loss 1.40993639 - time (sec): 0.10 - samples/sec: 3734.89 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,722 epoch 12 - iter 2/3 - loss 1.49738996 - time (sec): 0.20 - samples/sec: 3849.63 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,759 epoch 12 - iter 3/3 - loss 1.49414163 - time (sec): 0.23 - samples/sec: 3342.20 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,762 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,763 EPOCH 12 done: loss 1.4941 - lr: 0.100000
2024-03-28 09:08:23,765  - 1 epochs without improvement
2024-03-28 09:08:23,767 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,841 epoch 13 - iter 1/3 - loss 1.26951506 - time (sec): 0.07 - samples/sec: 5241.77 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,902 epoch 13 - iter 2/3 - loss 1.44954909 - time (sec): 0.13 - samples/sec: 5637.53 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,928 epoch 13 - iter 3/3 - loss 1.42776139 - time (sec): 0.16 - samples/sec: 4888.19 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:23,930 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:23,933 EPOCH 13 done: loss 1.4278 - lr: 0.100000
2024-03-28 09:08:23,935  - 2 epochs without improvement
2024-03-28 09:08:23,938 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,001 epoch 14 - iter 1/3 - loss 1.25857317 - time (sec): 0.06 - samples/sec: 6187.07 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,065 epoch 14 - iter 2/3 - loss 1.24984402 - time (sec): 0.13 - samples/sec: 6026.61 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,087 epoch 14 - iter 3/3 - loss 1.24510320 - time (sec): 0.15 - samples/sec: 5292.42 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,089 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,091 EPOCH 14 done: loss 1.2451 - lr: 0.100000
2024-03-28 09:08:24,093  - 0 epochs without improvement
2024-03-28 09:08:24,096 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,152 epoch 15 - iter 1/3 - loss 1.13958904 - time (sec): 0.05 - samples/sec: 6868.95 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,217 epoch 15 - iter 2/3 - loss 1.18272163 - time (sec): 0.12 - samples/sec: 6314.93 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,241 epoch 15 - iter 3/3 - loss 1.18627405 - time (sec): 0.14 - samples/sec: 5430.97 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,243 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,246 EPOCH 15 done: loss 1.1863 - lr: 0.100000
2024-03-28 09:08:24,251  - 0 epochs without improvement
2024-03-28 09:08:24,252 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,320 epoch 16 - iter 1/3 - loss 1.08566695 - time (sec): 0.06 - samples/sec: 6081.82 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,380 epoch 16 - iter 2/3 - loss 1.07927891 - time (sec): 0.12 - samples/sec: 6157.72 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,404 epoch 16 - iter 3/3 - loss 1.08804569 - time (sec): 0.15 - samples/sec: 5318.34 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,405 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,408 EPOCH 16 done: loss 1.0880 - lr: 0.100000
2024-03-28 09:08:24,411  - 0 epochs without improvement
2024-03-28 09:08:24,414 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,478 epoch 17 - iter 1/3 - loss 0.98281485 - time (sec): 0.06 - samples/sec: 6049.56 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,553 epoch 17 - iter 2/3 - loss 1.19747295 - time (sec): 0.14 - samples/sec: 5529.82 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,575 epoch 17 - iter 3/3 - loss 1.19897193 - time (sec): 0.16 - samples/sec: 4906.66 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,577 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,580 EPOCH 17 done: loss 1.1990 - lr: 0.100000
2024-03-28 09:08:24,583  - 1 epochs without improvement
2024-03-28 09:08:24,585 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,659 epoch 18 - iter 1/3 - loss 1.06397110 - time (sec): 0.07 - samples/sec: 5470.45 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,720 epoch 18 - iter 2/3 - loss 1.12673372 - time (sec): 0.13 - samples/sec: 5735.98 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,742 epoch 18 - iter 3/3 - loss 1.13683503 - time (sec): 0.16 - samples/sec: 5019.50 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,744 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,747 EPOCH 18 done: loss 1.1368 - lr: 0.100000
2024-03-28 09:08:24,750  - 2 epochs without improvement
2024-03-28 09:08:24,752 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,815 epoch 19 - iter 1/3 - loss 1.13094212 - time (sec): 0.06 - samples/sec: 6425.44 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,880 epoch 19 - iter 2/3 - loss 1.06757936 - time (sec): 0.12 - samples/sec: 6053.73 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,901 epoch 19 - iter 3/3 - loss 1.07417968 - time (sec): 0.15 - samples/sec: 5333.04 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:24,902 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,905 EPOCH 19 done: loss 1.0742 - lr: 0.100000
2024-03-28 09:08:24,907  - 0 epochs without improvement
2024-03-28 09:08:24,910 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:24,974 epoch 20 - iter 1/3 - loss 0.98265959 - time (sec): 0.06 - samples/sec: 6130.25 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,035 epoch 20 - iter 2/3 - loss 0.95606777 - time (sec): 0.12 - samples/sec: 6115.30 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,059 epoch 20 - iter 3/3 - loss 0.95184126 - time (sec): 0.15 - samples/sec: 5278.72 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,061 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,066 EPOCH 20 done: loss 0.9518 - lr: 0.100000
2024-03-28 09:08:25,068  - 0 epochs without improvement
2024-03-28 09:08:25,071 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,133 epoch 21 - iter 1/3 - loss 0.86139335 - time (sec): 0.06 - samples/sec: 6546.08 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,194 epoch 21 - iter 2/3 - loss 0.88997541 - time (sec): 0.12 - samples/sec: 6239.01 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,216 epoch 21 - iter 3/3 - loss 0.89881944 - time (sec): 0.14 - samples/sec: 5424.95 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,219 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,221 EPOCH 21 done: loss 0.8988 - lr: 0.100000
2024-03-28 09:08:25,224  - 0 epochs without improvement
2024-03-28 09:08:25,227 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,298 epoch 22 - iter 1/3 - loss 0.87613110 - time (sec): 0.07 - samples/sec: 5678.58 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,360 epoch 22 - iter 2/3 - loss 0.85298542 - time (sec): 0.13 - samples/sec: 5823.45 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,381 epoch 22 - iter 3/3 - loss 0.85040579 - time (sec): 0.15 - samples/sec: 5127.33 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,383 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,386 EPOCH 22 done: loss 0.8504 - lr: 0.100000
2024-03-28 09:08:25,389  - 0 epochs without improvement
2024-03-28 09:08:25,391 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,454 epoch 23 - iter 1/3 - loss 0.84350519 - time (sec): 0.06 - samples/sec: 6063.21 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,521 epoch 23 - iter 2/3 - loss 0.80839760 - time (sec): 0.13 - samples/sec: 5952.73 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,547 epoch 23 - iter 3/3 - loss 0.80830466 - time (sec): 0.15 - samples/sec: 5072.50 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,549 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,551 EPOCH 23 done: loss 0.8083 - lr: 0.100000
2024-03-28 09:08:25,553  - 0 epochs without improvement
2024-03-28 09:08:25,555 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,638 epoch 24 - iter 1/3 - loss 0.81519134 - time (sec): 0.08 - samples/sec: 4674.16 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,701 epoch 24 - iter 2/3 - loss 0.73801863 - time (sec): 0.14 - samples/sec: 5197.51 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,725 epoch 24 - iter 3/3 - loss 0.73577389 - time (sec): 0.17 - samples/sec: 4607.71 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,727 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,730 EPOCH 24 done: loss 0.7358 - lr: 0.100000
2024-03-28 09:08:25,733  - 0 epochs without improvement
2024-03-28 09:08:25,735 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,802 epoch 25 - iter 1/3 - loss 0.66769132 - time (sec): 0.06 - samples/sec: 5861.73 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,871 epoch 25 - iter 2/3 - loss 0.71950535 - time (sec): 0.13 - samples/sec: 5695.03 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,894 epoch 25 - iter 3/3 - loss 0.72146968 - time (sec): 0.16 - samples/sec: 4988.39 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:25,896 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,901 EPOCH 25 done: loss 0.7215 - lr: 0.100000
2024-03-28 09:08:25,902  - 0 epochs without improvement
2024-03-28 09:08:25,906 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:25,965 epoch 26 - iter 1/3 - loss 0.77873421 - time (sec): 0.06 - samples/sec: 6787.99 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,028 epoch 26 - iter 2/3 - loss 0.79412269 - time (sec): 0.12 - samples/sec: 6309.97 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,053 epoch 26 - iter 3/3 - loss 0.78410294 - time (sec): 0.14 - samples/sec: 5376.37 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,055 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,056 EPOCH 26 done: loss 0.7841 - lr: 0.100000
2024-03-28 09:08:26,057  - 1 epochs without improvement
2024-03-28 09:08:26,059 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,120 epoch 27 - iter 1/3 - loss 0.67765564 - time (sec): 0.06 - samples/sec: 6209.52 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,185 epoch 27 - iter 2/3 - loss 0.74440163 - time (sec): 0.12 - samples/sec: 6024.24 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,211 epoch 27 - iter 3/3 - loss 0.74220062 - time (sec): 0.15 - samples/sec: 5168.46 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,212 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,216 EPOCH 27 done: loss 0.7422 - lr: 0.100000
2024-03-28 09:08:26,219  - 2 epochs without improvement
2024-03-28 09:08:26,222 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,281 epoch 28 - iter 1/3 - loss 0.65576854 - time (sec): 0.06 - samples/sec: 6429.32 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,346 epoch 28 - iter 2/3 - loss 0.67840381 - time (sec): 0.12 - samples/sec: 6203.36 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,371 epoch 28 - iter 3/3 - loss 0.69483660 - time (sec): 0.15 - samples/sec: 5328.19 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,373 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,380 EPOCH 28 done: loss 0.6948 - lr: 0.100000
2024-03-28 09:08:26,383  - 0 epochs without improvement
2024-03-28 09:08:26,385 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,453 epoch 29 - iter 1/3 - loss 0.60680922 - time (sec): 0.07 - samples/sec: 5681.74 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,520 epoch 29 - iter 2/3 - loss 0.71351490 - time (sec): 0.13 - samples/sec: 5698.89 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,543 epoch 29 - iter 3/3 - loss 0.72190195 - time (sec): 0.16 - samples/sec: 4996.69 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,545 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,547 EPOCH 29 done: loss 0.7219 - lr: 0.100000
2024-03-28 09:08:26,551  - 1 epochs without improvement
2024-03-28 09:08:26,553 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,616 epoch 30 - iter 1/3 - loss 0.54127716 - time (sec): 0.06 - samples/sec: 5980.72 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,719 epoch 30 - iter 2/3 - loss 0.66156022 - time (sec): 0.16 - samples/sec: 4617.44 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,754 epoch 30 - iter 3/3 - loss 0.66835733 - time (sec): 0.20 - samples/sec: 3931.30 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,756 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,757 EPOCH 30 done: loss 0.6684 - lr: 0.100000
2024-03-28 09:08:26,759  - 0 epochs without improvement
2024-03-28 09:08:26,761 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,843 epoch 31 - iter 1/3 - loss 0.56766907 - time (sec): 0.08 - samples/sec: 4600.84 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,931 epoch 31 - iter 2/3 - loss 0.66296679 - time (sec): 0.17 - samples/sec: 4480.14 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,965 epoch 31 - iter 3/3 - loss 0.66437075 - time (sec): 0.20 - samples/sec: 3851.74 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:26,968 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:26,972 EPOCH 31 done: loss 0.6644 - lr: 0.100000
2024-03-28 09:08:26,975  - 0 epochs without improvement
2024-03-28 09:08:26,979 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,072 epoch 32 - iter 1/3 - loss 0.47235793 - time (sec): 0.09 - samples/sec: 4231.03 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,154 epoch 32 - iter 2/3 - loss 0.66783573 - time (sec): 0.17 - samples/sec: 4445.75 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,180 epoch 32 - iter 3/3 - loss 0.67434436 - time (sec): 0.20 - samples/sec: 3962.60 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,185 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,187 EPOCH 32 done: loss 0.6743 - lr: 0.100000
2024-03-28 09:08:27,189  - 1 epochs without improvement
2024-03-28 09:08:27,191 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,286 epoch 33 - iter 1/3 - loss 0.51676854 - time (sec): 0.09 - samples/sec: 4329.67 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,368 epoch 33 - iter 2/3 - loss 0.56572747 - time (sec): 0.17 - samples/sec: 4439.20 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,393 epoch 33 - iter 3/3 - loss 0.56014238 - time (sec): 0.20 - samples/sec: 3969.22 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,398 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,399 EPOCH 33 done: loss 0.5601 - lr: 0.100000
2024-03-28 09:08:27,404  - 0 epochs without improvement
2024-03-28 09:08:27,406 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,490 epoch 34 - iter 1/3 - loss 0.56149373 - time (sec): 0.08 - samples/sec: 4800.36 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,571 epoch 34 - iter 2/3 - loss 0.62482820 - time (sec): 0.16 - samples/sec: 4724.24 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,598 epoch 34 - iter 3/3 - loss 0.62822730 - time (sec): 0.19 - samples/sec: 4169.96 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,603 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,605 EPOCH 34 done: loss 0.6282 - lr: 0.100000
2024-03-28 09:08:27,608  - 1 epochs without improvement
2024-03-28 09:08:27,611 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,687 epoch 35 - iter 1/3 - loss 0.52592365 - time (sec): 0.07 - samples/sec: 5121.35 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,776 epoch 35 - iter 2/3 - loss 0.55252095 - time (sec): 0.16 - samples/sec: 4680.28 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,804 epoch 35 - iter 3/3 - loss 0.55461875 - time (sec): 0.19 - samples/sec: 4100.79 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:27,807 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,810 EPOCH 35 done: loss 0.5546 - lr: 0.100000
2024-03-28 09:08:27,812  - 0 epochs without improvement
2024-03-28 09:08:27,814 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:27,909 epoch 36 - iter 1/3 - loss 0.55149670 - time (sec): 0.09 - samples/sec: 3986.62 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,000 epoch 36 - iter 2/3 - loss 0.52226647 - time (sec): 0.18 - samples/sec: 4082.53 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,040 epoch 36 - iter 3/3 - loss 0.51511517 - time (sec): 0.22 - samples/sec: 3472.83 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,045 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,047 EPOCH 36 done: loss 0.5151 - lr: 0.100000
2024-03-28 09:08:28,050  - 0 epochs without improvement
2024-03-28 09:08:28,053 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,146 epoch 37 - iter 1/3 - loss 0.49687234 - time (sec): 0.09 - samples/sec: 4244.26 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,228 epoch 37 - iter 2/3 - loss 0.47490515 - time (sec): 0.17 - samples/sec: 4304.77 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,263 epoch 37 - iter 3/3 - loss 0.48109616 - time (sec): 0.21 - samples/sec: 3743.54 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,265 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,267 EPOCH 37 done: loss 0.4811 - lr: 0.100000
2024-03-28 09:08:28,273  - 0 epochs without improvement
2024-03-28 09:08:28,276 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,371 epoch 38 - iter 1/3 - loss 0.48292834 - time (sec): 0.09 - samples/sec: 4022.76 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,463 epoch 38 - iter 2/3 - loss 0.60736766 - time (sec): 0.19 - samples/sec: 4059.96 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,497 epoch 38 - iter 3/3 - loss 0.60272934 - time (sec): 0.22 - samples/sec: 3552.63 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,499 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,502 EPOCH 38 done: loss 0.6027 - lr: 0.100000
2024-03-28 09:08:28,505  - 1 epochs without improvement
2024-03-28 09:08:28,507 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,608 epoch 39 - iter 1/3 - loss 0.46681475 - time (sec): 0.10 - samples/sec: 4013.80 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,699 epoch 39 - iter 2/3 - loss 0.49050783 - time (sec): 0.19 - samples/sec: 4012.02 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,731 epoch 39 - iter 3/3 - loss 0.48410297 - time (sec): 0.22 - samples/sec: 3515.46 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,736 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,739 EPOCH 39 done: loss 0.4841 - lr: 0.100000
2024-03-28 09:08:28,742  - 2 epochs without improvement
2024-03-28 09:08:28,748 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,850 epoch 40 - iter 1/3 - loss 0.47415373 - time (sec): 0.10 - samples/sec: 3747.88 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,916 epoch 40 - iter 2/3 - loss 0.43865486 - time (sec): 0.17 - samples/sec: 4513.33 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,946 epoch 40 - iter 3/3 - loss 0.44022970 - time (sec): 0.20 - samples/sec: 3985.95 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:28,948 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:28,950 EPOCH 40 done: loss 0.4402 - lr: 0.100000
2024-03-28 09:08:28,953  - 0 epochs without improvement
2024-03-28 09:08:28,955 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,022 epoch 41 - iter 1/3 - loss 0.39505556 - time (sec): 0.06 - samples/sec: 6165.00 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,094 epoch 41 - iter 2/3 - loss 0.45856506 - time (sec): 0.14 - samples/sec: 5563.23 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,121 epoch 41 - iter 3/3 - loss 0.46752058 - time (sec): 0.16 - samples/sec: 4775.73 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,123 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,126 EPOCH 41 done: loss 0.4675 - lr: 0.100000
2024-03-28 09:08:29,129  - 1 epochs without improvement
2024-03-28 09:08:29,132 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,202 epoch 42 - iter 1/3 - loss 0.40104817 - time (sec): 0.07 - samples/sec: 5656.93 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,270 epoch 42 - iter 2/3 - loss 0.44249168 - time (sec): 0.14 - samples/sec: 5554.20 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,295 epoch 42 - iter 3/3 - loss 0.45211151 - time (sec): 0.16 - samples/sec: 4831.38 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,296 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,299 EPOCH 42 done: loss 0.4521 - lr: 0.100000
2024-03-28 09:08:29,301  - 2 epochs without improvement
2024-03-28 09:08:29,303 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,373 epoch 43 - iter 1/3 - loss 0.42974738 - time (sec): 0.07 - samples/sec: 5498.50 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,445 epoch 43 - iter 2/3 - loss 0.48877276 - time (sec): 0.14 - samples/sec: 5430.97 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,471 epoch 43 - iter 3/3 - loss 0.50198705 - time (sec): 0.17 - samples/sec: 4719.28 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,473 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,475 EPOCH 43 done: loss 0.5020 - lr: 0.100000
2024-03-28 09:08:29,477  - 3 epochs without improvement
2024-03-28 09:08:29,480 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,553 epoch 44 - iter 1/3 - loss 0.34393486 - time (sec): 0.07 - samples/sec: 5398.09 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,622 epoch 44 - iter 2/3 - loss 0.42744327 - time (sec): 0.14 - samples/sec: 5421.30 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,646 epoch 44 - iter 3/3 - loss 0.43221926 - time (sec): 0.16 - samples/sec: 4738.09 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,648 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,651 EPOCH 44 done: loss 0.4322 - lr: 0.100000
2024-03-28 09:08:29,654  - 0 epochs without improvement
2024-03-28 09:08:29,657 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,719 epoch 45 - iter 1/3 - loss 0.35080136 - time (sec): 0.06 - samples/sec: 6469.89 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,789 epoch 45 - iter 2/3 - loss 0.45650777 - time (sec): 0.13 - samples/sec: 5890.71 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,822 epoch 45 - iter 3/3 - loss 0.45322049 - time (sec): 0.16 - samples/sec: 4842.83 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,825 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,829 EPOCH 45 done: loss 0.4532 - lr: 0.100000
2024-03-28 09:08:29,832  - 1 epochs without improvement
2024-03-28 09:08:29,835 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:29,911 epoch 46 - iter 1/3 - loss 0.41917917 - time (sec): 0.07 - samples/sec: 5213.37 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:29,981 epoch 46 - iter 2/3 - loss 0.44351342 - time (sec): 0.14 - samples/sec: 5299.67 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,007 epoch 46 - iter 3/3 - loss 0.43757010 - time (sec): 0.17 - samples/sec: 4606.63 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,009 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,012 EPOCH 46 done: loss 0.4376 - lr: 0.100000
2024-03-28 09:08:30,018  - 2 epochs without improvement
2024-03-28 09:08:30,021 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,087 epoch 47 - iter 1/3 - loss 0.43355327 - time (sec): 0.06 - samples/sec: 6018.18 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,151 epoch 47 - iter 2/3 - loss 0.47636440 - time (sec): 0.13 - samples/sec: 5894.94 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,178 epoch 47 - iter 3/3 - loss 0.46188437 - time (sec): 0.15 - samples/sec: 5040.05 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,180 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,182 EPOCH 47 done: loss 0.4619 - lr: 0.100000
2024-03-28 09:08:30,184  - 3 epochs without improvement
2024-03-28 09:08:30,186 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,253 epoch 48 - iter 1/3 - loss 0.41156757 - time (sec): 0.06 - samples/sec: 5842.89 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,320 epoch 48 - iter 2/3 - loss 0.43935062 - time (sec): 0.13 - samples/sec: 5763.82 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,344 epoch 48 - iter 3/3 - loss 0.43557776 - time (sec): 0.16 - samples/sec: 5008.81 - lr: 0.100000 - momentum: 0.000000
2024-03-28 09:08:30,345 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,349 EPOCH 48 done: loss 0.4356 - lr: 0.100000
2024-03-28 09:08:30,352  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.05]
2024-03-28 09:08:30,355 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,421 epoch 49 - iter 1/3 - loss 0.39220638 - time (sec): 0.06 - samples/sec: 6083.85 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,488 epoch 49 - iter 2/3 - loss 0.36613670 - time (sec): 0.13 - samples/sec: 5923.39 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,508 epoch 49 - iter 3/3 - loss 0.36897407 - time (sec): 0.15 - samples/sec: 5215.96 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,510 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,518 EPOCH 49 done: loss 0.3690 - lr: 0.050000
2024-03-28 09:08:30,521  - 0 epochs without improvement
2024-03-28 09:08:30,524 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,591 epoch 50 - iter 1/3 - loss 0.33464823 - time (sec): 0.06 - samples/sec: 5912.48 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,657 epoch 50 - iter 2/3 - loss 0.35002112 - time (sec): 0.13 - samples/sec: 5806.96 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,681 epoch 50 - iter 3/3 - loss 0.35487791 - time (sec): 0.15 - samples/sec: 5046.70 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,682 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,686 EPOCH 50 done: loss 0.3549 - lr: 0.050000
2024-03-28 09:08:30,689  - 0 epochs without improvement
2024-03-28 09:08:30,692 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,758 epoch 51 - iter 1/3 - loss 0.31067178 - time (sec): 0.06 - samples/sec: 6208.08 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,829 epoch 51 - iter 2/3 - loss 0.32343769 - time (sec): 0.13 - samples/sec: 5682.84 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,856 epoch 51 - iter 3/3 - loss 0.31745014 - time (sec): 0.16 - samples/sec: 4862.36 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:30,858 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,861 EPOCH 51 done: loss 0.3175 - lr: 0.050000
2024-03-28 09:08:30,864  - 0 epochs without improvement
2024-03-28 09:08:30,868 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:30,930 epoch 52 - iter 1/3 - loss 0.27665802 - time (sec): 0.06 - samples/sec: 6067.43 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,002 epoch 52 - iter 2/3 - loss 0.30771992 - time (sec): 0.13 - samples/sec: 5702.05 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,029 epoch 52 - iter 3/3 - loss 0.30199688 - time (sec): 0.16 - samples/sec: 4909.62 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,031 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,035 EPOCH 52 done: loss 0.3020 - lr: 0.050000
2024-03-28 09:08:31,038  - 0 epochs without improvement
2024-03-28 09:08:31,040 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,108 epoch 53 - iter 1/3 - loss 0.33830257 - time (sec): 0.07 - samples/sec: 5664.12 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,176 epoch 53 - iter 2/3 - loss 0.33237701 - time (sec): 0.13 - samples/sec: 5624.94 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,206 epoch 53 - iter 3/3 - loss 0.32609021 - time (sec): 0.16 - samples/sec: 4769.13 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,209 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,211 EPOCH 53 done: loss 0.3261 - lr: 0.050000
2024-03-28 09:08:31,212  - 1 epochs without improvement
2024-03-28 09:08:31,213 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,286 epoch 54 - iter 1/3 - loss 0.32231420 - time (sec): 0.07 - samples/sec: 5390.92 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,353 epoch 54 - iter 2/3 - loss 0.29146483 - time (sec): 0.14 - samples/sec: 5477.77 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,377 epoch 54 - iter 3/3 - loss 0.29709430 - time (sec): 0.16 - samples/sec: 4806.07 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,379 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,381 EPOCH 54 done: loss 0.2971 - lr: 0.050000
2024-03-28 09:08:31,383  - 0 epochs without improvement
2024-03-28 09:08:31,386 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,449 epoch 55 - iter 1/3 - loss 0.23736323 - time (sec): 0.06 - samples/sec: 6007.27 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,518 epoch 55 - iter 2/3 - loss 0.27148632 - time (sec): 0.13 - samples/sec: 5795.54 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,544 epoch 55 - iter 3/3 - loss 0.27109961 - time (sec): 0.16 - samples/sec: 4991.08 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,546 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,548 EPOCH 55 done: loss 0.2711 - lr: 0.050000
2024-03-28 09:08:31,551  - 0 epochs without improvement
2024-03-28 09:08:31,553 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,621 epoch 56 - iter 1/3 - loss 0.25377297 - time (sec): 0.07 - samples/sec: 5827.21 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,689 epoch 56 - iter 2/3 - loss 0.22560634 - time (sec): 0.13 - samples/sec: 5653.90 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,714 epoch 56 - iter 3/3 - loss 0.23113600 - time (sec): 0.16 - samples/sec: 4917.05 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,716 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,719 EPOCH 56 done: loss 0.2311 - lr: 0.050000
2024-03-28 09:08:31,721  - 0 epochs without improvement
2024-03-28 09:08:31,723 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,794 epoch 57 - iter 1/3 - loss 0.26957983 - time (sec): 0.07 - samples/sec: 5797.05 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,871 epoch 57 - iter 2/3 - loss 0.25384182 - time (sec): 0.15 - samples/sec: 5161.24 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,895 epoch 57 - iter 3/3 - loss 0.25095547 - time (sec): 0.17 - samples/sec: 4586.07 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:31,897 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,900 EPOCH 57 done: loss 0.2510 - lr: 0.050000
2024-03-28 09:08:31,902  - 1 epochs without improvement
2024-03-28 09:08:31,904 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:31,972 epoch 58 - iter 1/3 - loss 0.27891893 - time (sec): 0.06 - samples/sec: 5933.41 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,036 epoch 58 - iter 2/3 - loss 0.29004808 - time (sec): 0.13 - samples/sec: 5876.73 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,059 epoch 58 - iter 3/3 - loss 0.28334943 - time (sec): 0.15 - samples/sec: 5115.74 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,061 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,063 EPOCH 58 done: loss 0.2833 - lr: 0.050000
2024-03-28 09:08:32,065  - 2 epochs without improvement
2024-03-28 09:08:32,067 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,134 epoch 59 - iter 1/3 - loss 0.24056747 - time (sec): 0.06 - samples/sec: 5868.06 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,202 epoch 59 - iter 2/3 - loss 0.24723329 - time (sec): 0.13 - samples/sec: 5678.02 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,228 epoch 59 - iter 3/3 - loss 0.24669100 - time (sec): 0.16 - samples/sec: 4929.81 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,229 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,231 EPOCH 59 done: loss 0.2467 - lr: 0.050000
2024-03-28 09:08:32,233  - 3 epochs without improvement
2024-03-28 09:08:32,235 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,301 epoch 60 - iter 1/3 - loss 0.36381199 - time (sec): 0.06 - samples/sec: 5833.02 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,369 epoch 60 - iter 2/3 - loss 0.30104175 - time (sec): 0.13 - samples/sec: 5787.29 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,392 epoch 60 - iter 3/3 - loss 0.30143368 - time (sec): 0.15 - samples/sec: 5058.64 - lr: 0.050000 - momentum: 0.000000
2024-03-28 09:08:32,394 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,396 EPOCH 60 done: loss 0.3014 - lr: 0.050000
2024-03-28 09:08:32,398  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.025]
2024-03-28 09:08:32,401 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,468 epoch 61 - iter 1/3 - loss 0.27097580 - time (sec): 0.07 - samples/sec: 5921.59 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,534 epoch 61 - iter 2/3 - loss 0.25132273 - time (sec): 0.13 - samples/sec: 5734.84 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,561 epoch 61 - iter 3/3 - loss 0.24710534 - time (sec): 0.16 - samples/sec: 4935.61 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,563 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,565 EPOCH 61 done: loss 0.2471 - lr: 0.025000
2024-03-28 09:08:32,568  - 1 epochs without improvement
2024-03-28 09:08:32,570 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,635 epoch 62 - iter 1/3 - loss 0.22992023 - time (sec): 0.06 - samples/sec: 6211.06 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,699 epoch 62 - iter 2/3 - loss 0.22807611 - time (sec): 0.13 - samples/sec: 6012.05 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,721 epoch 62 - iter 3/3 - loss 0.22695615 - time (sec): 0.15 - samples/sec: 5270.95 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,722 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,725 EPOCH 62 done: loss 0.2270 - lr: 0.025000
2024-03-28 09:08:32,727  - 0 epochs without improvement
2024-03-28 09:08:32,729 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,793 epoch 63 - iter 1/3 - loss 0.20431773 - time (sec): 0.06 - samples/sec: 6161.30 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,861 epoch 63 - iter 2/3 - loss 0.22571899 - time (sec): 0.13 - samples/sec: 5807.57 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,896 epoch 63 - iter 3/3 - loss 0.22545184 - time (sec): 0.17 - samples/sec: 4699.89 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:32,898 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,900 EPOCH 63 done: loss 0.2255 - lr: 0.025000
2024-03-28 09:08:32,903  - 0 epochs without improvement
2024-03-28 09:08:32,905 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:32,975 epoch 64 - iter 1/3 - loss 0.23394853 - time (sec): 0.07 - samples/sec: 5722.79 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,040 epoch 64 - iter 2/3 - loss 0.21575960 - time (sec): 0.13 - samples/sec: 5699.75 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,064 epoch 64 - iter 3/3 - loss 0.21618913 - time (sec): 0.16 - samples/sec: 4969.07 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,065 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,068 EPOCH 64 done: loss 0.2162 - lr: 0.025000
2024-03-28 09:08:33,070  - 0 epochs without improvement
2024-03-28 09:08:33,073 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,135 epoch 65 - iter 1/3 - loss 0.20376337 - time (sec): 0.06 - samples/sec: 6044.78 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,206 epoch 65 - iter 2/3 - loss 0.22490820 - time (sec): 0.13 - samples/sec: 5746.62 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,230 epoch 65 - iter 3/3 - loss 0.23571646 - time (sec): 0.16 - samples/sec: 4993.89 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,232 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,235 EPOCH 65 done: loss 0.2357 - lr: 0.025000
2024-03-28 09:08:33,237  - 1 epochs without improvement
2024-03-28 09:08:33,239 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,309 epoch 66 - iter 1/3 - loss 0.19798712 - time (sec): 0.07 - samples/sec: 5541.37 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,375 epoch 66 - iter 2/3 - loss 0.23232096 - time (sec): 0.13 - samples/sec: 5608.84 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,401 epoch 66 - iter 3/3 - loss 0.23059462 - time (sec): 0.16 - samples/sec: 4867.50 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,402 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,405 EPOCH 66 done: loss 0.2306 - lr: 0.025000
2024-03-28 09:08:33,408  - 2 epochs without improvement
2024-03-28 09:08:33,409 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,483 epoch 67 - iter 1/3 - loss 0.21222671 - time (sec): 0.07 - samples/sec: 5556.93 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,548 epoch 67 - iter 2/3 - loss 0.23658420 - time (sec): 0.14 - samples/sec: 5581.78 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,573 epoch 67 - iter 3/3 - loss 0.23513228 - time (sec): 0.16 - samples/sec: 4873.20 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,574 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,577 EPOCH 67 done: loss 0.2351 - lr: 0.025000
2024-03-28 09:08:33,579  - 3 epochs without improvement
2024-03-28 09:08:33,582 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,650 epoch 68 - iter 1/3 - loss 0.20024892 - time (sec): 0.07 - samples/sec: 5705.76 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,723 epoch 68 - iter 2/3 - loss 0.24506107 - time (sec): 0.14 - samples/sec: 5452.59 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,746 epoch 68 - iter 3/3 - loss 0.24463388 - time (sec): 0.16 - samples/sec: 4800.57 - lr: 0.025000 - momentum: 0.000000
2024-03-28 09:08:33,748 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,751 EPOCH 68 done: loss 0.2446 - lr: 0.025000
2024-03-28 09:08:33,753  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.0125]
2024-03-28 09:08:33,755 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,820 epoch 69 - iter 1/3 - loss 0.21504179 - time (sec): 0.06 - samples/sec: 6058.29 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:33,886 epoch 69 - iter 2/3 - loss 0.21413674 - time (sec): 0.13 - samples/sec: 5940.03 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:33,919 epoch 69 - iter 3/3 - loss 0.21118687 - time (sec): 0.16 - samples/sec: 4841.80 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:33,921 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,923 EPOCH 69 done: loss 0.2112 - lr: 0.012500
2024-03-28 09:08:33,925  - 0 epochs without improvement
2024-03-28 09:08:33,927 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:33,995 epoch 70 - iter 1/3 - loss 0.22739715 - time (sec): 0.07 - samples/sec: 5485.75 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,061 epoch 70 - iter 2/3 - loss 0.25014319 - time (sec): 0.13 - samples/sec: 5710.43 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,085 epoch 70 - iter 3/3 - loss 0.25337325 - time (sec): 0.16 - samples/sec: 4971.79 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,087 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,090 EPOCH 70 done: loss 0.2534 - lr: 0.012500
2024-03-28 09:08:34,092  - 1 epochs without improvement
2024-03-28 09:08:34,094 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,160 epoch 71 - iter 1/3 - loss 0.24353060 - time (sec): 0.06 - samples/sec: 5866.57 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,228 epoch 71 - iter 2/3 - loss 0.21777601 - time (sec): 0.13 - samples/sec: 5735.40 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,251 epoch 71 - iter 3/3 - loss 0.22126061 - time (sec): 0.16 - samples/sec: 5025.26 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,252 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,254 EPOCH 71 done: loss 0.2213 - lr: 0.012500
2024-03-28 09:08:34,257  - 2 epochs without improvement
2024-03-28 09:08:34,259 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,328 epoch 72 - iter 1/3 - loss 0.19077828 - time (sec): 0.07 - samples/sec: 5835.39 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,390 epoch 72 - iter 2/3 - loss 0.20655965 - time (sec): 0.13 - samples/sec: 5885.98 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,412 epoch 72 - iter 3/3 - loss 0.20427307 - time (sec): 0.15 - samples/sec: 5150.74 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,416 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,419 EPOCH 72 done: loss 0.2043 - lr: 0.012500
2024-03-28 09:08:34,421  - 0 epochs without improvement
2024-03-28 09:08:34,424 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,493 epoch 73 - iter 1/3 - loss 0.20394309 - time (sec): 0.07 - samples/sec: 5939.40 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,555 epoch 73 - iter 2/3 - loss 0.20850289 - time (sec): 0.13 - samples/sec: 5859.13 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,579 epoch 73 - iter 3/3 - loss 0.21953239 - time (sec): 0.15 - samples/sec: 5098.95 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,580 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,583 EPOCH 73 done: loss 0.2195 - lr: 0.012500
2024-03-28 09:08:34,586  - 1 epochs without improvement
2024-03-28 09:08:34,588 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,658 epoch 74 - iter 1/3 - loss 0.24642578 - time (sec): 0.07 - samples/sec: 5595.57 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,722 epoch 74 - iter 2/3 - loss 0.21810751 - time (sec): 0.13 - samples/sec: 5737.34 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,747 epoch 74 - iter 3/3 - loss 0.22412623 - time (sec): 0.16 - samples/sec: 4994.73 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,749 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,751 EPOCH 74 done: loss 0.2241 - lr: 0.012500
2024-03-28 09:08:34,754  - 2 epochs without improvement
2024-03-28 09:08:34,755 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,826 epoch 75 - iter 1/3 - loss 0.20324577 - time (sec): 0.07 - samples/sec: 5632.71 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,891 epoch 75 - iter 2/3 - loss 0.21292139 - time (sec): 0.13 - samples/sec: 5723.80 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,915 epoch 75 - iter 3/3 - loss 0.20921929 - time (sec): 0.16 - samples/sec: 4991.02 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:34,916 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,919 EPOCH 75 done: loss 0.2092 - lr: 0.012500
2024-03-28 09:08:34,921  - 3 epochs without improvement
2024-03-28 09:08:34,923 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:34,996 epoch 76 - iter 1/3 - loss 0.23860086 - time (sec): 0.07 - samples/sec: 5597.94 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:35,061 epoch 76 - iter 2/3 - loss 0.22212134 - time (sec): 0.13 - samples/sec: 5727.10 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:35,085 epoch 76 - iter 3/3 - loss 0.22672753 - time (sec): 0.16 - samples/sec: 5015.31 - lr: 0.012500 - momentum: 0.000000
2024-03-28 09:08:35,086 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,088 EPOCH 76 done: loss 0.2267 - lr: 0.012500
2024-03-28 09:08:35,090  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.00625]
2024-03-28 09:08:35,093 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,157 epoch 77 - iter 1/3 - loss 0.20140748 - time (sec): 0.06 - samples/sec: 5931.86 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,223 epoch 77 - iter 2/3 - loss 0.23543165 - time (sec): 0.13 - samples/sec: 5886.34 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,245 epoch 77 - iter 3/3 - loss 0.22959387 - time (sec): 0.15 - samples/sec: 5180.90 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,246 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,249 EPOCH 77 done: loss 0.2296 - lr: 0.006250
2024-03-28 09:08:35,251  - 1 epochs without improvement
2024-03-28 09:08:35,253 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,319 epoch 78 - iter 1/3 - loss 0.26190517 - time (sec): 0.06 - samples/sec: 5839.55 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,385 epoch 78 - iter 2/3 - loss 0.23953494 - time (sec): 0.13 - samples/sec: 5857.85 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,409 epoch 78 - iter 3/3 - loss 0.23820210 - time (sec): 0.15 - samples/sec: 5083.93 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,411 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,414 EPOCH 78 done: loss 0.2382 - lr: 0.006250
2024-03-28 09:08:35,416  - 2 epochs without improvement
2024-03-28 09:08:35,418 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,488 epoch 79 - iter 1/3 - loss 0.19539345 - time (sec): 0.07 - samples/sec: 5625.49 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,555 epoch 79 - iter 2/3 - loss 0.20920196 - time (sec): 0.13 - samples/sec: 5661.67 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,580 epoch 79 - iter 3/3 - loss 0.21356759 - time (sec): 0.16 - samples/sec: 4902.78 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,581 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,584 EPOCH 79 done: loss 0.2136 - lr: 0.006250
2024-03-28 09:08:35,587  - 3 epochs without improvement
2024-03-28 09:08:35,589 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,652 epoch 80 - iter 1/3 - loss 0.22265025 - time (sec): 0.06 - samples/sec: 6036.04 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,719 epoch 80 - iter 2/3 - loss 0.20324885 - time (sec): 0.13 - samples/sec: 5911.13 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,744 epoch 80 - iter 3/3 - loss 0.20831160 - time (sec): 0.15 - samples/sec: 5091.36 - lr: 0.006250 - momentum: 0.000000
2024-03-28 09:08:35,746 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,748 EPOCH 80 done: loss 0.2083 - lr: 0.006250
2024-03-28 09:08:35,751  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.003125]
2024-03-28 09:08:35,753 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,825 epoch 81 - iter 1/3 - loss 0.18628309 - time (sec): 0.07 - samples/sec: 5587.97 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:35,889 epoch 81 - iter 2/3 - loss 0.20686248 - time (sec): 0.13 - samples/sec: 5664.59 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:35,911 epoch 81 - iter 3/3 - loss 0.20672222 - time (sec): 0.16 - samples/sec: 4994.39 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:35,913 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,916 EPOCH 81 done: loss 0.2067 - lr: 0.003125
2024-03-28 09:08:35,918  - 1 epochs without improvement
2024-03-28 09:08:35,920 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:35,992 epoch 82 - iter 1/3 - loss 0.19897849 - time (sec): 0.07 - samples/sec: 5515.18 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,072 epoch 82 - iter 2/3 - loss 0.20879538 - time (sec): 0.15 - samples/sec: 5088.16 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,095 epoch 82 - iter 3/3 - loss 0.20445322 - time (sec): 0.17 - samples/sec: 4533.89 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,097 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,099 EPOCH 82 done: loss 0.2045 - lr: 0.003125
2024-03-28 09:08:36,102  - 2 epochs without improvement
2024-03-28 09:08:36,104 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,169 epoch 83 - iter 1/3 - loss 0.17155356 - time (sec): 0.06 - samples/sec: 6148.63 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,234 epoch 83 - iter 2/3 - loss 0.19129354 - time (sec): 0.13 - samples/sec: 5926.47 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,258 epoch 83 - iter 3/3 - loss 0.19397805 - time (sec): 0.15 - samples/sec: 5158.06 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,259 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,262 EPOCH 83 done: loss 0.1940 - lr: 0.003125
2024-03-28 09:08:36,264  - 0 epochs without improvement
2024-03-28 09:08:36,267 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,335 epoch 84 - iter 1/3 - loss 0.19546600 - time (sec): 0.07 - samples/sec: 5642.23 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,403 epoch 84 - iter 2/3 - loss 0.20660319 - time (sec): 0.13 - samples/sec: 5619.65 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,430 epoch 84 - iter 3/3 - loss 0.20619125 - time (sec): 0.16 - samples/sec: 4857.78 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,432 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,434 EPOCH 84 done: loss 0.2062 - lr: 0.003125
2024-03-28 09:08:36,436  - 1 epochs without improvement
2024-03-28 09:08:36,438 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,505 epoch 85 - iter 1/3 - loss 0.21508853 - time (sec): 0.06 - samples/sec: 5964.38 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,575 epoch 85 - iter 2/3 - loss 0.20740067 - time (sec): 0.13 - samples/sec: 5656.74 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,598 epoch 85 - iter 3/3 - loss 0.20411952 - time (sec): 0.16 - samples/sec: 4966.24 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,599 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,602 EPOCH 85 done: loss 0.2041 - lr: 0.003125
2024-03-28 09:08:36,604  - 2 epochs without improvement
2024-03-28 09:08:36,606 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,673 epoch 86 - iter 1/3 - loss 0.23413151 - time (sec): 0.07 - samples/sec: 5765.88 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,738 epoch 86 - iter 2/3 - loss 0.21575944 - time (sec): 0.13 - samples/sec: 5775.19 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,765 epoch 86 - iter 3/3 - loss 0.22781102 - time (sec): 0.16 - samples/sec: 4949.48 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,769 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,771 EPOCH 86 done: loss 0.2278 - lr: 0.003125
2024-03-28 09:08:36,773  - 3 epochs without improvement
2024-03-28 09:08:36,775 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,843 epoch 87 - iter 1/3 - loss 0.19966387 - time (sec): 0.07 - samples/sec: 5803.18 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,907 epoch 87 - iter 2/3 - loss 0.19670426 - time (sec): 0.13 - samples/sec: 5831.49 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,934 epoch 87 - iter 3/3 - loss 0.19452721 - time (sec): 0.16 - samples/sec: 4979.61 - lr: 0.003125 - momentum: 0.000000
2024-03-28 09:08:36,936 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:36,938 EPOCH 87 done: loss 0.1945 - lr: 0.003125
2024-03-28 09:08:36,940  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.0015625]
2024-03-28 09:08:36,942 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,016 epoch 88 - iter 1/3 - loss 0.22008498 - time (sec): 0.07 - samples/sec: 5298.35 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,083 epoch 88 - iter 2/3 - loss 0.19665170 - time (sec): 0.14 - samples/sec: 5437.93 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,108 epoch 88 - iter 3/3 - loss 0.20012108 - time (sec): 0.16 - samples/sec: 4768.16 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,109 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,112 EPOCH 88 done: loss 0.2001 - lr: 0.001563
2024-03-28 09:08:37,114  - 1 epochs without improvement
2024-03-28 09:08:37,117 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,189 epoch 89 - iter 1/3 - loss 0.18575605 - time (sec): 0.07 - samples/sec: 5497.97 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,256 epoch 89 - iter 2/3 - loss 0.18895481 - time (sec): 0.14 - samples/sec: 5476.09 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,281 epoch 89 - iter 3/3 - loss 0.18614399 - time (sec): 0.16 - samples/sec: 4788.19 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,283 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,285 EPOCH 89 done: loss 0.1861 - lr: 0.001563
2024-03-28 09:08:37,287  - 0 epochs without improvement
2024-03-28 09:08:37,290 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,360 epoch 90 - iter 1/3 - loss 0.20966875 - time (sec): 0.07 - samples/sec: 5526.63 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,429 epoch 90 - iter 2/3 - loss 0.18502192 - time (sec): 0.14 - samples/sec: 5561.81 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,454 epoch 90 - iter 3/3 - loss 0.18437769 - time (sec): 0.16 - samples/sec: 4814.59 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,456 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,459 EPOCH 90 done: loss 0.1844 - lr: 0.001563
2024-03-28 09:08:37,461  - 0 epochs without improvement
2024-03-28 09:08:37,463 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,542 epoch 91 - iter 1/3 - loss 0.21700736 - time (sec): 0.08 - samples/sec: 5196.77 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,608 epoch 91 - iter 2/3 - loss 0.21150135 - time (sec): 0.14 - samples/sec: 5288.80 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,633 epoch 91 - iter 3/3 - loss 0.21616639 - time (sec): 0.17 - samples/sec: 4642.85 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,634 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,637 EPOCH 91 done: loss 0.2162 - lr: 0.001563
2024-03-28 09:08:37,640  - 1 epochs without improvement
2024-03-28 09:08:37,642 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,712 epoch 92 - iter 1/3 - loss 0.17263027 - time (sec): 0.07 - samples/sec: 5502.15 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,779 epoch 92 - iter 2/3 - loss 0.20051331 - time (sec): 0.13 - samples/sec: 5610.57 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,804 epoch 92 - iter 3/3 - loss 0.19621649 - time (sec): 0.16 - samples/sec: 4883.81 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,806 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,808 EPOCH 92 done: loss 0.1962 - lr: 0.001563
2024-03-28 09:08:37,810  - 2 epochs without improvement
2024-03-28 09:08:37,812 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,876 epoch 93 - iter 1/3 - loss 0.16124828 - time (sec): 0.06 - samples/sec: 5986.96 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,940 epoch 93 - iter 2/3 - loss 0.20056266 - time (sec): 0.13 - samples/sec: 5938.83 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,967 epoch 93 - iter 3/3 - loss 0.19693890 - time (sec): 0.15 - samples/sec: 5078.24 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:37,969 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:37,971 EPOCH 93 done: loss 0.1969 - lr: 0.001563
2024-03-28 09:08:37,974  - 3 epochs without improvement
2024-03-28 09:08:37,975 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,049 epoch 94 - iter 1/3 - loss 0.19580606 - time (sec): 0.07 - samples/sec: 5265.94 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:38,114 epoch 94 - iter 2/3 - loss 0.20014706 - time (sec): 0.14 - samples/sec: 5594.41 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:38,144 epoch 94 - iter 3/3 - loss 0.19673995 - time (sec): 0.17 - samples/sec: 4713.15 - lr: 0.001563 - momentum: 0.000000
2024-03-28 09:08:38,145 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,148 EPOCH 94 done: loss 0.1967 - lr: 0.001563
2024-03-28 09:08:38,150  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.00078125]
2024-03-28 09:08:38,152 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,220 epoch 95 - iter 1/3 - loss 0.21017605 - time (sec): 0.07 - samples/sec: 5624.92 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,289 epoch 95 - iter 2/3 - loss 0.19381217 - time (sec): 0.14 - samples/sec: 5552.42 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,317 epoch 95 - iter 3/3 - loss 0.19578398 - time (sec): 0.16 - samples/sec: 4797.42 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,318 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,321 EPOCH 95 done: loss 0.1958 - lr: 0.000781
2024-03-28 09:08:38,323  - 1 epochs without improvement
2024-03-28 09:08:38,325 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,391 epoch 96 - iter 1/3 - loss 0.15655139 - time (sec): 0.06 - samples/sec: 5790.11 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,461 epoch 96 - iter 2/3 - loss 0.19074659 - time (sec): 0.13 - samples/sec: 5643.29 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,486 epoch 96 - iter 3/3 - loss 0.18613870 - time (sec): 0.16 - samples/sec: 4899.18 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,487 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,490 EPOCH 96 done: loss 0.1861 - lr: 0.000781
2024-03-28 09:08:38,493  - 2 epochs without improvement
2024-03-28 09:08:38,495 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,560 epoch 97 - iter 1/3 - loss 0.18081108 - time (sec): 0.06 - samples/sec: 5971.45 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,628 epoch 97 - iter 2/3 - loss 0.18019803 - time (sec): 0.13 - samples/sec: 5857.20 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,654 epoch 97 - iter 3/3 - loss 0.17897194 - time (sec): 0.16 - samples/sec: 5010.46 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,655 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,658 EPOCH 97 done: loss 0.1790 - lr: 0.000781
2024-03-28 09:08:38,660  - 0 epochs without improvement
2024-03-28 09:08:38,663 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,729 epoch 98 - iter 1/3 - loss 0.19285375 - time (sec): 0.06 - samples/sec: 5864.54 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,796 epoch 98 - iter 2/3 - loss 0.18586519 - time (sec): 0.13 - samples/sec: 5727.76 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,821 epoch 98 - iter 3/3 - loss 0.18403817 - time (sec): 0.16 - samples/sec: 4977.22 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:38,823 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,825 EPOCH 98 done: loss 0.1840 - lr: 0.000781
2024-03-28 09:08:38,828  - 1 epochs without improvement
2024-03-28 09:08:38,830 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:38,920 epoch 99 - iter 1/3 - loss 0.22237257 - time (sec): 0.09 - samples/sec: 4446.36 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,009 epoch 99 - iter 2/3 - loss 0.20208282 - time (sec): 0.18 - samples/sec: 4306.20 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,044 epoch 99 - iter 3/3 - loss 0.20092824 - time (sec): 0.21 - samples/sec: 3689.11 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,046 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,048 EPOCH 99 done: loss 0.2009 - lr: 0.000781
2024-03-28 09:08:39,050  - 2 epochs without improvement
2024-03-28 09:08:39,052 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,138 epoch 100 - iter 1/3 - loss 0.19105241 - time (sec): 0.08 - samples/sec: 4345.36 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,232 epoch 100 - iter 2/3 - loss 0.19192969 - time (sec): 0.18 - samples/sec: 4216.99 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,262 epoch 100 - iter 3/3 - loss 0.18671563 - time (sec): 0.21 - samples/sec: 3743.15 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,264 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,266 EPOCH 100 done: loss 0.1867 - lr: 0.000781
2024-03-28 09:08:39,268  - 3 epochs without improvement
2024-03-28 09:08:39,270 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,363 epoch 101 - iter 1/3 - loss 0.23572141 - time (sec): 0.09 - samples/sec: 4301.97 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,448 epoch 101 - iter 2/3 - loss 0.21389694 - time (sec): 0.17 - samples/sec: 4428.51 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,479 epoch 101 - iter 3/3 - loss 0.21106680 - time (sec): 0.20 - samples/sec: 3867.97 - lr: 0.000781 - momentum: 0.000000
2024-03-28 09:08:39,483 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,486 EPOCH 101 done: loss 0.2111 - lr: 0.000781
2024-03-28 09:08:39,490  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.000390625]
2024-03-28 09:08:39,495 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,578 epoch 102 - iter 1/3 - loss 0.20411731 - time (sec): 0.08 - samples/sec: 4631.87 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:39,671 epoch 102 - iter 2/3 - loss 0.17361511 - time (sec): 0.17 - samples/sec: 4339.71 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:39,700 epoch 102 - iter 3/3 - loss 0.17970168 - time (sec): 0.20 - samples/sec: 3836.73 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:39,702 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,705 EPOCH 102 done: loss 0.1797 - lr: 0.000391
2024-03-28 09:08:39,708  - 1 epochs without improvement
2024-03-28 09:08:39,711 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,799 epoch 103 - iter 1/3 - loss 0.17368401 - time (sec): 0.09 - samples/sec: 4284.76 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:39,884 epoch 103 - iter 2/3 - loss 0.22948218 - time (sec): 0.17 - samples/sec: 4427.71 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:39,911 epoch 103 - iter 3/3 - loss 0.23261170 - time (sec): 0.20 - samples/sec: 3924.92 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:39,913 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:39,915 EPOCH 103 done: loss 0.2326 - lr: 0.000391
2024-03-28 09:08:39,918  - 2 epochs without improvement
2024-03-28 09:08:39,920 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,002 epoch 104 - iter 1/3 - loss 0.18409026 - time (sec): 0.08 - samples/sec: 4455.73 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:40,092 epoch 104 - iter 2/3 - loss 0.21870441 - time (sec): 0.17 - samples/sec: 4451.58 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:40,127 epoch 104 - iter 3/3 - loss 0.21383352 - time (sec): 0.20 - samples/sec: 3802.98 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:40,130 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,131 EPOCH 104 done: loss 0.2138 - lr: 0.000391
2024-03-28 09:08:40,133  - 3 epochs without improvement
2024-03-28 09:08:40,134 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,227 epoch 105 - iter 1/3 - loss 0.20639474 - time (sec): 0.09 - samples/sec: 4308.96 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:40,329 epoch 105 - iter 2/3 - loss 0.21218268 - time (sec): 0.19 - samples/sec: 3934.47 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:40,357 epoch 105 - iter 3/3 - loss 0.21193148 - time (sec): 0.22 - samples/sec: 3521.54 - lr: 0.000391 - momentum: 0.000000
2024-03-28 09:08:40,360 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,362 EPOCH 105 done: loss 0.2119 - lr: 0.000391
2024-03-28 09:08:40,366  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [0.0001953125]
2024-03-28 09:08:40,368 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,468 epoch 106 - iter 1/3 - loss 0.16320036 - time (sec): 0.10 - samples/sec: 3747.98 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:40,565 epoch 106 - iter 2/3 - loss 0.17305550 - time (sec): 0.19 - samples/sec: 3911.81 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:40,593 epoch 106 - iter 3/3 - loss 0.17119106 - time (sec): 0.22 - samples/sec: 3498.36 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:40,595 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,597 EPOCH 106 done: loss 0.1712 - lr: 0.000195
2024-03-28 09:08:40,602  - 0 epochs without improvement
2024-03-28 09:08:40,606 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,713 epoch 107 - iter 1/3 - loss 0.20166751 - time (sec): 0.10 - samples/sec: 3547.84 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:40,807 epoch 107 - iter 2/3 - loss 0.17208012 - time (sec): 0.20 - samples/sec: 3844.10 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:40,844 epoch 107 - iter 3/3 - loss 0.17909875 - time (sec): 0.23 - samples/sec: 3328.13 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:40,848 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,851 EPOCH 107 done: loss 0.1791 - lr: 0.000195
2024-03-28 09:08:40,855  - 1 epochs without improvement
2024-03-28 09:08:40,857 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:40,961 epoch 108 - iter 1/3 - loss 0.19488302 - time (sec): 0.10 - samples/sec: 3733.82 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,054 epoch 108 - iter 2/3 - loss 0.17380854 - time (sec): 0.20 - samples/sec: 3831.73 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,096 epoch 108 - iter 3/3 - loss 0.17624595 - time (sec): 0.24 - samples/sec: 3278.64 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,101 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:41,102 EPOCH 108 done: loss 0.1762 - lr: 0.000195
2024-03-28 09:08:41,104  - 2 epochs without improvement
2024-03-28 09:08:41,106 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:41,217 epoch 109 - iter 1/3 - loss 0.21638976 - time (sec): 0.11 - samples/sec: 3421.91 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,289 epoch 109 - iter 2/3 - loss 0.20551143 - time (sec): 0.18 - samples/sec: 4150.51 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,318 epoch 109 - iter 3/3 - loss 0.21160093 - time (sec): 0.21 - samples/sec: 3709.13 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,319 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:41,322 EPOCH 109 done: loss 0.2116 - lr: 0.000195
2024-03-28 09:08:41,325  - 3 epochs without improvement
2024-03-28 09:08:41,327 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:41,398 epoch 110 - iter 1/3 - loss 0.19369786 - time (sec): 0.07 - samples/sec: 5551.41 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,471 epoch 110 - iter 2/3 - loss 0.19350566 - time (sec): 0.14 - samples/sec: 5283.12 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,501 epoch 110 - iter 3/3 - loss 0.19654441 - time (sec): 0.17 - samples/sec: 4532.03 - lr: 0.000195 - momentum: 0.000000
2024-03-28 09:08:41,503 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:41,506 EPOCH 110 done: loss 0.1965 - lr: 0.000195
2024-03-28 09:08:41,509  - 4 epochs without improvement (above 'patience')-> annealing learning_rate to [9.765625e-05]
2024-03-28 09:08:41,512 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:41,517 learning rate too small - quitting training!
2024-03-28 09:08:41,519 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:41,520 Saving model ...
2024-03-28 09:08:43,132 Done.
2024-03-28 09:08:43,136 ----------------------------------------------------------------------------------------------------
2024-03-28 09:08:43,140 Testing using last state of model ...
2024-03-28 09:08:43,249 
Results:
- F-score (micro) 0.9524
- F-score (macro) 0.9333
- Accuracy 0.9091

By class:
              precision    recall  f1-score   support

        NAME     1.0000    1.0000    1.0000         3
    GCNUMBER     1.0000    1.0000    1.0000         3
    LOCATION     1.0000    1.0000    1.0000         2
         ORG     1.0000    0.5000    0.6667         2
     COUNTRY     1.0000    1.0000    1.0000         1

   micro avg     1.0000    0.9091    0.9524        11
   macro avg     1.0000    0.9000    0.9333        11
weighted avg     1.0000    0.9091    0.9394        11

2024-03-28 09:08:43,252 ----------------------------------------------------------------------------------------------------
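
Note on the averages reported above: the micro, macro and weighted F-scores can be re-derived from the per-class rows of the "By class" table. The sketch below is not part of the training run's output. It assumes the per-class precision, recall and support values copied from the table, and the helper names (`ROWS`, `f1`, `summarize`) are hypothetical, chosen only for illustration.

```python
# Rows copied from the "By class" table: (label, precision, recall, support).
ROWS = [
    ("NAME", 1.0, 1.0, 3),
    ("GCNUMBER", 1.0, 1.0, 3),
    ("LOCATION", 1.0, 1.0, 2),
    ("ORG", 1.0, 0.5, 2),
    ("COUNTRY", 1.0, 1.0, 1),
]


def f1(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def summarize(rows):
    """Recompute micro, macro and weighted F1 from per-class rows."""
    total_support = sum(support for _, _, _, support in rows)

    # Recover raw counts: true positives from recall * support, and the
    # number of predicted spans from tp / precision. Assumption: a class
    # with precision 0 is treated as having no predictions, because its
    # prediction count cannot be recovered from the table alone.
    true_pos = [round(r * s) for _, _, r, s in rows]
    predicted = [round(tp / p) if p else 0
                 for tp, (_, p, _, _) in zip(true_pos, rows)]

    micro_p = sum(true_pos) / sum(predicted)
    micro_r = sum(true_pos) / total_support

    per_class_f1 = [f1(p, r) for _, p, r, _ in rows]
    macro_f1 = sum(per_class_f1) / len(rows)
    weighted_f1 = sum(
        score * s for score, (_, _, _, s) in zip(per_class_f1, rows)
    ) / total_support

    return {
        "micro_f1": f1(micro_p, micro_r),
        "macro_f1": macro_f1,
        "weighted_f1": weighted_f1,
    }


if __name__ == "__main__":
    for name, value in summarize(ROWS).items():
        print(f"{name}: {value:.4f}")
    # micro_f1: 0.9524
    # macro_f1: 0.9333
    # weighted_f1: 0.9394
```

The single missed ORG span (recall 0.5 on a support of 2) accounts for every gap: it lowers micro recall to 10/11, and it pulls the macro average down further than the micro average because ORG counts as one fifth of the classes but only 2 of the 11 spans.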