{"current_steps": 10, "total_steps": 1875, "loss": 2.0846, "lr": 9.999298177883903e-05, "epoch": 0.016, "percentage": 0.53, "elapsed_time": "0:01:57", "remaining_time": "6:04:32"}
{"current_steps": 20, "total_steps": 1875, "loss": 1.0283, "lr": 9.997192908557323e-05, "epoch": 0.032, "percentage": 1.07, "elapsed_time": "0:03:31", "remaining_time": "5:27:42"}
{"current_steps": 30, "total_steps": 1875, "loss": 0.686, "lr": 9.993684783030088e-05, "epoch": 0.048, "percentage": 1.6, "elapsed_time": "0:05:04", "remaining_time": "5:12:01"}
{"current_steps": 40, "total_steps": 1875, "loss": 0.587, "lr": 9.988774786134234e-05, "epoch": 0.064, "percentage": 2.13, "elapsed_time": "0:06:36", "remaining_time": "5:03:25"}
{"current_steps": 50, "total_steps": 1875, "loss": 0.5667, "lr": 9.982464296247522e-05, "epoch": 0.08, "percentage": 2.67, "elapsed_time": "0:08:11", "remaining_time": "4:58:41"}
{"current_steps": 60, "total_steps": 1875, "loss": 0.5057, "lr": 9.974755084906502e-05, "epoch": 0.096, "percentage": 3.2, "elapsed_time": "0:09:44", "remaining_time": "4:54:32"}
{"current_steps": 70, "total_steps": 1875, "loss": 0.5095, "lr": 9.965649316309178e-05, "epoch": 0.112, "percentage": 3.73, "elapsed_time": "0:11:16", "remaining_time": "4:50:44"}
{"current_steps": 80, "total_steps": 1875, "loss": 0.5589, "lr": 9.955149546707465e-05, "epoch": 0.128, "percentage": 4.27, "elapsed_time": "0:12:49", "remaining_time": "4:47:43"}
{"current_steps": 90, "total_steps": 1875, "loss": 0.4037, "lr": 9.94325872368957e-05, "epoch": 0.144, "percentage": 4.8, "elapsed_time": "0:14:21", "remaining_time": "4:44:50"}
{"current_steps": 100, "total_steps": 1875, "loss": 0.476, "lr": 9.929980185352526e-05, "epoch": 0.16, "percentage": 5.33, "elapsed_time": "0:15:54", "remaining_time": "4:42:19"}
{"current_steps": 110, "total_steps": 1875, "loss": 0.4732, "lr": 9.915317659365077e-05, "epoch": 0.176, "percentage": 5.87, "elapsed_time": "0:17:26", "remaining_time": "4:39:47"}
{"current_steps": 120, "total_steps": 1875, "loss": 0.4808, "lr": 9.899275261921234e-05, "epoch": 0.192, "percentage": 6.4, "elapsed_time": "0:18:58", "remaining_time": "4:37:37"}
{"current_steps": 130, "total_steps": 1875, "loss": 0.4359, "lr": 9.881857496584726e-05, "epoch": 0.208, "percentage": 6.93, "elapsed_time": "0:20:32", "remaining_time": "4:35:49"}
{"current_steps": 140, "total_steps": 1875, "loss": 0.377, "lr": 9.863069253024719e-05, "epoch": 0.224, "percentage": 7.47, "elapsed_time": "0:22:05", "remaining_time": "4:33:47"}
{"current_steps": 150, "total_steps": 1875, "loss": 0.4143, "lr": 9.842915805643155e-05, "epoch": 0.24, "percentage": 8.0, "elapsed_time": "0:23:37", "remaining_time": "4:31:46"}
{"current_steps": 160, "total_steps": 1875, "loss": 0.4396, "lr": 9.821402812094073e-05, "epoch": 0.256, "percentage": 8.53, "elapsed_time": "0:25:10", "remaining_time": "4:29:48"}
{"current_steps": 170, "total_steps": 1875, "loss": 0.4219, "lr": 9.798536311695334e-05, "epoch": 0.272, "percentage": 9.07, "elapsed_time": "0:26:42", "remaining_time": "4:27:53"}
{"current_steps": 180, "total_steps": 1875, "loss": 0.4189, "lr": 9.774322723733216e-05, "epoch": 0.288, "percentage": 9.6, "elapsed_time": "0:28:14", "remaining_time": "4:26:00"}
{"current_steps": 190, "total_steps": 1875, "loss": 0.399, "lr": 9.748768845660334e-05, "epoch": 0.304, "percentage": 10.13, "elapsed_time": "0:29:48", "remaining_time": "4:24:19"}
{"current_steps": 200, "total_steps": 1875, "loss": 0.434, "lr": 9.721881851187406e-05, "epoch": 0.32, "percentage": 10.67, "elapsed_time": "0:31:21", "remaining_time": "4:22:37"}
{"current_steps": 210, "total_steps": 1875, "loss": 0.3684, "lr": 9.693669288269372e-05, "epoch": 0.336, "percentage": 11.2, "elapsed_time": "0:32:53", "remaining_time": "4:20:49"}
{"current_steps": 220, "total_steps": 1875, "loss": 0.3468, "lr": 9.664139076986473e-05, "epoch": 0.352, "percentage": 11.73, "elapsed_time": "0:34:26", "remaining_time": "4:19:02"}
{"current_steps": 230, "total_steps": 1875, "loss": 0.4119, "lr": 9.63329950732086e-05, "epoch": 0.368, "percentage": 12.27, "elapsed_time": "0:35:58", "remaining_time": "4:17:19"}
{"current_steps": 240, "total_steps": 1875, "loss": 0.3904, "lr": 9.601159236829352e-05, "epoch": 0.384, "percentage": 12.8, "elapsed_time": "0:37:31", "remaining_time": "4:15:36"}
{"current_steps": 250, "total_steps": 1875, "loss": 0.423, "lr": 9.567727288213005e-05, "epoch": 0.4, "percentage": 13.33, "elapsed_time": "0:39:03", "remaining_time": "4:13:53"}
{"current_steps": 260, "total_steps": 1875, "loss": 0.3761, "lr": 9.533013046784189e-05, "epoch": 0.416, "percentage": 13.87, "elapsed_time": "0:40:35", "remaining_time": "4:12:11"}
{"current_steps": 270, "total_steps": 1875, "loss": 0.3351, "lr": 9.497026257831855e-05, "epoch": 0.432, "percentage": 14.4, "elapsed_time": "0:42:08", "remaining_time": "4:10:29"}
{"current_steps": 280, "total_steps": 1875, "loss": 0.3282, "lr": 9.459777023885755e-05, "epoch": 0.448, "percentage": 14.93, "elapsed_time": "0:43:40", "remaining_time": "4:08:48"}
{"current_steps": 290, "total_steps": 1875, "loss": 0.4125, "lr": 9.421275801880362e-05, "epoch": 0.464, "percentage": 15.47, "elapsed_time": "0:45:12", "remaining_time": "4:07:07"}
{"current_steps": 300, "total_steps": 1875, "loss": 0.3728, "lr": 9.381533400219318e-05, "epoch": 0.48, "percentage": 16.0, "elapsed_time": "0:46:45", "remaining_time": "4:05:27"}
{"current_steps": 310, "total_steps": 1875, "loss": 0.3094, "lr": 9.340560975741197e-05, "epoch": 0.496, "percentage": 16.53, "elapsed_time": "0:48:18", "remaining_time": "4:03:51"}
{"current_steps": 320, "total_steps": 1875, "loss": 0.3853, "lr": 9.298370030587456e-05, "epoch": 0.512, "percentage": 17.07, "elapsed_time": "0:49:50", "remaining_time": "4:02:13"}
{"current_steps": 330, "total_steps": 1875, "loss": 0.3391, "lr": 9.254972408973461e-05, "epoch": 0.528, "percentage": 17.6, "elapsed_time": "0:51:23", "remaining_time": "4:00:37"}
{"current_steps": 340, "total_steps": 1875, "loss": 0.397, "lr": 9.210380293863462e-05, "epoch": 0.544, "percentage": 18.13, "elapsed_time": "0:52:56", "remaining_time": "3:58:59"}
{"current_steps": 350, "total_steps": 1875, "loss": 0.3878, "lr": 9.164606203550497e-05, "epoch": 0.56, "percentage": 18.67, "elapsed_time": "0:54:28", "remaining_time": "3:57:21"}
{"current_steps": 360, "total_steps": 1875, "loss": 0.3415, "lr": 9.117662988142138e-05, "epoch": 0.576, "percentage": 19.2, "elapsed_time": "0:56:00", "remaining_time": "3:55:43"}
{"current_steps": 370, "total_steps": 1875, "loss": 0.4046, "lr": 9.069563825953092e-05, "epoch": 0.592, "percentage": 19.73, "elapsed_time": "0:57:33", "remaining_time": "3:54:06"}
{"current_steps": 380, "total_steps": 1875, "loss": 0.3674, "lr": 9.020322219805674e-05, "epoch": 0.608, "percentage": 20.27, "elapsed_time": "0:59:06", "remaining_time": "3:52:32"}
{"current_steps": 390, "total_steps": 1875, "loss": 0.3478, "lr": 8.969951993239177e-05, "epoch": 0.624, "percentage": 20.8, "elapsed_time": "1:00:38", "remaining_time": "3:50:55"}
{"current_steps": 400, "total_steps": 1875, "loss": 0.375, "lr": 8.9184672866292e-05, "epoch": 0.64, "percentage": 21.33, "elapsed_time": "1:02:11", "remaining_time": "3:49:18"}
{"current_steps": 410, "total_steps": 1875, "loss": 0.381, "lr": 8.865882553218037e-05, "epoch": 0.656, "percentage": 21.87, "elapsed_time": "1:03:43", "remaining_time": "3:47:42"}
{"current_steps": 420, "total_steps": 1875, "loss": 0.3594, "lr": 8.81221255505724e-05, "epoch": 0.672, "percentage": 22.4, "elapsed_time": "1:05:16", "remaining_time": "3:46:08"}
{"current_steps": 430, "total_steps": 1875, "loss": 0.3813, "lr": 8.757472358863481e-05, "epoch": 0.688, "percentage": 22.93, "elapsed_time": "1:06:48", "remaining_time": "3:44:32"}
{"current_steps": 440, "total_steps": 1875, "loss": 0.3614, "lr": 8.701677331788891e-05, "epoch": 0.704, "percentage": 23.47, "elapsed_time": "1:08:21", "remaining_time": "3:42:55"}
{"current_steps": 450, "total_steps": 1875, "loss": 0.3459, "lr": 8.644843137107059e-05, "epoch": 0.72, "percentage": 24.0, "elapsed_time": "1:09:53", "remaining_time": "3:41:19"}
{"current_steps": 460, "total_steps": 1875, "loss": 0.3318, "lr": 8.586985729815894e-05, "epoch": 0.736, "percentage": 24.53, "elapsed_time": "1:11:25", "remaining_time": "3:39:43"}
{"current_steps": 470, "total_steps": 1875, "loss": 0.3727, "lr": 8.528121352158604e-05, "epoch": 0.752, "percentage": 25.07, "elapsed_time": "1:12:58", "remaining_time": "3:38:07"}
{"current_steps": 480, "total_steps": 1875, "loss": 0.3283, "lr": 8.468266529064025e-05, "epoch": 0.768, "percentage": 25.6, "elapsed_time": "1:14:30", "remaining_time": "3:36:31"}
{"current_steps": 490, "total_steps": 1875, "loss": 0.3466, "lr": 8.4074380635076e-05, "epoch": 0.784, "percentage": 26.13, "elapsed_time": "1:16:02", "remaining_time": "3:34:56"}
{"current_steps": 500, "total_steps": 1875, "loss": 0.3554, "lr": 8.345653031794292e-05, "epoch": 0.8, "percentage": 26.67, "elapsed_time": "1:17:34", "remaining_time": "3:33:20"}
{"current_steps": 510, "total_steps": 1875, "loss": 0.3495, "lr": 8.282928778764783e-05, "epoch": 0.816, "percentage": 27.2, "elapsed_time": "1:19:15", "remaining_time": "3:32:09"}
{"current_steps": 520, "total_steps": 1875, "loss": 0.3239, "lr": 8.21928291292627e-05, "epoch": 0.832, "percentage": 27.73, "elapsed_time": "1:20:48", "remaining_time": "3:30:33"}
{"current_steps": 530, "total_steps": 1875, "loss": 0.3288, "lr": 8.154733301509248e-05, "epoch": 0.848, "percentage": 28.27, "elapsed_time": "1:22:20", "remaining_time": "3:28:57"}
{"current_steps": 540, "total_steps": 1875, "loss": 0.2646, "lr": 8.089298065451672e-05, "epoch": 0.864, "percentage": 28.8, "elapsed_time": "1:23:52", "remaining_time": "3:27:21"}
{"current_steps": 550, "total_steps": 1875, "loss": 0.3072, "lr": 8.022995574311876e-05, "epoch": 0.88, "percentage": 29.33, "elapsed_time": "1:25:24", "remaining_time": "3:25:45"}
{"current_steps": 560, "total_steps": 1875, "loss": 0.3707, "lr": 7.95584444111171e-05, "epoch": 0.896, "percentage": 29.87, "elapsed_time": "1:26:56", "remaining_time": "3:24:09"}
{"current_steps": 570, "total_steps": 1875, "loss": 0.3262, "lr": 7.887863517111338e-05, "epoch": 0.912, "percentage": 30.4, "elapsed_time": "1:28:28", "remaining_time": "3:22:33"}
{"current_steps": 580, "total_steps": 1875, "loss": 0.3383, "lr": 7.819071886517134e-05, "epoch": 0.928, "percentage": 30.93, "elapsed_time": "1:30:00", "remaining_time": "3:20:59"}
{"current_steps": 590, "total_steps": 1875, "loss": 0.311, "lr": 7.7494888611242e-05, "epoch": 0.944, "percentage": 31.47, "elapsed_time": "1:31:33", "remaining_time": "3:19:24"}
{"current_steps": 600, "total_steps": 1875, "loss": 0.3093, "lr": 7.679133974894983e-05, "epoch": 0.96, "percentage": 32.0, "elapsed_time": "1:33:05", "remaining_time": "3:17:49"}
{"current_steps": 610, "total_steps": 1875, "loss": 0.3547, "lr": 7.60802697847554e-05, "epoch": 0.976, "percentage": 32.53, "elapsed_time": "1:34:37", "remaining_time": "3:16:14"}
{"current_steps": 620, "total_steps": 1875, "loss": 0.3314, "lr": 7.536187833650947e-05, "epoch": 0.992, "percentage": 33.07, "elapsed_time": "1:36:10", "remaining_time": "3:14:40"}
{"current_steps": 630, "total_steps": 1875, "loss": 0.3445, "lr": 7.463636707741458e-05, "epoch": 1.008, "percentage": 33.6, "elapsed_time": "1:37:42", "remaining_time": "3:13:04"}
{"current_steps": 640, "total_steps": 1875, "loss": 0.2962, "lr": 7.390393967940962e-05, "epoch": 1.024, "percentage": 34.13, "elapsed_time": "1:39:14", "remaining_time": "3:11:29"}
{"current_steps": 650, "total_steps": 1875, "loss": 0.283, "lr": 7.316480175599309e-05, "epoch": 1.04, "percentage": 34.67, "elapsed_time": "1:40:46", "remaining_time": "3:09:55"}
{"current_steps": 660, "total_steps": 1875, "loss": 0.2599, "lr": 7.241916080450163e-05, "epoch": 1.056, "percentage": 35.2, "elapsed_time": "1:42:18", "remaining_time": "3:08:20"}
{"current_steps": 670, "total_steps": 1875, "loss": 0.28, "lr": 7.166722614785937e-05, "epoch": 1.072, "percentage": 35.73, "elapsed_time": "1:43:51", "remaining_time": "3:06:47"}
{"current_steps": 680, "total_steps": 1875, "loss": 0.2348, "lr": 7.090920887581506e-05, "epoch": 1.088, "percentage": 36.27, "elapsed_time": "1:45:23", "remaining_time": "3:05:12"}
{"current_steps": 690, "total_steps": 1875, "loss": 0.2721, "lr": 7.014532178568314e-05, "epoch": 1.104, "percentage": 36.8, "elapsed_time": "1:46:55", "remaining_time": "3:03:38"}
{"current_steps": 700, "total_steps": 1875, "loss": 0.3143, "lr": 6.937577932260515e-05, "epoch": 1.12, "percentage": 37.33, "elapsed_time": "1:48:28", "remaining_time": "3:02:04"}
{"current_steps": 710, "total_steps": 1875, "loss": 0.2943, "lr": 6.860079751934908e-05, "epoch": 1.1360000000000001, "percentage": 37.87, "elapsed_time": "1:50:00", "remaining_time": "3:00:30"}
{"current_steps": 720, "total_steps": 1875, "loss": 0.2593, "lr": 6.782059393566253e-05, "epoch": 1.152, "percentage": 38.4, "elapsed_time": "1:51:32", "remaining_time": "2:58:56"}
{"current_steps": 730, "total_steps": 1875, "loss": 0.3074, "lr": 6.70353875971976e-05, "epoch": 1.168, "percentage": 38.93, "elapsed_time": "1:53:04", "remaining_time": "2:57:21"}
{"current_steps": 740, "total_steps": 1875, "loss": 0.2933, "lr": 6.624539893402382e-05, "epoch": 1.184, "percentage": 39.47, "elapsed_time": "1:54:37", "remaining_time": "2:55:47"}
{"current_steps": 750, "total_steps": 1875, "loss": 0.2836, "lr": 6.545084971874738e-05, "epoch": 1.2, "percentage": 40.0, "elapsed_time": "1:56:09", "remaining_time": "2:54:13"}
{"current_steps": 760, "total_steps": 1875, "loss": 0.3033, "lr": 6.465196300425287e-05, "epoch": 1.216, "percentage": 40.53, "elapsed_time": "1:57:41", "remaining_time": "2:52:39"}
{"current_steps": 770, "total_steps": 1875, "loss": 0.3165, "lr": 6.384896306108612e-05, "epoch": 1.232, "percentage": 41.07, "elapsed_time": "1:59:13", "remaining_time": "2:51:05"}
{"current_steps": 780, "total_steps": 1875, "loss": 0.2612, "lr": 6.304207531449486e-05, "epoch": 1.248, "percentage": 41.6, "elapsed_time": "2:00:45", "remaining_time": "2:49:30"}
{"current_steps": 790, "total_steps": 1875, "loss": 0.2885, "lr": 6.223152628114537e-05, "epoch": 1.264, "percentage": 42.13, "elapsed_time": "2:02:17", "remaining_time": "2:47:57"}
{"current_steps": 800, "total_steps": 1875, "loss": 0.3141, "lr": 6.141754350553279e-05, "epoch": 1.28, "percentage": 42.67, "elapsed_time": "2:03:49", "remaining_time": "2:46:23"}
{"current_steps": 810, "total_steps": 1875, "loss": 0.2858, "lr": 6.0600355496102745e-05, "epoch": 1.296, "percentage": 43.2, "elapsed_time": "2:05:21", "remaining_time": "2:44:49"}
{"current_steps": 820, "total_steps": 1875, "loss": 0.2567, "lr": 5.9780191661102415e-05, "epoch": 1.312, "percentage": 43.73, "elapsed_time": "2:06:54", "remaining_time": "2:43:16"}
{"current_steps": 830, "total_steps": 1875, "loss": 0.2991, "lr": 5.8957282244179124e-05, "epoch": 1.328, "percentage": 44.27, "elapsed_time": "2:08:26", "remaining_time": "2:41:42"}
{"current_steps": 840, "total_steps": 1875, "loss": 0.2465, "lr": 5.813185825974419e-05, "epoch": 1.3439999999999999, "percentage": 44.8, "elapsed_time": "2:09:58", "remaining_time": "2:40:08"}
{"current_steps": 850, "total_steps": 1875, "loss": 0.2682, "lr": 5.730415142812059e-05, "epoch": 1.3599999999999999, "percentage": 45.33, "elapsed_time": "2:11:31", "remaining_time": "2:38:35"}
{"current_steps": 860, "total_steps": 1875, "loss": 0.2987, "lr": 5.6474394110492344e-05, "epoch": 1.376, "percentage": 45.87, "elapsed_time": "2:13:03", "remaining_time": "2:37:02"}
{"current_steps": 870, "total_steps": 1875, "loss": 0.2888, "lr": 5.564281924367408e-05, "epoch": 1.392, "percentage": 46.4, "elapsed_time": "2:14:35", "remaining_time": "2:35:28"}
{"current_steps": 880, "total_steps": 1875, "loss": 0.2946, "lr": 5.480966027471889e-05, "epoch": 1.408, "percentage": 46.93, "elapsed_time": "2:16:07", "remaining_time": "2:33:55"}
{"current_steps": 890, "total_steps": 1875, "loss": 0.282, "lr": 5.3975151095382995e-05, "epoch": 1.424, "percentage": 47.47, "elapsed_time": "2:17:39", "remaining_time": "2:32:21"}
{"current_steps": 900, "total_steps": 1875, "loss": 0.2832, "lr": 5.313952597646568e-05, "epoch": 1.44, "percentage": 48.0, "elapsed_time": "2:19:11", "remaining_time": "2:30:47"}
{"current_steps": 910, "total_steps": 1875, "loss": 0.3014, "lr": 5.230301950204262e-05, "epoch": 1.456, "percentage": 48.53, "elapsed_time": "2:20:43", "remaining_time": "2:29:13"}
{"current_steps": 920, "total_steps": 1875, "loss": 0.2739, "lr": 5.1465866503611426e-05, "epoch": 1.472, "percentage": 49.07, "elapsed_time": "2:22:15", "remaining_time": "2:27:39"}
{"current_steps": 930, "total_steps": 1875, "loss": 0.2696, "lr": 5.062830199416764e-05, "epoch": 1.488, "percentage": 49.6, "elapsed_time": "2:23:46", "remaining_time": "2:26:06"}
{"current_steps": 940, "total_steps": 1875, "loss": 0.2958, "lr": 4.979056110222981e-05, "epoch": 1.504, "percentage": 50.13, "elapsed_time": "2:25:18", "remaining_time": "2:24:32"}
{"current_steps": 950, "total_steps": 1875, "loss": 0.2482, "lr": 4.895287900583216e-05, "epoch": 1.52, "percentage": 50.67, "elapsed_time": "2:26:50", "remaining_time": "2:22:58"}
{"current_steps": 960, "total_steps": 1875, "loss": 0.2789, "lr": 4.811549086650327e-05, "epoch": 1.536, "percentage": 51.2, "elapsed_time": "2:28:22", "remaining_time": "2:21:25"}
{"current_steps": 970, "total_steps": 1875, "loss": 0.2563, "lr": 4.7278631763249554e-05, "epoch": 1.552, "percentage": 51.73, "elapsed_time": "2:29:55", "remaining_time": "2:19:52"}
{"current_steps": 980, "total_steps": 1875, "loss": 0.259, "lr": 4.6442536626561675e-05, "epoch": 1.568, "percentage": 52.27, "elapsed_time": "2:31:27", "remaining_time": "2:18:19"}
{"current_steps": 990, "total_steps": 1875, "loss": 0.2623, "lr": 4.560744017246284e-05, "epoch": 1.584, "percentage": 52.8, "elapsed_time": "2:32:59", "remaining_time": "2:16:46"}
{"current_steps": 1000, "total_steps": 1875, "loss": 0.2867, "lr": 4.477357683661734e-05, "epoch": 1.6, "percentage": 53.33, "elapsed_time": "2:34:32", "remaining_time": "2:15:13"}
{"current_steps": 1010, "total_steps": 1875, "loss": 0.2851, "lr": 4.394118070851749e-05, "epoch": 1.616, "percentage": 53.87, "elapsed_time": "2:36:12", "remaining_time": "2:13:47"}
{"current_steps": 1020, "total_steps": 1875, "loss": 0.269, "lr": 4.31104854657681e-05, "epoch": 1.6320000000000001, "percentage": 54.4, "elapsed_time": "2:37:44", "remaining_time": "2:12:13"}
{"current_steps": 1030, "total_steps": 1875, "loss": 0.2526, "lr": 4.228172430848644e-05, "epoch": 1.6480000000000001, "percentage": 54.93, "elapsed_time": "2:39:16", "remaining_time": "2:10:39"}
{"current_steps": 1040, "total_steps": 1875, "loss": 0.2788, "lr": 4.1455129893836174e-05, "epoch": 1.6640000000000001, "percentage": 55.47, "elapsed_time": "2:40:48", "remaining_time": "2:09:06"}
{"current_steps": 1050, "total_steps": 1875, "loss": 0.2439, "lr": 4.063093427071376e-05, "epoch": 1.6800000000000002, "percentage": 56.0, "elapsed_time": "2:42:19", "remaining_time": "2:07:32"}
{"current_steps": 1060, "total_steps": 1875, "loss": 0.2701, "lr": 3.9809368814605766e-05, "epoch": 1.696, "percentage": 56.53, "elapsed_time": "2:43:51", "remaining_time": "2:05:59"}
{"current_steps": 1070, "total_steps": 1875, "loss": 0.2665, "lr": 3.899066416263493e-05, "epoch": 1.712, "percentage": 57.07, "elapsed_time": "2:45:23", "remaining_time": "2:04:25"}
{"current_steps": 1080, "total_steps": 1875, "loss": 0.3048, "lr": 3.817505014881378e-05, "epoch": 1.728, "percentage": 57.6, "elapsed_time": "2:46:54", "remaining_time": "2:02:52"}
{"current_steps": 1090, "total_steps": 1875, "loss": 0.2298, "lr": 3.736275573952354e-05, "epoch": 1.744, "percentage": 58.13, "elapsed_time": "2:48:26", "remaining_time": "2:01:18"}
{"current_steps": 1100, "total_steps": 1875, "loss": 0.2568, "lr": 3.655400896923672e-05, "epoch": 1.76, "percentage": 58.67, "elapsed_time": "2:49:58", "remaining_time": "1:59:45"}
{"current_steps": 1110, "total_steps": 1875, "loss": 0.2685, "lr": 3.5749036876501194e-05, "epoch": 1.776, "percentage": 59.2, "elapsed_time": "2:51:30", "remaining_time": "1:58:11"}
{"current_steps": 1120, "total_steps": 1875, "loss": 0.2675, "lr": 3.494806544020398e-05, "epoch": 1.792, "percentage": 59.73, "elapsed_time": "2:53:01", "remaining_time": "1:56:38"}
{"current_steps": 1130, "total_steps": 1875, "loss": 0.2539, "lr": 3.4151319516132416e-05, "epoch": 1.808, "percentage": 60.27, "elapsed_time": "2:54:33", "remaining_time": "1:55:05"}
{"current_steps": 1140, "total_steps": 1875, "loss": 0.26, "lr": 3.335902277385067e-05, "epoch": 1.8239999999999998, "percentage": 60.8, "elapsed_time": "2:56:05", "remaining_time": "1:53:32"}
{"current_steps": 1150, "total_steps": 1875, "loss": 0.262, "lr": 3.257139763390925e-05, "epoch": 1.8399999999999999, "percentage": 61.33, "elapsed_time": "2:57:37", "remaining_time": "1:51:58"}
{"current_steps": 1160, "total_steps": 1875, "loss": 0.2381, "lr": 3.178866520540509e-05, "epoch": 1.8559999999999999, "percentage": 61.87, "elapsed_time": "2:59:09", "remaining_time": "1:50:25"}
{"current_steps": 1170, "total_steps": 1875, "loss": 0.2682, "lr": 3.101104522390995e-05, "epoch": 1.8719999999999999, "percentage": 62.4, "elapsed_time": "3:00:40", "remaining_time": "1:48:52"}
{"current_steps": 1180, "total_steps": 1875, "loss": 0.2902, "lr": 3.023875598978419e-05, "epoch": 1.888, "percentage": 62.93, "elapsed_time": "3:02:12", "remaining_time": "1:47:19"}
{"current_steps": 1190, "total_steps": 1875, "loss": 0.2624, "lr": 2.9472014306893603e-05, "epoch": 1.904, "percentage": 63.47, "elapsed_time": "3:03:44", "remaining_time": "1:45:45"}
{"current_steps": 1200, "total_steps": 1875, "loss": 0.2686, "lr": 2.8711035421746367e-05, "epoch": 1.92, "percentage": 64.0, "elapsed_time": "3:05:15", "remaining_time": "1:44:12"}
{"current_steps": 1210, "total_steps": 1875, "loss": 0.2443, "lr": 2.795603296306708e-05, "epoch": 1.936, "percentage": 64.53, "elapsed_time": "3:06:47", "remaining_time": "1:42:39"}
{"current_steps": 1220, "total_steps": 1875, "loss": 0.2752, "lr": 2.7207218881825014e-05, "epoch": 1.952, "percentage": 65.07, "elapsed_time": "3:08:19", "remaining_time": "1:41:06"}
{"current_steps": 1230, "total_steps": 1875, "loss": 0.3108, "lr": 2.6464803391733374e-05, "epoch": 1.968, "percentage": 65.6, "elapsed_time": "3:09:51", "remaining_time": "1:39:33"}
{"current_steps": 1240, "total_steps": 1875, "loss": 0.2517, "lr": 2.5728994910236304e-05, "epoch": 1.984, "percentage": 66.13, "elapsed_time": "3:11:22", "remaining_time": "1:38:00"}
{"current_steps": 1250, "total_steps": 1875, "loss": 0.2494, "lr": 2.500000000000001e-05, "epoch": 2.0, "percentage": 66.67, "elapsed_time": "3:12:54", "remaining_time": "1:36:27"}
{"current_steps": 1260, "total_steps": 1875, "loss": 0.2276, "lr": 2.4278023310924673e-05, "epoch": 2.016, "percentage": 67.2, "elapsed_time": "3:14:26", "remaining_time": "1:34:54"}
{"current_steps": 1270, "total_steps": 1875, "loss": 0.2145, "lr": 2.3563267522693415e-05, "epoch": 2.032, "percentage": 67.73, "elapsed_time": "3:15:57", "remaining_time": "1:33:21"}
{"current_steps": 1280, "total_steps": 1875, "loss": 0.1993, "lr": 2.2855933287874138e-05, "epoch": 2.048, "percentage": 68.27, "elapsed_time": "3:17:29", "remaining_time": "1:31:48"}
{"current_steps": 1290, "total_steps": 1875, "loss": 0.2402, "lr": 2.215621917559062e-05, "epoch": 2.064, "percentage": 68.8, "elapsed_time": "3:19:00", "remaining_time": "1:30:15"}
{"current_steps": 1300, "total_steps": 1875, "loss": 0.198, "lr": 2.1464321615778422e-05, "epoch": 2.08, "percentage": 69.33, "elapsed_time": "3:20:32", "remaining_time": "1:28:42"}
{"current_steps": 1310, "total_steps": 1875, "loss": 0.1847, "lr": 2.07804348440414e-05, "epoch": 2.096, "percentage": 69.87, "elapsed_time": "3:22:04", "remaining_time": "1:27:09"}
{"current_steps": 1320, "total_steps": 1875, "loss": 0.2217, "lr": 2.0104750847124075e-05, "epoch": 2.112, "percentage": 70.4, "elapsed_time": "3:23:36", "remaining_time": "1:25:36"}
{"current_steps": 1330, "total_steps": 1875, "loss": 0.219, "lr": 1.9437459309015427e-05, "epoch": 2.128, "percentage": 70.93, "elapsed_time": "3:25:07", "remaining_time": "1:24:03"}
{"current_steps": 1340, "total_steps": 1875, "loss": 0.2219, "lr": 1.8778747557699224e-05, "epoch": 2.144, "percentage": 71.47, "elapsed_time": "3:26:39", "remaining_time": "1:22:30"}
{"current_steps": 1350, "total_steps": 1875, "loss": 0.2043, "lr": 1.8128800512565513e-05, "epoch": 2.16, "percentage": 72.0, "elapsed_time": "3:28:11", "remaining_time": "1:20:57"}
{"current_steps": 1360, "total_steps": 1875, "loss": 0.1848, "lr": 1.7487800632498545e-05, "epoch": 2.176, "percentage": 72.53, "elapsed_time": "3:29:42", "remaining_time": "1:19:24"}
{"current_steps": 1370, "total_steps": 1875, "loss": 0.2063, "lr": 1.685592786465524e-05, "epoch": 2.192, "percentage": 73.07, "elapsed_time": "3:31:14", "remaining_time": "1:17:51"}
{"current_steps": 1380, "total_steps": 1875, "loss": 0.203, "lr": 1.6233359593948777e-05, "epoch": 2.208, "percentage": 73.6, "elapsed_time": "3:32:46", "remaining_time": "1:16:19"}
{"current_steps": 1390, "total_steps": 1875, "loss": 0.1998, "lr": 1.5620270593251635e-05, "epoch": 2.224, "percentage": 74.13, "elapsed_time": "3:34:17", "remaining_time": "1:14:46"}
{"current_steps": 1400, "total_steps": 1875, "loss": 0.2029, "lr": 1.5016832974331724e-05, "epoch": 2.24, "percentage": 74.67, "elapsed_time": "3:35:49", "remaining_time": "1:13:13"}
{"current_steps": 1410, "total_steps": 1875, "loss": 0.1829, "lr": 1.4423216139535734e-05, "epoch": 2.2560000000000002, "percentage": 75.2, "elapsed_time": "3:37:21", "remaining_time": "1:11:40"}
{"current_steps": 1420, "total_steps": 1875, "loss": 0.2131, "lr": 1.3839586734232906e-05, "epoch": 2.2720000000000002, "percentage": 75.73, "elapsed_time": "3:38:53", "remaining_time": "1:10:08"}
{"current_steps": 1430, "total_steps": 1875, "loss": 0.2099, "lr": 1.3266108600032929e-05, "epoch": 2.288, "percentage": 76.27, "elapsed_time": "3:40:25", "remaining_time": "1:08:35"}
{"current_steps": 1440, "total_steps": 1875, "loss": 0.203, "lr": 1.2702942728790895e-05, "epoch": 2.304, "percentage": 76.8, "elapsed_time": "3:41:56", "remaining_time": "1:07:02"}
{"current_steps": 1450, "total_steps": 1875, "loss": 0.2154, "lr": 1.2150247217412186e-05, "epoch": 2.32, "percentage": 77.33, "elapsed_time": "3:43:28", "remaining_time": "1:05:30"}
{"current_steps": 1460, "total_steps": 1875, "loss": 0.1994, "lr": 1.160817722347014e-05, "epoch": 2.336, "percentage": 77.87, "elapsed_time": "3:44:59", "remaining_time": "1:03:57"}
{"current_steps": 1470, "total_steps": 1875, "loss": 0.2255, "lr": 1.1076884921648834e-05, "epoch": 2.352, "percentage": 78.4, "elapsed_time": "3:46:31", "remaining_time": "1:02:24"}
{"current_steps": 1480, "total_steps": 1875, "loss": 0.2216, "lr": 1.0556519461023301e-05, "epoch": 2.368, "percentage": 78.93, "elapsed_time": "3:48:03", "remaining_time": "1:00:51"}
{"current_steps": 1490, "total_steps": 1875, "loss": 0.1979, "lr": 1.0047226923189024e-05, "epoch": 2.384, "percentage": 79.47, "elapsed_time": "3:49:35", "remaining_time": "0:59:19"}
{"current_steps": 1500, "total_steps": 1875, "loss": 0.2081, "lr": 9.549150281252633e-06, "epoch": 2.4, "percentage": 80.0, "elapsed_time": "3:51:07", "remaining_time": "0:57:46"}