77qq committed on
Commit
c024e94
1 Parent(s): f7423f1

Upload folder using huggingface_hub

Files changed (3)
  1. README.md +1 -1
  2. replay.mp4 +2 -2
  3. sf_log.txt +391 -0
README.md CHANGED
@@ -15,7 +15,7 @@ model-index:
  type: doom_health_gathering_supreme
  metrics:
  - type: mean_reward
- value: 10.15 +/- 5.72
+ value: 10.10 +/- 4.72
  name: mean_reward
  verified: false
  ---
replay.mp4 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b26246a983a0c59ba00e9df2d5ca078ee8e2c72ddcf6046e352ea1ece92313d1
- size 19730427
+ oid sha256:0038fb71ee414fc9494abd994e635c76890bd7239f073b56aa224c2aeb245631
+ size 19219856
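The replay.mp4 change touches only a Git LFS pointer: three lines giving the spec version, the SHA-256 of the real file, and its size in bytes. The sketch below shows how such a pointer can be built and checked. The function names `lfs_pointer` and `matches_pointer` are hypothetical, and the payload is placeholder bytes, not the actual video.

```python
import hashlib


def lfs_pointer(payload: bytes) -> str:
    """Build Git LFS pointer text (spec v1) for the given file contents."""
    oid = hashlib.sha256(payload).hexdigest()
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{oid}\n"
        f"size {len(payload)}\n"
    )


def matches_pointer(payload: bytes, pointer_text: str) -> bool:
    """Return True if the payload's hash and size agree with the pointer."""
    # Each pointer line is "<key> <value>"; split once on the first space.
    fields = dict(
        line.split(" ", 1) for line in pointer_text.splitlines() if line
    )
    expected_oid = "sha256:" + hashlib.sha256(payload).hexdigest()
    return fields.get("oid") == expected_oid and fields.get("size") == str(len(payload))


# Placeholder contents (an assumption for illustration only).
example = b"stand-in bytes, not the real replay.mp4"
pointer = lfs_pointer(example)
print(pointer, end="")
```

Running `matches_pointer` against a downloaded replay.mp4 and the pointer text above would be one way to confirm that the file retrieved is the one this commit refers to.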
sf_log.txt CHANGED
@@ -1090,3 +1090,394 @@ main_loop: 1028.0605
1090
  [2024-10-16 02:53:41,262][00603] Avg episode rewards: #0: 23.852, true rewards: #0: 10.152
1091
  [2024-10-16 02:53:41,263][00603] Avg episode reward: 23.852, avg true_objective: 10.152
1092
  [2024-10-16 02:54:36,945][00603] Replay video saved to /content/train_dir/default_experiment/replay.mp4!
1093
+ [2024-10-16 02:54:51,855][00603] The model has been pushed to https://huggingface.co/77qq/rl_course_vizdoom_health_gathering_supreme
1094
+ [2024-10-16 02:57:20,711][00603] Loading legacy config file train_dir/doom_health_gathering_supreme_2222/cfg.json instead of train_dir/doom_health_gathering_supreme_2222/config.json
1095
+ [2024-10-16 02:57:20,713][00603] Loading existing experiment configuration from train_dir/doom_health_gathering_supreme_2222/config.json
1096
+ [2024-10-16 02:57:20,715][00603] Overriding arg 'experiment' with value 'doom_health_gathering_supreme_2222' passed from command line
1097
+ [2024-10-16 02:57:20,717][00603] Overriding arg 'train_dir' with value 'train_dir' passed from command line
1098
+ [2024-10-16 02:57:20,719][00603] Overriding arg 'num_workers' with value 1 passed from command line
1099
+ [2024-10-16 02:57:20,721][00603] Adding new argument 'lr_adaptive_min'=1e-06 that is not in the saved config file!
1100
+ [2024-10-16 02:57:20,722][00603] Adding new argument 'lr_adaptive_max'=0.01 that is not in the saved config file!
1101
+ [2024-10-16 02:57:20,724][00603] Adding new argument 'env_gpu_observations'=True that is not in the saved config file!
1102
+ [2024-10-16 02:57:20,725][00603] Adding new argument 'no_render'=True that is not in the saved config file!
1103
+ [2024-10-16 02:57:20,726][00603] Adding new argument 'save_video'=True that is not in the saved config file!
1104
+ [2024-10-16 02:57:20,727][00603] Adding new argument 'video_frames'=1000000000.0 that is not in the saved config file!
1105
+ [2024-10-16 02:57:20,728][00603] Adding new argument 'video_name'=None that is not in the saved config file!
1106
+ [2024-10-16 02:57:20,729][00603] Adding new argument 'max_num_frames'=1000000000.0 that is not in the saved config file!
1107
+ [2024-10-16 02:57:20,730][00603] Adding new argument 'max_num_episodes'=10 that is not in the saved config file!
1108
+ [2024-10-16 02:57:20,731][00603] Adding new argument 'push_to_hub'=False that is not in the saved config file!
1109
+ [2024-10-16 02:57:20,732][00603] Adding new argument 'hf_repository'=None that is not in the saved config file!
1110
+ [2024-10-16 02:57:20,733][00603] Adding new argument 'policy_index'=0 that is not in the saved config file!
1111
+ [2024-10-16 02:57:20,734][00603] Adding new argument 'eval_deterministic'=False that is not in the saved config file!
1112
+ [2024-10-16 02:57:20,735][00603] Adding new argument 'train_script'=None that is not in the saved config file!
1113
+ [2024-10-16 02:57:20,736][00603] Adding new argument 'enjoy_script'=None that is not in the saved config file!
1114
+ [2024-10-16 02:57:20,737][00603] Using frameskip 1 and render_action_repeat=4 for evaluation
1115
+ [2024-10-16 02:57:20,770][00603] RunningMeanStd input shape: (3, 72, 128)
1116
+ [2024-10-16 02:57:20,772][00603] RunningMeanStd input shape: (1,)
1117
+ [2024-10-16 02:57:20,783][00603] ConvEncoder: input_channels=3
1118
+ [2024-10-16 02:57:20,828][00603] Conv encoder output size: 512
1119
+ [2024-10-16 02:57:20,829][00603] Policy head output size: 512
1120
+ [2024-10-16 02:57:20,853][00603] Loading state from checkpoint train_dir/doom_health_gathering_supreme_2222/checkpoint_p0/checkpoint_000539850_4422451200.pth...
1121
+ [2024-10-16 02:57:21,290][00603] Num frames 100...
1122
+ [2024-10-16 02:57:21,418][00603] Num frames 200...
1123
+ [2024-10-16 02:57:21,542][00603] Num frames 300...
1124
+ [2024-10-16 02:57:21,661][00603] Num frames 400...
1125
+ [2024-10-16 02:57:21,782][00603] Num frames 500...
1126
+ [2024-10-16 02:57:21,906][00603] Num frames 600...
1127
+ [2024-10-16 02:57:22,054][00603] Num frames 700...
1128
+ [2024-10-16 02:57:22,182][00603] Num frames 800...
1129
+ [2024-10-16 02:57:22,309][00603] Num frames 900...
1130
+ [2024-10-16 02:57:22,433][00603] Num frames 1000...
1131
+ [2024-10-16 02:57:22,561][00603] Num frames 1100...
1132
+ [2024-10-16 02:57:22,680][00603] Num frames 1200...
1133
+ [2024-10-16 02:57:22,805][00603] Num frames 1300...
1134
+ [2024-10-16 02:57:22,935][00603] Num frames 1400...
1135
+ [2024-10-16 02:57:23,058][00603] Num frames 1500...
1136
+ [2024-10-16 02:57:23,180][00603] Num frames 1600...
1137
+ [2024-10-16 02:57:23,303][00603] Num frames 1700...
1138
+ [2024-10-16 02:57:23,428][00603] Num frames 1800...
1139
+ [2024-10-16 02:57:23,560][00603] Num frames 1900...
1140
+ [2024-10-16 02:57:23,683][00603] Num frames 2000...
1141
+ [2024-10-16 02:57:23,826][00603] Num frames 2100...
1142
+ [2024-10-16 02:57:23,879][00603] Avg episode rewards: #0: 62.999, true rewards: #0: 21.000
1143
+ [2024-10-16 02:57:23,881][00603] Avg episode reward: 62.999, avg true_objective: 21.000
1144
+ [2024-10-16 02:57:24,058][00603] Num frames 2200...
1145
+ [2024-10-16 02:57:24,227][00603] Num frames 2300...
1146
+ [2024-10-16 02:57:24,393][00603] Num frames 2400...
1147
+ [2024-10-16 02:57:24,561][00603] Num frames 2500...
1148
+ [2024-10-16 02:57:24,723][00603] Num frames 2600...
1149
+ [2024-10-16 02:57:24,888][00603] Num frames 2700...
1150
+ [2024-10-16 02:57:25,058][00603] Num frames 2800...
1151
+ [2024-10-16 02:57:25,225][00603] Num frames 2900...
1152
+ [2024-10-16 02:57:25,403][00603] Num frames 3000...
1153
+ [2024-10-16 02:57:25,585][00603] Num frames 3100...
1154
+ [2024-10-16 02:57:25,758][00603] Num frames 3200...
1155
+ [2024-10-16 02:57:25,949][00603] Num frames 3300...
1156
+ [2024-10-16 02:57:26,124][00603] Num frames 3400...
1157
+ [2024-10-16 02:57:26,248][00603] Num frames 3500...
1158
+ [2024-10-16 02:57:26,376][00603] Num frames 3600...
1159
+ [2024-10-16 02:57:26,499][00603] Num frames 3700...
1160
+ [2024-10-16 02:57:26,629][00603] Num frames 3800...
1161
+ [2024-10-16 02:57:26,751][00603] Num frames 3900...
1162
+ [2024-10-16 02:57:26,874][00603] Num frames 4000...
1163
+ [2024-10-16 02:57:27,007][00603] Num frames 4100...
1164
+ [2024-10-16 02:57:27,133][00603] Num frames 4200...
1165
+ [2024-10-16 02:57:27,185][00603] Avg episode rewards: #0: 63.999, true rewards: #0: 21.000
1166
+ [2024-10-16 02:57:27,186][00603] Avg episode reward: 63.999, avg true_objective: 21.000
1167
+ [2024-10-16 02:57:27,310][00603] Num frames 4300...
1168
+ [2024-10-16 02:57:27,431][00603] Num frames 4400...
1169
+ [2024-10-16 02:57:27,551][00603] Num frames 4500...
1170
+ [2024-10-16 02:57:27,684][00603] Num frames 4600...
1171
+ [2024-10-16 02:57:27,807][00603] Num frames 4700...
1172
+ [2024-10-16 02:57:27,933][00603] Num frames 4800...
1173
+ [2024-10-16 02:57:28,056][00603] Num frames 4900...
1174
+ [2024-10-16 02:57:28,180][00603] Num frames 5000...
1175
+ [2024-10-16 02:57:28,309][00603] Num frames 5100...
1176
+ [2024-10-16 02:57:28,442][00603] Num frames 5200...
1177
+ [2024-10-16 02:57:28,580][00603] Num frames 5300...
1178
+ [2024-10-16 02:57:28,700][00603] Avg episode rewards: #0: 54.839, true rewards: #0: 17.840
1179
+ [2024-10-16 02:57:28,702][00603] Avg episode reward: 54.839, avg true_objective: 17.840
1180
+ [2024-10-16 02:57:28,764][00603] Num frames 5400...
1181
+ [2024-10-16 02:57:28,888][00603] Num frames 5500...
1182
+ [2024-10-16 02:57:29,022][00603] Num frames 5600...
1183
+ [2024-10-16 02:57:29,146][00603] Num frames 5700...
1184
+ [2024-10-16 02:57:29,268][00603] Num frames 5800...
1185
+ [2024-10-16 02:57:29,391][00603] Num frames 5900...
1186
+ [2024-10-16 02:57:29,512][00603] Num frames 6000...
1187
+ [2024-10-16 02:57:29,635][00603] Num frames 6100...
1188
+ [2024-10-16 02:57:29,766][00603] Num frames 6200...
1189
+ [2024-10-16 02:57:29,889][00603] Num frames 6300...
1190
+ [2024-10-16 02:57:30,015][00603] Num frames 6400...
1191
+ [2024-10-16 02:57:30,135][00603] Num frames 6500...
1192
+ [2024-10-16 02:57:30,255][00603] Num frames 6600...
1193
+ [2024-10-16 02:57:30,383][00603] Num frames 6700...
1194
+ [2024-10-16 02:57:30,505][00603] Num frames 6800...
1195
+ [2024-10-16 02:57:30,627][00603] Num frames 6900...
1196
+ [2024-10-16 02:57:30,760][00603] Num frames 7000...
1197
+ [2024-10-16 02:57:30,881][00603] Num frames 7100...
1198
+ [2024-10-16 02:57:31,015][00603] Num frames 7200...
1199
+ [2024-10-16 02:57:31,138][00603] Num frames 7300...
1200
+ [2024-10-16 02:57:31,265][00603] Num frames 7400...
1201
+ [2024-10-16 02:57:31,385][00603] Avg episode rewards: #0: 57.379, true rewards: #0: 18.630
1202
+ [2024-10-16 02:57:31,387][00603] Avg episode reward: 57.379, avg true_objective: 18.630
1203
+ [2024-10-16 02:57:31,449][00603] Num frames 7500...
1204
+ [2024-10-16 02:57:31,568][00603] Num frames 7600...
1205
+ [2024-10-16 02:57:31,689][00603] Num frames 7700...
1206
+ [2024-10-16 02:57:31,816][00603] Num frames 7800...
1207
+ [2024-10-16 02:57:31,943][00603] Num frames 7900...
1208
+ [2024-10-16 02:57:32,065][00603] Num frames 8000...
1209
+ [2024-10-16 02:57:32,183][00603] Num frames 8100...
1210
+ [2024-10-16 02:57:32,312][00603] Num frames 8200...
1211
+ [2024-10-16 02:57:32,436][00603] Num frames 8300...
1212
+ [2024-10-16 02:57:32,561][00603] Num frames 8400...
1213
+ [2024-10-16 02:57:32,683][00603] Num frames 8500...
1214
+ [2024-10-16 02:57:32,814][00603] Num frames 8600...
1215
+ [2024-10-16 02:57:32,947][00603] Num frames 8700...
1216
+ [2024-10-16 02:57:33,071][00603] Num frames 8800...
1217
+ [2024-10-16 02:57:33,195][00603] Num frames 8900...
1218
+ [2024-10-16 02:57:33,320][00603] Num frames 9000...
1219
+ [2024-10-16 02:57:33,448][00603] Num frames 9100...
1220
+ [2024-10-16 02:57:33,576][00603] Num frames 9200...
1221
+ [2024-10-16 02:57:33,697][00603] Num frames 9300...
1222
+ [2024-10-16 02:57:33,831][00603] Num frames 9400...
1223
+ [2024-10-16 02:57:33,965][00603] Num frames 9500...
1224
+ [2024-10-16 02:57:34,084][00603] Avg episode rewards: #0: 58.303, true rewards: #0: 19.104
1225
+ [2024-10-16 02:57:34,086][00603] Avg episode reward: 58.303, avg true_objective: 19.104
1226
+ [2024-10-16 02:57:34,146][00603] Num frames 9600...
1227
+ [2024-10-16 02:57:34,274][00603] Num frames 9700...
1228
+ [2024-10-16 02:57:34,405][00603] Num frames 9800...
1229
+ [2024-10-16 02:57:34,531][00603] Num frames 9900...
1230
+ [2024-10-16 02:57:34,654][00603] Num frames 10000...
1231
+ [2024-10-16 02:57:34,778][00603] Num frames 10100...
1232
+ [2024-10-16 02:57:34,910][00603] Num frames 10200...
1233
+ [2024-10-16 02:57:35,049][00603] Num frames 10300...
1234
+ [2024-10-16 02:57:35,173][00603] Num frames 10400...
1235
+ [2024-10-16 02:57:35,299][00603] Num frames 10500...
1236
+ [2024-10-16 02:57:35,421][00603] Num frames 10600...
1237
+ [2024-10-16 02:57:35,546][00603] Num frames 10700...
1238
+ [2024-10-16 02:57:35,677][00603] Num frames 10800...
1239
+ [2024-10-16 02:57:35,803][00603] Num frames 10900...
1240
+ [2024-10-16 02:57:35,939][00603] Num frames 11000...
1241
+ [2024-10-16 02:57:36,063][00603] Num frames 11100...
1242
+ [2024-10-16 02:57:36,223][00603] Num frames 11200...
1243
+ [2024-10-16 02:57:36,397][00603] Num frames 11300...
1244
+ [2024-10-16 02:57:36,563][00603] Num frames 11400...
1245
+ [2024-10-16 02:57:36,735][00603] Num frames 11500...
1246
+ [2024-10-16 02:57:36,907][00603] Avg episode rewards: #0: 58.110, true rewards: #0: 19.278
1247
+ [2024-10-16 02:57:36,909][00603] Avg episode reward: 58.110, avg true_objective: 19.278
1248
+ [2024-10-16 02:57:36,971][00603] Num frames 11600...
1249
+ [2024-10-16 02:57:37,138][00603] Num frames 11700...
1250
+ [2024-10-16 02:57:37,305][00603] Num frames 11800...
1251
+ [2024-10-16 02:57:37,477][00603] Num frames 11900...
1252
+ [2024-10-16 02:57:37,651][00603] Num frames 12000...
1253
+ [2024-10-16 02:57:37,829][00603] Num frames 12100...
1254
+ [2024-10-16 02:57:38,019][00603] Num frames 12200...
1255
+ [2024-10-16 02:57:38,195][00603] Num frames 12300...
1256
+ [2024-10-16 02:57:38,374][00603] Num frames 12400...
1257
+ [2024-10-16 02:57:38,520][00603] Num frames 12500...
1258
+ [2024-10-16 02:57:38,642][00603] Num frames 12600...
1259
+ [2024-10-16 02:57:38,767][00603] Num frames 12700...
1260
+ [2024-10-16 02:57:38,890][00603] Num frames 12800...
1261
+ [2024-10-16 02:57:39,031][00603] Num frames 12900...
1262
+ [2024-10-16 02:57:39,154][00603] Num frames 13000...
1263
+ [2024-10-16 02:57:39,278][00603] Num frames 13100...
1264
+ [2024-10-16 02:57:39,403][00603] Num frames 13200...
1265
+ [2024-10-16 02:57:39,526][00603] Num frames 13300...
1266
+ [2024-10-16 02:57:39,651][00603] Num frames 13400...
1267
+ [2024-10-16 02:57:39,780][00603] Num frames 13500...
1268
+ [2024-10-16 02:57:39,906][00603] Num frames 13600...
1269
+ [2024-10-16 02:57:40,063][00603] Avg episode rewards: #0: 58.380, true rewards: #0: 19.524
1270
+ [2024-10-16 02:57:40,065][00603] Avg episode reward: 58.380, avg true_objective: 19.524
1271
+ [2024-10-16 02:57:40,107][00603] Num frames 13700...
1272
+ [2024-10-16 02:57:40,232][00603] Num frames 13800...
1273
+ [2024-10-16 02:57:40,355][00603] Num frames 13900...
1274
+ [2024-10-16 02:57:40,482][00603] Num frames 14000...
1275
+ [2024-10-16 02:57:40,606][00603] Num frames 14100...
1276
+ [2024-10-16 02:57:40,728][00603] Num frames 14200...
1277
+ [2024-10-16 02:57:40,852][00603] Num frames 14300...
1278
+ [2024-10-16 02:57:40,986][00603] Num frames 14400...
1279
+ [2024-10-16 02:57:41,113][00603] Num frames 14500...
1280
+ [2024-10-16 02:57:41,240][00603] Num frames 14600...
1281
+ [2024-10-16 02:57:41,367][00603] Num frames 14700...
1282
+ [2024-10-16 02:57:41,493][00603] Num frames 14800...
1283
+ [2024-10-16 02:57:41,621][00603] Num frames 14900...
1284
+ [2024-10-16 02:57:41,747][00603] Num frames 15000...
1285
+ [2024-10-16 02:57:41,876][00603] Num frames 15100...
1286
+ [2024-10-16 02:57:42,021][00603] Num frames 15200...
1287
+ [2024-10-16 02:57:42,150][00603] Num frames 15300...
1288
+ [2024-10-16 02:57:42,279][00603] Num frames 15400...
1289
+ [2024-10-16 02:57:42,405][00603] Num frames 15500...
1290
+ [2024-10-16 02:57:42,530][00603] Num frames 15600...
1291
+ [2024-10-16 02:57:42,655][00603] Num frames 15700...
1292
+ [2024-10-16 02:57:42,796][00603] Avg episode rewards: #0: 58.957, true rewards: #0: 19.709
1293
+ [2024-10-16 02:57:42,798][00603] Avg episode reward: 58.957, avg true_objective: 19.709
1294
+ [2024-10-16 02:57:42,844][00603] Num frames 15800...
1295
+ [2024-10-16 02:57:42,976][00603] Num frames 15900...
1296
+ [2024-10-16 02:57:43,106][00603] Num frames 16000...
1297
+ [2024-10-16 02:57:43,235][00603] Num frames 16100...
1298
+ [2024-10-16 02:57:43,360][00603] Num frames 16200...
1299
+ [2024-10-16 02:57:43,483][00603] Num frames 16300...
1300
+ [2024-10-16 02:57:43,611][00603] Num frames 16400...
1301
+ [2024-10-16 02:57:43,737][00603] Num frames 16500...
1302
+ [2024-10-16 02:57:43,864][00603] Num frames 16600...
1303
+ [2024-10-16 02:57:43,995][00603] Num frames 16700...
1304
+ [2024-10-16 02:57:44,128][00603] Num frames 16800...
1305
+ [2024-10-16 02:57:44,254][00603] Num frames 16900...
1306
+ [2024-10-16 02:57:44,384][00603] Num frames 17000...
1307
+ [2024-10-16 02:57:44,510][00603] Num frames 17100...
1308
+ [2024-10-16 02:57:44,635][00603] Num frames 17200...
1309
+ [2024-10-16 02:57:44,763][00603] Num frames 17300...
1310
+ [2024-10-16 02:57:44,889][00603] Num frames 17400...
1311
+ [2024-10-16 02:57:45,033][00603] Num frames 17500...
1312
+ [2024-10-16 02:57:45,164][00603] Num frames 17600...
1313
+ [2024-10-16 02:57:45,292][00603] Num frames 17700...
1314
+ [2024-10-16 02:57:45,419][00603] Num frames 17800...
1315
+ [2024-10-16 02:57:45,558][00603] Avg episode rewards: #0: 59.851, true rewards: #0: 19.852
1316
+ [2024-10-16 02:57:45,560][00603] Avg episode reward: 59.851, avg true_objective: 19.852
1317
+ [2024-10-16 02:57:45,603][00603] Num frames 17900...
1318
+ [2024-10-16 02:57:45,728][00603] Num frames 18000...
1319
+ [2024-10-16 02:57:45,853][00603] Num frames 18100...
1320
+ [2024-10-16 02:57:45,986][00603] Num frames 18200...
1321
+ [2024-10-16 02:57:46,115][00603] Num frames 18300...
1322
+ [2024-10-16 02:57:46,248][00603] Num frames 18400...
1323
+ [2024-10-16 02:57:46,373][00603] Num frames 18500...
1324
+ [2024-10-16 02:57:46,496][00603] Num frames 18600...
1325
+ [2024-10-16 02:57:46,619][00603] Num frames 18700...
1326
+ [2024-10-16 02:57:46,743][00603] Num frames 18800...
1327
+ [2024-10-16 02:57:46,867][00603] Num frames 18900...
1328
+ [2024-10-16 02:57:47,001][00603] Num frames 19000...
1329
+ [2024-10-16 02:57:47,128][00603] Num frames 19100...
1330
+ [2024-10-16 02:57:47,261][00603] Num frames 19200...
1331
+ [2024-10-16 02:57:47,388][00603] Num frames 19300...
1332
+ [2024-10-16 02:57:47,513][00603] Num frames 19400...
1333
+ [2024-10-16 02:57:47,641][00603] Num frames 19500...
1334
+ [2024-10-16 02:57:47,769][00603] Num frames 19600...
1335
+ [2024-10-16 02:57:47,895][00603] Num frames 19700...
1336
+ [2024-10-16 02:57:48,030][00603] Num frames 19800...
1337
+ [2024-10-16 02:57:48,158][00603] Num frames 19900...
1338
+ [2024-10-16 02:57:48,307][00603] Avg episode rewards: #0: 60.566, true rewards: #0: 19.967
1339
+ [2024-10-16 02:57:48,310][00603] Avg episode reward: 60.566, avg true_objective: 19.967
1340
+ [2024-10-16 02:59:39,055][00603] Replay video saved to train_dir/doom_health_gathering_supreme_2222/replay.mp4!
1341
+ [2024-10-16 03:00:01,234][00603] Loading existing experiment configuration from /content/train_dir/default_experiment/config.json
1342
+ [2024-10-16 03:00:01,237][00603] Overriding arg 'num_workers' with value 1 passed from command line
1343
+ [2024-10-16 03:00:01,239][00603] Adding new argument 'no_render'=True that is not in the saved config file!
1344
+ [2024-10-16 03:00:01,240][00603] Adding new argument 'save_video'=True that is not in the saved config file!
1345
+ [2024-10-16 03:00:01,242][00603] Adding new argument 'video_frames'=1000000000.0 that is not in the saved config file!
1346
+ [2024-10-16 03:00:01,244][00603] Adding new argument 'video_name'=None that is not in the saved config file!
1347
+ [2024-10-16 03:00:01,246][00603] Adding new argument 'max_num_frames'=100000 that is not in the saved config file!
1348
+ [2024-10-16 03:00:01,248][00603] Adding new argument 'max_num_episodes'=10 that is not in the saved config file!
1349
+ [2024-10-16 03:00:01,248][00603] Adding new argument 'push_to_hub'=True that is not in the saved config file!
1350
+ [2024-10-16 03:00:01,249][00603] Adding new argument 'hf_repository'='77qq/rl_course_vizdoom_health_gathering_supreme' that is not in the saved config file!
1351
+ [2024-10-16 03:00:01,250][00603] Adding new argument 'policy_index'=0 that is not in the saved config file!
1352
+ [2024-10-16 03:00:01,251][00603] Adding new argument 'eval_deterministic'=False that is not in the saved config file!
1353
+ [2024-10-16 03:00:01,252][00603] Adding new argument 'train_script'=None that is not in the saved config file!
1354
+ [2024-10-16 03:00:01,253][00603] Adding new argument 'enjoy_script'=None that is not in the saved config file!
1355
+ [2024-10-16 03:00:01,254][00603] Using frameskip 1 and render_action_repeat=4 for evaluation
1356
+ [2024-10-16 03:00:01,282][00603] RunningMeanStd input shape: (3, 72, 128)
1357
+ [2024-10-16 03:00:01,285][00603] RunningMeanStd input shape: (1,)
1358
+ [2024-10-16 03:00:01,297][00603] ConvEncoder: input_channels=3
1359
+ [2024-10-16 03:00:01,335][00603] Conv encoder output size: 512
1360
+ [2024-10-16 03:00:01,337][00603] Policy head output size: 512
1361
+ [2024-10-16 03:00:01,356][00603] Loading state from checkpoint /content/train_dir/default_experiment/checkpoint_p0/checkpoint_000000978_4005888.pth...
1362
+ [2024-10-16 03:00:01,779][00603] Num frames 100...
1363
+ [2024-10-16 03:00:01,899][00603] Num frames 200...
1364
+ [2024-10-16 03:00:02,041][00603] Num frames 300...
1365
+ [2024-10-16 03:00:02,129][00603] Avg episode rewards: #0: 5.260, true rewards: #0: 3.260
1366
+ [2024-10-16 03:00:02,130][00603] Avg episode reward: 5.260, avg true_objective: 3.260
1367
+ [2024-10-16 03:00:02,224][00603] Num frames 400...
1368
+ [2024-10-16 03:00:02,344][00603] Num frames 500...
1369
+ [2024-10-16 03:00:02,470][00603] Num frames 600...
1370
+ [2024-10-16 03:00:02,588][00603] Num frames 700...
1371
+ [2024-10-16 03:00:02,710][00603] Num frames 800...
1372
+ [2024-10-16 03:00:02,834][00603] Num frames 900...
1373
+ [2024-10-16 03:00:02,960][00603] Num frames 1000...
1374
+ [2024-10-16 03:00:03,078][00603] Num frames 1100...
1375
+ [2024-10-16 03:00:03,209][00603] Avg episode rewards: #0: 12.820, true rewards: #0: 5.820
1376
+ [2024-10-16 03:00:03,211][00603] Avg episode reward: 12.820, avg true_objective: 5.820
1377
+ [2024-10-16 03:00:03,257][00603] Num frames 1200...
1378
+ [2024-10-16 03:00:03,376][00603] Num frames 1300...
1379
+ [2024-10-16 03:00:03,503][00603] Num frames 1400...
1380
+ [2024-10-16 03:00:03,633][00603] Num frames 1500...
1381
+ [2024-10-16 03:00:03,756][00603] Num frames 1600...
1382
+ [2024-10-16 03:00:03,881][00603] Num frames 1700...
1383
+ [2024-10-16 03:00:04,009][00603] Num frames 1800...
1384
+ [2024-10-16 03:00:04,131][00603] Num frames 1900...
1385
+ [2024-10-16 03:00:04,253][00603] Num frames 2000...
1386
+ [2024-10-16 03:00:04,376][00603] Num frames 2100...
1387
+ [2024-10-16 03:00:04,513][00603] Num frames 2200...
1388
+ [2024-10-16 03:00:04,635][00603] Num frames 2300...
1389
+ [2024-10-16 03:00:04,759][00603] Num frames 2400...
1390
+ [2024-10-16 03:00:04,887][00603] Num frames 2500...
1391
+ [2024-10-16 03:00:05,016][00603] Num frames 2600...
1392
+ [2024-10-16 03:00:05,136][00603] Num frames 2700...
1393
+ [2024-10-16 03:00:05,263][00603] Num frames 2800...
1394
+ [2024-10-16 03:00:05,392][00603] Avg episode rewards: #0: 22.200, true rewards: #0: 9.533
1395
+ [2024-10-16 03:00:05,394][00603] Avg episode reward: 22.200, avg true_objective: 9.533
1396
+ [2024-10-16 03:00:05,447][00603] Num frames 2900...
1397
+ [2024-10-16 03:00:05,575][00603] Num frames 3000...
1398
+ [2024-10-16 03:00:05,699][00603] Num frames 3100...
1399
+ [2024-10-16 03:00:05,821][00603] Num frames 3200...
1400
+ [2024-10-16 03:00:05,950][00603] Num frames 3300...
1401
+ [2024-10-16 03:00:06,070][00603] Num frames 3400...
1402
+ [2024-10-16 03:00:06,189][00603] Num frames 3500...
1403
+ [2024-10-16 03:00:06,310][00603] Num frames 3600...
1404
+ [2024-10-16 03:00:06,436][00603] Num frames 3700...
1405
+ [2024-10-16 03:00:06,609][00603] Num frames 3800...
1406
+ [2024-10-16 03:00:06,783][00603] Num frames 3900...
1407
+ [2024-10-16 03:00:06,951][00603] Num frames 4000...
1408
+ [2024-10-16 03:00:07,111][00603] Num frames 4100...
1409
+ [2024-10-16 03:00:07,279][00603] Num frames 4200...
1410
+ [2024-10-16 03:00:07,441][00603] Num frames 4300...
1411
+ [2024-10-16 03:00:07,610][00603] Num frames 4400...
1412
+ [2024-10-16 03:00:07,772][00603] Avg episode rewards: #0: 25.400, true rewards: #0: 11.150
1413
+ [2024-10-16 03:00:07,774][00603] Avg episode reward: 25.400, avg true_objective: 11.150
1414
+ [2024-10-16 03:00:07,841][00603] Num frames 4500...
1415
+ [2024-10-16 03:00:08,017][00603] Num frames 4600...
1416
+ [2024-10-16 03:00:08,188][00603] Num frames 4700...
1417
+ [2024-10-16 03:00:08,363][00603] Num frames 4800...
1418
+ [2024-10-16 03:00:08,539][00603] Num frames 4900...
1419
+ [2024-10-16 03:00:08,718][00603] Num frames 5000...
1420
+ [2024-10-16 03:00:08,886][00603] Num frames 5100...
1421
+ [2024-10-16 03:00:09,021][00603] Avg episode rewards: #0: 23.528, true rewards: #0: 10.328
1422
+ [2024-10-16 03:00:09,023][00603] Avg episode reward: 23.528, avg true_objective: 10.328
1423
+ [2024-10-16 03:00:09,070][00603] Num frames 5200...
1424
+ [2024-10-16 03:00:09,188][00603] Num frames 5300...
1425
+ [2024-10-16 03:00:09,309][00603] Num frames 5400...
1426
+ [2024-10-16 03:00:09,427][00603] Num frames 5500...
1427
+ [2024-10-16 03:00:09,557][00603] Num frames 5600...
1428
+ [2024-10-16 03:00:09,680][00603] Num frames 5700...
1429
+ [2024-10-16 03:00:09,812][00603] Num frames 5800...
1430
+ [2024-10-16 03:00:09,939][00603] Num frames 5900...
1431
+ [2024-10-16 03:00:10,063][00603] Num frames 6000...
1432
+ [2024-10-16 03:00:10,184][00603] Num frames 6100...
1433
+ [2024-10-16 03:00:10,303][00603] Num frames 6200...
1434
+ [2024-10-16 03:00:10,424][00603] Num frames 6300...
1435
+ [2024-10-16 03:00:10,547][00603] Num frames 6400...
1436
+ [2024-10-16 03:00:10,671][00603] Avg episode rewards: #0: 24.590, true rewards: #0: 10.757
1437
+ [2024-10-16 03:00:10,674][00603] Avg episode reward: 24.590, avg true_objective: 10.757
1438
+ [2024-10-16 03:00:10,741][00603] Num frames 6500...
1439
+ [2024-10-16 03:00:10,861][00603] Num frames 6600...
1440
+ [2024-10-16 03:00:10,991][00603] Num frames 6700...
1441
+ [2024-10-16 03:00:11,111][00603] Num frames 6800...
1442
+ [2024-10-16 03:00:11,231][00603] Num frames 6900...
1443
+ [2024-10-16 03:00:11,352][00603] Num frames 7000...
1444
+ [2024-10-16 03:00:11,475][00603] Num frames 7100...
1445
+ [2024-10-16 03:00:11,600][00603] Num frames 7200...
1446
+ [2024-10-16 03:00:11,683][00603] Avg episode rewards: #0: 22.746, true rewards: #0: 10.317
1447
+ [2024-10-16 03:00:11,684][00603] Avg episode reward: 22.746, avg true_objective: 10.317
1448
+ [2024-10-16 03:00:11,791][00603] Num frames 7300...
1449
+ [2024-10-16 03:00:11,912][00603] Num frames 7400...
1450
+ [2024-10-16 03:00:12,038][00603] Num frames 7500...
1451
+ [2024-10-16 03:00:12,157][00603] Num frames 7600...
1452
+ [2024-10-16 03:00:12,259][00603] Avg episode rewards: #0: 20.673, true rewards: #0: 9.547
1453
+ [2024-10-16 03:00:12,261][00603] Avg episode reward: 20.673, avg true_objective: 9.547
1454
+ [2024-10-16 03:00:12,339][00603] Num frames 7700...
1455
+ [2024-10-16 03:00:12,458][00603] Num frames 7800...
1456
+ [2024-10-16 03:00:12,577][00603] Num frames 7900...
1457
+ [2024-10-16 03:00:12,700][00603] Num frames 8000...
1458
+ [2024-10-16 03:00:12,831][00603] Num frames 8100...
1459
+ [2024-10-16 03:00:12,959][00603] Num frames 8200...
1460
+ [2024-10-16 03:00:13,080][00603] Num frames 8300...
1461
+ [2024-10-16 03:00:13,205][00603] Num frames 8400...
1462
+ [2024-10-16 03:00:13,328][00603] Num frames 8500...
1463
+ [2024-10-16 03:00:13,447][00603] Num frames 8600...
1464
+ [2024-10-16 03:00:13,564][00603] Num frames 8700...
1465
+ [2024-10-16 03:00:13,684][00603] Num frames 8800...
1466
+ [2024-10-16 03:00:13,815][00603] Num frames 8900...
1467
+ [2024-10-16 03:00:13,943][00603] Num frames 9000...
1468
+ [2024-10-16 03:00:14,062][00603] Num frames 9100...
1469
+ [2024-10-16 03:00:14,183][00603] Num frames 9200...
1470
+ [2024-10-16 03:00:14,247][00603] Avg episode rewards: #0: 23.118, true rewards: #0: 10.229
1471
+ [2024-10-16 03:00:14,248][00603] Avg episode reward: 23.118, avg true_objective: 10.229
1472
+ [2024-10-16 03:00:14,362][00603] Num frames 9300...
1473
+ [2024-10-16 03:00:14,482][00603] Num frames 9400...
1474
+ [2024-10-16 03:00:14,602][00603] Num frames 9500...
1475
+ [2024-10-16 03:00:14,727][00603] Num frames 9600...
1476
+ [2024-10-16 03:00:14,854][00603] Num frames 9700...
1477
+ [2024-10-16 03:00:14,981][00603] Num frames 9800...
1478
+ [2024-10-16 03:00:15,100][00603] Num frames 9900...
1479
+ [2024-10-16 03:00:15,221][00603] Num frames 10000...
1480
+ [2024-10-16 03:00:15,341][00603] Num frames 10100...
1481
+ [2024-10-16 03:00:15,402][00603] Avg episode rewards: #0: 22.502, true rewards: #0: 10.102
1482
+ [2024-10-16 03:00:15,404][00603] Avg episode reward: 22.502, avg true_objective: 10.102
1483
+ [2024-10-16 03:01:10,044][00603] Replay video saved to /content/train_dir/default_experiment/replay.mp4!
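The log reports only running averages after each episode, while the README records a mean with a spread (`10.10 +/- 4.72`). A minimal sketch of how the two relate, using the ten `avg true_objective` values from the final evaluation run above: invert the running mean to recover per-episode values, then summarize them. Using the population standard deviation is an assumption here, not something the log states, and the helper names are hypothetical.

```python
import re
import statistics

# "Avg episode reward" lines copied from the final evaluation run in sf_log.txt.
LOG_LINES = """\
[2024-10-16 03:00:02,130][00603] Avg episode reward: 5.260, avg true_objective: 3.260
[2024-10-16 03:00:03,211][00603] Avg episode reward: 12.820, avg true_objective: 5.820
[2024-10-16 03:00:05,394][00603] Avg episode reward: 22.200, avg true_objective: 9.533
[2024-10-16 03:00:07,774][00603] Avg episode reward: 25.400, avg true_objective: 11.150
[2024-10-16 03:00:09,023][00603] Avg episode reward: 23.528, avg true_objective: 10.328
[2024-10-16 03:00:10,674][00603] Avg episode reward: 24.590, avg true_objective: 10.757
[2024-10-16 03:00:11,684][00603] Avg episode reward: 22.746, avg true_objective: 10.317
[2024-10-16 03:00:12,261][00603] Avg episode reward: 20.673, avg true_objective: 9.547
[2024-10-16 03:00:14,248][00603] Avg episode reward: 23.118, avg true_objective: 10.229
[2024-10-16 03:00:15,404][00603] Avg episode reward: 22.502, avg true_objective: 10.102
"""

PATTERN = re.compile(r"avg true_objective: (-?\d+\.\d+)")


def running_means(log_text: str) -> list[float]:
    """Extract the running average of the true objective from each log line."""
    return [float(m.group(1)) for m in PATTERN.finditer(log_text)]


def per_episode(means: list[float]) -> list[float]:
    """Invert a running mean: x_n = n * m_n - (n - 1) * m_(n-1)."""
    values = []
    previous_total = 0.0
    for n, mean_n in enumerate(means, start=1):
        total = n * mean_n
        values.append(total - previous_total)
        previous_total = total
    return values


rewards = per_episode(running_means(LOG_LINES))
mean = statistics.fmean(rewards)
std = statistics.pstdev(rewards)  # population std (assumed convention)
print(f"{mean:.2f} +/- {std:.2f}")
```

Because the logged averages are rounded to three decimals, the recovered per-episode values are approximate, but the summary should agree with the README value to two decimals.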