open_entity_train.log
Neither PyTorch nor TensorFlow >= 2.0 have been found.Models won't be available and only tokenizers, configurationand file/data utilities can be used.
W0114 14:48:26.656342 6263 device_context.cc:447] Please NOTE: device: 0, GPU Compute Capability: 7.0, Driver API Version: 10.1, Runtime API Version: 10.1
W0114 14:48:26.660746 6263 device_context.cc:465] device: 0, cuDNN Version: 7.6.
0%| | 0/1998 [00:00<?, ?it/s]
8%|▊ | 153/1998 [00:00<00:01, 1526.34it/s]
16%|█▋ | 328/1998 [00:00<00:01, 1584.63it/s]
26%|██▋ | 525/1998 [00:00<00:00, 1680.88it/s]
37%|███▋ | 739/1998 [00:00<00:00, 1794.03it/s]
48%|████▊ | 953/1998 [00:00<00:00, 1884.84it/s]
57%|█████▋ | 1144/1998 [00:00<00:00, 1888.51it/s]
67%|██████▋ | 1331/1998 [00:00<00:00, 1882.54it/s]
79%|███████▊ | 1573/1998 [00:00<00:00, 2014.05it/s]
91%|█████████ | 1809/1998 [00:00<00:00, 2106.54it/s]
100%|██████████| 1998/1998 [00:00<00:00, 2038.22it/s]
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/layers.py:1436: UserWarning: Skip loading for typing.weight. typing.weight is not found in the provided dict.
warnings.warn(("Skip loading for {}. ".format(key) + str(err)))
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/layers.py:1436: UserWarning: Skip loading for typing.bias. typing.bias is not found in the provided dict.
warnings.warn(("Skip loading for {}. ".format(key) + str(err)))
0%| | 0/1497 [00:00<?, ?it/s]epoch: 0 loss: 0.3724277 f1: 0.0000000: 0%| | 0/1497 [00:00<?, ?it/s]epoch: 0 loss: 0.3724277 f1: 0.0000000: 0%| | 1/1497 [00:00<22:39, 1.10it/s]epoch: 0 loss: 0.3899120 f1: 0.0000000: 0%| | 1/1497 [00:01<22:39, 1.10it/s]epoch: 0 loss: 0.3899120 f1: 0.0000000: 0%| | 2/1497 [00:01<19:26, 1.28it/s]epoch: 0 loss: 0.3362266 f1: 0.0000000: 0%| | 2/1497 [00:01<19:26, 1.28it/s]epoch: 0 loss: 0.3362266 f1: 0.0000000: 0%| | 3/1497 [00:01<17:15, 1.44it/s]epoch: 0 loss: 0.3738964 f1: 0.0000000: 0%| | 3/1497 [00:02<17:15, 1.44it/s]epoch: 0 loss: 0.3738964 f1: 0.0000000: 0%| | 4/1497 [00:02<15:43, 1.58it/s]epoch: 0 loss: 0.3631856 f1: 0.0000000: 0%| | 4/1497 [00:02<15:43, 1.58it/s]epoch: 0 loss: 0.3631856 f1: 0.0000000: 0%| | 5/1497 [00:02<14:38, 1.70it/s]epoch: 0 loss: 0.3372850 f1: 0.0000000: 0%| | 5/1497 [00:03<14:38, 1.70it/s]epoch: 0 loss: 0.3372850 f1: 0.0000000: 0%| | 6/1497 [00:03<13:55, 1.79it/s]epoch: 0 loss: 0.3826302 f1: 0.0000000: 0%| | 6/1497 [00:03<13:55, 1.79it/s]epoch: 0 loss: 0.3826302 f1: 0.0000000: 0%| | 7/1497 [00:03<13:21, 1.86it/s]epoch: 0 loss: 0.3998697 f1: 0.0000000: 0%| | 7/1497 [00:04<13:21, 1.86it/s]epoch: 0 loss: 0.3998697 f1: 0.0000000: 1%| | 8/1497 [00:04<12:56, 1.92it/s]epoch: 0 loss: 0.3724897 f1: 0.0000000: 1%| | 8/1497 [00:04<12:56, 1.92it/s]epoch: 0 loss: 0.3724897 f1: 0.0000000: 1%| | 9/1497 [00:04<12:47, 1.94it/s]epoch: 0 loss: 0.4026419 f1: 0.0000000: 1%| | 9/1497 [00:05<12:47, 1.94it/s]epoch: 0 loss: 0.4026419 f1: 0.0000000: 1%| | 10/1497 [00:05<12:36, 1.97it/s]epoch: 0 loss: 0.3945037 f1: 0.0000000: 1%| | 10/1497 [00:05<12:36, 1.97it/s]epoch: 0 loss: 0.3945037 f1: 0.0000000: 1%| | 11/1497 [00:05<12:35, 1.97it/s]epoch: 0 loss: 0.3374860 f1: 0.0000000: 1%| | 11/1497 [00:06<12:35, 1.97it/s]epoch: 0 loss: 0.3374860 f1: 0.0000000: 1%| | 12/1497 [00:06<12:25, 1.99it/s]epoch: 0 loss: 0.3745956 f1: 0.0000000: 1%| | 12/1497 [00:06<12:25, 1.99it/s]epoch: 0 loss: 0.3745956 f1: 0.0000000: 1%| | 13/1497 [00:06<12:16, 
2.01it/s]epoch: 0 loss: 0.3399995 f1: 0.0000000: 1%| | 13/1497 [00:07<12:16, 2.01it/s]epoch: 0 loss: 0.3399995 f1: 0.0000000: 1%| | 14/1497 [00:07<12:20, 2.00it/s]epoch: 0 loss: 0.3396432 f1: 0.0000000: 1%| | 14/1497 [00:07<12:20, 2.00it/s]epoch: 0 loss: 0.3396432 f1: 0.0000000: 1%| | 15/1497 [00:07<12:22, 1.99it/s]epoch: 0 loss: 0.3975611 f1: 0.0000000: 1%| | 15/1497 [00:08<12:22, 1.99it/s]epoch: 0 loss: 0.3975611 f1: 0.0000000: 1%| | 16/1497 [00:08<12:20, 2.00it/s]epoch: 0 loss: 0.3673398 f1: 0.0000000: 1%| | 16/1497 [00:08<12:20, 2.00it/s]epoch: 0 loss: 0.3673398 f1: 0.0000000: 1%| | 17/1497 [00:08<12:20, 2.00it/s]epoch: 0 loss: 0.3337799 f1: 0.0000000: 1%| | 17/1497 [00:09<12:20, 2.00it/s]epoch: 0 loss: 0.3337799 f1: 0.0000000: 1%| | 18/1497 [00:09<12:18, 2.00it/s]epoch: 0 loss: 0.3107176 f1: 0.0000000: 1%| | 18/1497 [00:09<12:18, 2.00it/s]epoch: 0 loss: 0.3107176 f1: 0.0000000: 1%|▏ | 19/1497 [00:09<12:20, 2.00it/s]epoch: 0 loss: 0.3714368 f1: 0.0000000: 1%|▏ | 19/1497 [00:10<12:20, 2.00it/s]epoch: 0 loss: 0.3714368 f1: 0.0000000: 1%|▏ | 20/1497 [00:10<12:18, 2.00it/s]epoch: 0 loss: 0.3510816 f1: 0.0000000: 1%|▏ | 20/1497 [00:10<12:18, 2.00it/s]epoch: 0 loss: 0.3510816 f1: 0.0000000: 1%|▏ | 21/1497 [00:10<12:16, 2.00it/s]epoch: 0 loss: 0.3633123 f1: 0.0000000: 1%|▏ | 21/1497 [00:11<12:16, 2.00it/s]epoch: 0 loss: 0.3633123 f1: 0.0000000: 1%|▏ | 22/1497 [00:11<12:16, 2.00it/s]epoch: 0 loss: 0.3385642 f1: 0.0000000: 1%|▏ | 22/1497 [00:11<12:16, 2.00it/s]epoch: 0 loss: 0.3385642 f1: 0.0000000: 2%|▏ | 23/1497 [00:11<12:15, 2.01it/s]epoch: 0 loss: 0.3175578 f1: 0.0000000: 2%|▏ | 23/1497 [00:12<12:15, 2.01it/s]epoch: 0 loss: 0.3175578 f1: 0.0000000: 2%|▏ | 24/1497 [00:12<12:15, 2.00it/s]epoch: 0 loss: 0.3294161 f1: 0.0000000: 2%|▏ | 24/1497 [00:12<12:15, 2.00it/s]epoch: 0 loss: 0.3294161 f1: 0.0000000: 2%|▏ | 25/1497 [00:12<12:15, 2.00it/s]epoch: 0 loss: 0.3961508 f1: 0.0000000: 2%|▏ | 25/1497 [00:13<12:15, 2.00it/s]epoch: 0 loss: 0.3961508 f1: 0.0000000: 2%|▏ | 
26/1497 [00:13<12:15, 2.00it/s]epoch: 0 loss: 0.3467003 f1: 0.0000000: 2%|▏ | 26/1497 [00:13<12:15, 2.00it/s]epoch: 0 loss: 0.3467003 f1: 0.0000000: 2%|▏ | 27/1497 [00:13<12:12, 2.01it/s]epoch: 0 loss: 0.3159914 f1: 0.0000000: 2%|▏ | 27/1497 [00:14<12:12, 2.01it/s]epoch: 0 loss: 0.3159914 f1: 0.0000000: 2%|▏ | 28/1497 [00:14<12:13, 2.00it/s]epoch: 0 loss: 0.3129136 f1: 0.0000000: 2%|▏ | 28/1497 [00:14<12:13, 2.00it/s]epoch: 0 loss: 0.3129136 f1: 0.0000000: 2%|▏ | 29/1497 [00:14<12:16, 1.99it/s]epoch: 0 loss: 0.3180925 f1: 0.0000000: 2%|▏ | 29/1497 [00:15<12:16, 1.99it/s]epoch: 0 loss: 0.3180925 f1: 0.0000000: 2%|▏ | 30/1497 [00:15<12:15, 1.99it/s]epoch: 0 loss: 0.2712103 f1: 0.0000000: 2%|▏ | 30/1497 [00:15<12:15, 1.99it/s]epoch: 0 loss: 0.2712103 f1: 0.0000000: 2%|▏ | 31/1497 [00:15<12:14, 2.00it/s]epoch: 0 loss: 0.2889816 f1: 0.0000000: 2%|▏ | 31/1497 [00:16<12:14, 2.00it/s]epoch: 0 loss: 0.2889816 f1: 0.0000000: 2%|▏ | 32/1497 [00:16<12:12, 2.00it/s]epoch: 0 loss: 0.3079190 f1: 0.0000000: 2%|▏ | 32/1497 [00:16<12:12, 2.00it/s]epoch: 0 loss: 0.3079190 f1: 0.0000000: 2%|▏ | 33/1497 [00:16<12:11, 2.00it/s]epoch: 0 loss: 0.3211264 f1: 0.0000000: 2%|▏ | 33/1497 [00:17<12:11, 2.00it/s]epoch: 0 loss: 0.3211264 f1: 0.0000000: 2%|▏ | 34/1497 [00:17<12:12, 2.00it/s]epoch: 0 loss: 0.3313974 f1: 0.0000000: 2%|▏ | 34/1497 [00:17<12:12, 2.00it/s]epoch: 0 loss: 0.3313974 f1: 0.0000000: 2%|▏ | 35/1497 [00:17<12:09, 2.00it/s]epoch: 0 loss: 0.2739582 f1: 0.0000000: 2%|▏ | 35/1497 [00:18<12:09, 2.00it/s]epoch: 0 loss: 0.2739582 f1: 0.0000000: 2%|▏ | 36/1497 [00:18<12:05, 2.01it/s]epoch: 0 loss: 0.2457094 f1: 0.0000000: 2%|▏ | 36/1497 [00:18<12:05, 2.01it/s]epoch: 0 loss: 0.2457094 f1: 0.0000000: 2%|▏ | 37/1497 [00:18<12:00, 2.03it/s]epoch: 0 loss: 0.2618712 f1: 0.0000000: 2%|▏ | 37/1497 [00:19<12:00, 2.03it/s]epoch: 0 loss: 0.2618712 f1: 0.0000000: 3%|▎ | 38/1497 [00:19<12:00, 2.03it/s]epoch: 0 loss: 0.2945471 f1: 0.0000000: 3%|▎ | 38/1497 [00:19<12:00, 2.03it/s]epoch: 0 loss: 
0.2945471 f1: 0.0000000: 3%|▎ | 39/1497 [00:19<11:58, 2.03it/s]epoch: 0 loss: 0.2049470 f1: 0.0000000: 3%|▎ | 39/1497 [00:20<11:58, 2.03it/s]epoch: 0 loss: 0.2049470 f1: 0.0000000: 3%|▎ | 40/1497 [00:20<11:57, 2.03it/s]epoch: 0 loss: 0.2269126 f1: 0.0000000: 3%|▎ | 40/1497 [00:20<11:57, 2.03it/s]epoch: 0 loss: 0.2269126 f1: 0.0000000: 3%|▎ | 41/1497 [00:20<12:02, 2.01it/s]epoch: 0 loss: 0.2828192 f1: 0.0000000: 3%|▎ | 41/1497 [00:21<12:02, 2.01it/s]epoch: 0 loss: 0.2828192 f1: 0.0000000: 3%|▎ | 42/1497 [00:21<11:55, 2.03it/s]epoch: 0 loss: 0.1907831 f1: 0.0000000: 3%|▎ | 42/1497 [00:21<11:55, 2.03it/s]epoch: 0 loss: 0.1907831 f1: 0.0000000: 3%|▎ | 43/1497 [00:21<11:55, 2.03it/s]epoch: 0 loss: 0.2434448 f1: 0.0000000: 3%|▎ | 43/1497 [00:22<11:55, 2.03it/s]epoch: 0 loss: 0.2434448 f1: 0.0000000: 3%|▎ | 44/1497 [00:22<11:57, 2.03it/s]epoch: 0 loss: 0.0884956 f1: 0.0000000: 3%|▎ | 44/1497 [00:22<11:57, 2.03it/s]epoch: 0 loss: 0.0884956 f1: 0.0000000: 3%|▎ | 45/1497 [00:22<11:59, 2.02it/s]epoch: 0 loss: 0.1036541 f1: 0.0000000: 3%|▎ | 45/1497 [00:23<11:59, 2.02it/s]epoch: 0 loss: 0.1036541 f1: 0.0000000: 3%|▎ | 46/1497 [00:23<11:58, 2.02it/s]epoch: 0 loss: 0.1798892 f1: 0.0000000: 3%|▎ | 46/1497 [00:23<11:58, 2.02it/s]epoch: 0 loss: 0.1798892 f1: 0.0000000: 3%|▎ | 47/1497 [00:23<12:03, 2.00it/s]epoch: 0 loss: 0.2223373 f1: 0.0000000: 3%|▎ | 47/1497 [00:24<12:03, 2.00it/s]epoch: 0 loss: 0.2223373 f1: 0.0000000: 3%|▎ | 48/1497 [00:24<12:01, 2.01it/s]epoch: 0 loss: 0.0731502 f1: 0.0000000: 3%|▎ | 48/1497 [00:24<12:01, 2.01it/s]epoch: 0 loss: 0.0731502 f1: 0.0000000: 3%|▎ | 49/1497 [00:24<11:59, 2.01it/s]epoch: 0 loss: 0.2375898 f1: 0.0000000: 3%|▎ | 49/1497 [00:25<11:59, 2.01it/s]epoch: 0 loss: 0.2375898 f1: 0.0000000: 3%|▎ | 50/1497 [00:25<11:59, 2.01it/s]epoch: 0 loss: 0.2303034 f1: 0.0000000: 3%|▎ | 50/1497 [00:25<11:59, 2.01it/s]epoch: 0 loss: 0.2303034 f1: 0.0000000: 3%|▎ | 51/1497 [00:25<12:03, 2.00it/s]epoch: 0 loss: 0.1507848 f1: 0.0000000: 3%|▎ | 51/1497 
[00:26<12:03, 2.00it/s]epoch: 0 loss: 0.1507848 f1: 0.0000000: 3%|▎ | 52/1497 [00:26<12:10, 1.98it/s]epoch: 0 loss: 0.1542218 f1: 0.0000000: 3%|▎ | 52/1497 [00:26<12:10, 1.98it/s]epoch: 0 loss: 0.1542218 f1: 0.0000000: 4%|▎ | 53/1497 [00:26<12:04, 1.99it/s]epoch: 0 loss: 0.1554600 f1: 0.0000000: 4%|▎ | 53/1497 [00:27<12:04, 1.99it/s]epoch: 0 loss: 0.1554600 f1: 0.0000000: 4%|▎ | 54/1497 [00:27<12:03, 1.99it/s]epoch: 0 loss: 0.1885582 f1: 0.0000000: 4%|▎ | 54/1497 [00:27<12:03, 1.99it/s]epoch: 0 loss: 0.1885582 f1: 0.0000000: 4%|▎ | 55/1497 [00:27<12:02, 2.00it/s]epoch: 0 loss: 0.0963390 f1: 0.0000000: 4%|▎ | 55/1497 [00:28<12:02, 2.00it/s]epoch: 0 loss: 0.0963390 f1: 0.0000000: 4%|▎ | 56/1497 [00:28<12:13, 1.96it/s]epoch: 0 loss: 0.0839978 f1: 0.0000000: 4%|▎ | 56/1497 [00:28<12:13, 1.96it/s]epoch: 0 loss: 0.0839978 f1: 0.0000000: 4%|▍ | 57/1497 [00:28<12:10, 1.97it/s]epoch: 0 loss: 0.1261560 f1: 0.0000000: 4%|▍ | 57/1497 [00:29<12:10, 1.97it/s]epoch: 0 loss: 0.1261560 f1: 0.0000000: 4%|▍ | 58/1497 [00:29<12:06, 1.98it/s]epoch: 0 loss: 0.1386163 f1: 0.0000000: 4%|▍ | 58/1497 [00:29<12:06, 1.98it/s]epoch: 0 loss: 0.1386163 f1: 0.0000000: 4%|▍ | 59/1497 [00:29<12:05, 1.98it/s]epoch: 0 loss: 0.0899991 f1: 0.0000000: 4%|▍ | 59/1497 [00:30<12:05, 1.98it/s]epoch: 0 loss: 0.0899991 f1: 0.0000000: 4%|▍ | 60/1497 [00:30<12:05, 1.98it/s]epoch: 0 loss: 0.2630520 f1: 0.0000000: 4%|▍ | 60/1497 [00:30<12:05, 1.98it/s]epoch: 0 loss: 0.2630520 f1: 0.0000000: 4%|▍ | 61/1497 [00:30<12:02, 1.99it/s]epoch: 0 loss: 0.1824401 f1: 0.0000000: 4%|▍ | 61/1497 [00:31<12:02, 1.99it/s]epoch: 0 loss: 0.1824401 f1: 0.0000000: 4%|▍ | 62/1497 [00:31<12:01, 1.99it/s]epoch: 0 loss: 0.3225296 f1: 0.0000000: 4%|▍ | 62/1497 [00:31<12:01, 1.99it/s]epoch: 0 loss: 0.3225296 f1: 0.0000000: 4%|▍ | 63/1497 [00:31<11:59, 1.99it/s]epoch: 0 loss: 0.0958273 f1: 0.0000000: 4%|▍ | 63/1497 [00:32<11:59, 1.99it/s]epoch: 0 loss: 0.0958273 f1: 0.0000000: 4%|▍ | 64/1497 [00:32<11:58, 1.99it/s]epoch: 0 loss: 0.1355120 
f1: 0.0000000: 4%|▍ | 64/1497 [00:32<11:58, 1.99it/s]epoch: 0 loss: 0.1355120 f1: 0.0000000: 4%|▍ | 65/1497 [00:32<12:08, 1.96it/s]epoch: 0 loss: 0.1958239 f1: 0.0000000: 4%|▍ | 65/1497 [00:33<12:08, 1.96it/s]epoch: 0 loss: 0.1958239 f1: 0.0000000: 4%|▍ | 66/1497 [00:33<12:04, 1.97it/s]epoch: 0 loss: 0.1321862 f1: 0.0000000: 4%|▍ | 66/1497 [00:33<12:04, 1.97it/s]epoch: 0 loss: 0.1321862 f1: 0.0000000: 4%|▍ | 67/1497 [00:33<12:01, 1.98it/s]epoch: 0 loss: 0.0522069 f1: 0.0000000: 4%|▍ | 67/1497 [00:34<12:01, 1.98it/s]epoch: 0 loss: 0.0522069 f1: 0.0000000: 5%|▍ | 68/1497 [00:34<11:57, 1.99it/s]epoch: 0 loss: 0.2529097 f1: 0.0000000: 5%|▍ | 68/1497 [00:34<11:57, 1.99it/s]epoch: 0 loss: 0.2529097 f1: 0.0000000: 5%|▍ | 69/1497 [00:34<11:55, 1.99it/s]epoch: 0 loss: 0.1226426 f1: 0.0000000: 5%|▍ | 69/1497 [00:35<11:55, 1.99it/s]epoch: 0 loss: 0.1226426 f1: 0.0000000: 5%|▍ | 70/1497 [00:35<11:54, 2.00it/s]epoch: 0 loss: 0.0816366 f1: 0.0000000: 5%|▍ | 70/1497 [00:35<11:54, 2.00it/s]epoch: 0 loss: 0.0816366 f1: 0.0000000: 5%|▍ | 71/1497 [00:35<11:54, 2.00it/s]epoch: 0 loss: 0.1666372 f1: 0.0000000: 5%|▍ | 71/1497 [00:36<11:54, 2.00it/s]epoch: 0 loss: 0.1666372 f1: 0.0000000: 5%|▍ | 72/1497 [00:36<11:58, 1.98it/s]epoch: 0 loss: 0.1201888 f1: 0.0000000: 5%|▍ | 72/1497 [00:36<11:58, 1.98it/s]epoch: 0 loss: 0.1201888 f1: 0.0000000: 5%|▍ | 73/1497 [00:36<11:56, 1.99it/s]epoch: 0 loss: 0.1979781 f1: 0.0000000: 5%|▍ | 73/1497 [00:37<11:56, 1.99it/s]epoch: 0 loss: 0.1979781 f1: 0.0000000: 5%|▍ | 74/1497 [00:37<12:01, 1.97it/s]epoch: 0 loss: 0.1365510 f1: 0.0000000: 5%|▍ | 74/1497 [00:37<12:01, 1.97it/s]epoch: 0 loss: 0.1365510 f1: 0.0000000: 5%|▌ | 75/1497 [00:37<12:02, 1.97it/s]epoch: 0 loss: 0.1377068 f1: 0.0000000: 5%|▌ | 75/1497 [00:38<12:02, 1.97it/s]epoch: 0 loss: 0.1377068 f1: 0.0000000: 5%|▌ | 76/1497 [00:38<12:12, 1.94it/s]epoch: 0 loss: 0.1064331 f1: 0.0000000: 5%|▌ | 76/1497 [00:38<12:12, 1.94it/s]epoch: 0 loss: 0.1064331 f1: 0.0000000: 5%|▌ | 77/1497 [00:38<12:09, 
1.95it/s]epoch: 0 loss: 0.0898482 f1: 0.0000000: 5%|▌ | 77/1497 [00:39<12:09, 1.95it/s]epoch: 0 loss: 0.0898482 f1: 0.0000000: 5%|▌ | 78/1497 [00:39<12:08, 1.95it/s]epoch: 0 loss: 0.0515544 f1: 0.0000000: 5%|▌ | 78/1497 [00:39<12:08, 1.95it/s]epoch: 0 loss: 0.0515544 f1: 0.0000000: 5%|▌ | 79/1497 [00:39<12:03, 1.96it/s]epoch: 0 loss: 0.0567444 f1: 0.0000000: 5%|▌ | 79/1497 [00:40<12:03, 1.96it/s]epoch: 0 loss: 0.0567444 f1: 0.0000000: 5%|▌ | 80/1497 [00:40<11:57, 1.98it/s]epoch: 0 loss: 0.1159253 f1: 0.0000000: 5%|▌ | 80/1497 [00:40<11:57, 1.98it/s]epoch: 0 loss: 0.1159253 f1: 0.0000000: 5%|▌ | 81/1497 [00:40<11:56, 1.98it/s]epoch: 0 loss: 0.2130250 f1: 0.0000000: 5%|▌ | 81/1497 [00:41<11:56, 1.98it/s]epoch: 0 loss: 0.2130250 f1: 0.0000000: 5%|▌ | 82/1497 [00:41<11:57, 1.97it/s]epoch: 0 loss: 0.1966398 f1: 0.0000000: 5%|▌ | 82/1497 [00:41<11:57, 1.97it/s]epoch: 0 loss: 0.1966398 f1: 0.0000000: 6%|▌ | 83/1497 [00:41<11:54, 1.98it/s]epoch: 0 loss: 0.1579647 f1: 0.0000000: 6%|▌ | 83/1497 [00:42<11:54, 1.98it/s]epoch: 0 loss: 0.1579647 f1: 0.0000000: 6%|▌ | 84/1497 [00:42<11:50, 1.99it/s]epoch: 0 loss: 0.0859058 f1: 0.0000000: 6%|▌ | 84/1497 [00:42<11:50, 1.99it/s]epoch: 0 loss: 0.0859058 f1: 0.0000000: 6%|▌ | 85/1497 [00:42<11:50, 1.99it/s]epoch: 0 loss: 0.1886652 f1: 0.0000000: 6%|▌ | 85/1497 [00:43<11:50, 1.99it/s]epoch: 0 loss: 0.1886652 f1: 0.0000000: 6%|▌ | 86/1497 [00:43<11:57, 1.97it/s]epoch: 0 loss: 0.1610538 f1: 0.0000000: 6%|▌ | 86/1497 [00:43<11:57, 1.97it/s]epoch: 0 loss: 0.1610538 f1: 0.0000000: 6%|▌ | 87/1497 [00:43<11:57, 1.96it/s]epoch: 0 loss: 0.1522547 f1: 0.0000000: 6%|▌ | 87/1497 [00:44<11:57, 1.96it/s]epoch: 0 loss: 0.1522547 f1: 0.0000000: 6%|▌ | 88/1497 [00:44<11:56, 1.97it/s]epoch: 0 loss: 0.1284843 f1: 0.0000000: 6%|▌ | 88/1497 [00:44<11:56, 1.97it/s]epoch: 0 loss: 0.1284843 f1: 0.0000000: 6%|▌ | 89/1497 [00:44<11:52, 1.97it/s]epoch: 0 loss: 0.0697888 f1: 0.0000000: 6%|▌ | 89/1497 [00:45<11:52, 1.97it/s]epoch: 0 loss: 0.0697888 f1: 0.0000000: 
6%|▌ | 90/1497 [00:45<11:50, 1.98it/s]epoch: 0 loss: 0.1119738 f1: 0.0000000: 6%|▌ | 90/1497 [00:45<11:50, 1.98it/s]epoch: 0 loss: 0.1119738 f1: 0.0000000: 6%|▌ | 91/1497 [00:45<11:48, 1.98it/s]epoch: 0 loss: 0.0800475 f1: 0.0000000: 6%|▌ | 91/1497 [00:46<11:48, 1.98it/s]epoch: 0 loss: 0.0800475 f1: 0.0000000: 6%|▌ | 92/1497 [00:46<11:48, 1.98it/s]epoch: 0 loss: 0.2014199 f1: 0.0000000: 6%|▌ | 92/1497 [00:46<11:48, 1.98it/s]epoch: 0 loss: 0.2014199 f1: 0.0000000: 6%|▌ | 93/1497 [00:46<12:02, 1.94it/s]epoch: 0 loss: 0.1145555 f1: 0.0000000: 6%|▌ | 93/1497 [00:47<12:02, 1.94it/s]epoch: 0 loss: 0.1145555 f1: 0.0000000: 6%|▋ | 94/1497 [00:47<11:59, 1.95it/s]epoch: 0 loss: 0.2430634 f1: 0.0000000: 6%|▋ | 94/1497 [00:48<11:59, 1.95it/s]epoch: 0 loss: 0.2430634 f1: 0.0000000: 6%|▋ | 95/1497 [00:48<11:55, 1.96it/s]epoch: 0 loss: 0.1819540 f1: 0.0000000: 6%|▋ | 95/1497 [00:48<11:55, 1.96it/s]epoch: 0 loss: 0.1819540 f1: 0.0000000: 6%|▋ | 96/1497 [00:48<11:50, 1.97it/s]epoch: 0 loss: 0.1548341 f1: 0.0000000: 6%|▋ | 96/1497 [00:49<11:50, 1.97it/s]epoch: 0 loss: 0.1548341 f1: 0.0000000: 6%|▋ | 97/1497 [00:49<11:47, 1.98it/s]epoch: 0 loss: 0.1339061 f1: 0.0000000: 6%|▋ | 97/1497 [00:49<11:47, 1.98it/s]epoch: 0 loss: 0.1339061 f1: 0.0000000: 7%|▋ | 98/1497 [00:49<11:44, 1.99it/s]epoch: 0 loss: 0.2119035 f1: 0.0000000: 7%|▋ | 98/1497 [00:50<11:44, 1.99it/s]epoch: 0 loss: 0.2119035 f1: 0.0000000: 7%|▋ | 99/1497 [00:50<11:44, 1.98it/s]epoch: 0 loss: 0.2477578 f1: 0.0000000: 7%|▋ | 99/1497 [00:50<11:44, 1.98it/s]epoch: 0 loss: 0.2477578 f1: 0.0000000: 7%|▋ | 100/1497 [00:50<11:41, 1.99it/s]epoch: 0 loss: 0.0791997 f1: 0.0000000: 7%|▋ | 100/1497 [00:51<11:41, 1.99it/s]epoch: 0 loss: 0.0791997 f1: 0.0000000: 7%|▋ | 101/1497 [00:51<11:41, 1.99it/s]epoch: 0 loss: 0.0629635 f1: 0.0000000: 7%|▋ | 101/1497 [00:51<11:41, 1.99it/s]epoch: 0 loss: 0.0629635 f1: 0.0000000: 7%|▋ | 102/1497 [00:51<11:43, 1.98it/s]epoch: 0 loss: 0.0859300 f1: 0.0000000: 7%|▋ | 102/1497 [00:52<11:43, 
1.98it/s]epoch: 0 loss: 0.0859300 f1: 0.0000000: 7%|▋ | 103/1497 [00:52<11:42, 1.98it/s]epoch: 0 loss: 0.2293633 f1: 0.0000000: 7%|▋ | 103/1497 [00:52<11:42, 1.98it/s]epoch: 0 loss: 0.2293633 f1: 0.0000000: 7%|▋ | 104/1497 [00:52<11:39, 1.99it/s]epoch: 0 loss: 0.1247713 f1: 0.0000000: 7%|▋ | 104/1497 [00:53<11:39, 1.99it/s]epoch: 0 loss: 0.1247713 f1: 0.0000000: 7%|▋ | 105/1497 [00:53<11:40, 1.99it/s]epoch: 0 loss: 0.0905276 f1: 0.0000000: 7%|▋ | 105/1497 [00:53<11:40, 1.99it/s]epoch: 0 loss: 0.0905276 f1: 0.0000000: 7%|▋ | 106/1497 [00:53<11:39, 1.99it/s]epoch: 0 loss: 0.1554017 f1: 0.0000000: 7%|▋ | 106/1497 [00:54<11:39, 1.99it/s]epoch: 0 loss: 0.1554017 f1: 0.0000000: 7%|▋ | 107/1497 [00:54<11:38, 1.99it/s]epoch: 0 loss: 0.0721851 f1: 0.0000000: 7%|▋ | 107/1497 [00:54<11:38, 1.99it/s]epoch: 0 loss: 0.0721851 f1: 0.0000000: 7%|▋ | 108/1497 [00:54<11:36, 1.99it/s]epoch: 0 loss: 0.1225980 f1: 0.0000000: 7%|▋ | 108/1497 [00:55<11:36, 1.99it/s]epoch: 0 loss: 0.1225980 f1: 0.0000000: 7%|▋ | 109/1497 [00:55<11:35, 2.00it/s]epoch: 0 loss: 0.2256702 f1: 0.0000000: 7%|▋ | 109/1497 [00:55<11:35, 2.00it/s]epoch: 0 loss: 0.2256702 f1: 0.0000000: 7%|▋ | 110/1497 [00:55<11:33, 2.00it/s]epoch: 0 loss: 0.2021801 f1: 0.0000000: 7%|▋ | 110/1497 [00:56<11:33, 2.00it/s]epoch: 0 loss: 0.2021801 f1: 0.0000000: 7%|▋ | 111/1497 [00:56<11:36, 1.99it/s]epoch: 0 loss: 0.0476796 f1: 0.0000000: 7%|▋ | 111/1497 [00:56<11:36, 1.99it/s]epoch: 0 loss: 0.0476796 f1: 0.0000000: 7%|▋ | 112/1497 [00:56<11:36, 1.99it/s]epoch: 0 loss: 0.0573719 f1: 0.0000000: 7%|▋ | 112/1497 [00:57<11:36, 1.99it/s]epoch: 0 loss: 0.0573719 f1: 0.0000000: 8%|▊ | 113/1497 [00:57<11:37, 1.99it/s]epoch: 0 loss: 0.1704011 f1: 0.0000000: 8%|▊ | 113/1497 [00:57<11:37, 1.99it/s]epoch: 0 loss: 0.1704011 f1: 0.0000000: 8%|▊ | 114/1497 [00:57<11:38, 1.98it/s]epoch: 0 loss: 0.1499180 f1: 0.0000000: 8%|▊ | 114/1497 [00:58<11:38, 1.98it/s]epoch: 0 loss: 0.1499180 f1: 0.0000000: 8%|▊ | 115/1497 [00:58<11:36, 1.98it/s]epoch: 0 loss: 
0.0478954 f1: 0.0000000: 8%|▊ | 115/1497 [00:58<11:36, 1.98it/s]epoch: 0 loss: 0.0478954 f1: 0.0000000: 8%|▊ | 116/1497 [00:58<11:36, 1.98it/s]epoch: 0 loss: 0.2235315 f1: 0.0000000: 8%|▊ | 116/1497 [00:59<11:36, 1.98it/s]epoch: 0 loss: 0.2235315 f1: 0.0000000: 8%|▊ | 117/1497 [00:59<11:35, 1.98it/s]epoch: 0 loss: 0.1281027 f1: 0.0000000: 8%|▊ | 117/1497 [00:59<11:35, 1.98it/s]epoch: 0 loss: 0.1281027 f1: 0.0000000: 8%|▊ | 118/1497 [00:59<11:35, 1.98it/s]epoch: 0 loss: 0.0777110 f1: 0.0000000: 8%|▊ | 118/1497 [01:00<11:35, 1.98it/s]epoch: 0 loss: 0.0777110 f1: 0.0000000: 8%|▊ | 119/1497 [01:00<11:35, 1.98it/s]epoch: 0 loss: 0.0471477 f1: 0.0000000: 8%|▊ | 119/1497 [01:00<11:35, 1.98it/s]epoch: 0 loss: 0.0471477 f1: 0.0000000: 8%|▊ | 120/1497 [01:00<11:33, 1.98it/s]epoch: 0 loss: 0.2142379 f1: 0.0000000: 8%|▊ | 120/1497 [01:01<11:33, 1.98it/s]epoch: 0 loss: 0.2142379 f1: 0.0000000: 8%|▊ | 121/1497 [01:01<11:35, 1.98it/s]epoch: 0 loss: 0.0863403 f1: 0.0000000: 8%|▊ | 121/1497 [01:01<11:35, 1.98it/s]epoch: 0 loss: 0.0863403 f1: 0.0000000: 8%|▊ | 122/1497 [01:01<11:36, 1.98it/s]epoch: 0 loss: 0.1894515 f1: 0.0000000: 8%|▊ | 122/1497 [01:02<11:36, 1.98it/s]epoch: 0 loss: 0.1894515 f1: 0.0000000: 8%|▊ | 123/1497 [01:02<11:39, 1.97it/s]epoch: 0 loss: 0.2073819 f1: 0.0000000: 8%|▊ | 123/1497 [01:02<11:39, 1.97it/s]epoch: 0 loss: 0.2073819 f1: 0.0000000: 8%|▊ | 124/1497 [01:02<11:35, 1.97it/s]epoch: 0 loss: 0.0587228 f1: 0.0000000: 8%|▊ | 124/1497 [01:03<11:35, 1.97it/s]epoch: 0 loss: 0.0587228 f1: 0.0000000: 8%|▊ | 125/1497 [01:03<11:34, 1.98it/s]epoch: 0 loss: 0.1694562 f1: 0.0000000: 8%|▊ | 125/1497 [01:03<11:34, 1.98it/s]epoch: 0 loss: 0.1694562 f1: 0.0000000: 8%|▊ | 126/1497 [01:03<11:34, 1.98it/s]epoch: 0 loss: 0.1264699 f1: 0.0000000: 8%|▊ | 126/1497 [01:04<11:34, 1.98it/s]epoch: 0 loss: 0.1264699 f1: 0.0000000: 8%|▊ | 127/1497 [01:04<11:32, 1.98it/s]epoch: 0 loss: 0.0959699 f1: 0.0000000: 8%|▊ | 127/1497 [01:04<11:32, 1.98it/s]epoch: 0 loss: 0.0959699 f1: 0.0000000: 
9%|▊ | 128/1497 [01:04<11:30, 1.98it/s]epoch: 0 loss: 0.1024669 f1: 0.0000000: 9%|▊ | 128/1497 [01:05<11:30, 1.98it/s]epoch: 0 loss: 0.1024669 f1: 0.0000000: 9%|▊ | 129/1497 [01:05<11:28, 1.99it/s]epoch: 0 loss: 0.3194963 f1: 0.0000000: 9%|▊ | 129/1497 [01:05<11:28, 1.99it/s]epoch: 0 loss: 0.3194963 f1: 0.0000000: 9%|▊ | 130/1497 [01:05<11:26, 1.99it/s]epoch: 0 loss: 0.0796435 f1: 0.0000000: 9%|▊ | 130/1497 [01:06<11:26, 1.99it/s]epoch: 0 loss: 0.0796435 f1: 0.0000000: 9%|▉ | 131/1497 [01:06<11:25, 1.99it/s]epoch: 0 loss: 0.1316320 f1: 0.0000000: 9%|▉ | 131/1497 [01:06<11:25, 1.99it/s]epoch: 0 loss: 0.1316320 f1: 0.0000000: 9%|▉ | 132/1497 [01:06<11:25, 1.99it/s]epoch: 0 loss: 0.3779401 f1: 0.0000000: 9%|▉ | 132/1497 [01:07<11:25, 1.99it/s]epoch: 0 loss: 0.3779401 f1: 0.0000000: 9%|▉ | 133/1497 [01:07<11:24, 1.99it/s]epoch: 0 loss: 0.1459579 f1: 0.0000000: 9%|▉ | 133/1497 [01:07<11:24, 1.99it/s]epoch: 0 loss: 0.1459579 f1: 0.0000000: 9%|▉ | 134/1497 [01:07<11:29, 1.98it/s]epoch: 0 loss: 0.1704030 f1: 0.0000000: 9%|▉ | 134/1497 [01:08<11:29, 1.98it/s]epoch: 0 loss: 0.1704030 f1: 0.0000000: 9%|▉ | 135/1497 [01:08<11:31, 1.97it/s]epoch: 0 loss: 0.0851787 f1: 0.0000000: 9%|▉ | 135/1497 [01:08<11:31, 1.97it/s]epoch: 0 loss: 0.0851787 f1: 0.0000000: 9%|▉ | 136/1497 [01:08<11:29, 1.98it/s]epoch: 0 loss: 0.1324430 f1: 0.0000000: 9%|▉ | 136/1497 [01:09<11:29, 1.98it/s]epoch: 0 loss: 0.1324430 f1: 0.0000000: 9%|▉ | 137/1497 [01:09<11:27, 1.98it/s]epoch: 0 loss: 0.1505529 f1: 0.0000000: 9%|▉ | 137/1497 [01:09<11:27, 1.98it/s]epoch: 0 loss: 0.1505529 f1: 0.0000000: 9%|▉ | 138/1497 [01:09<11:24, 1.99it/s]epoch: 0 loss: 0.1167661 f1: 0.0000000: 9%|▉ | 138/1497 [01:10<11:24, 1.99it/s]epoch: 0 loss: 0.1167661 f1: 0.0000000: 9%|▉ | 139/1497 [01:10<11:24, 1.98it/s]epoch: 0 loss: 0.1207244 f1: 0.0000000: 9%|▉ | 139/1497 [01:10<11:24, 1.98it/s]epoch: 0 loss: 0.1207244 f1: 0.0000000: 9%|▉ | 140/1497 [01:10<11:24, 1.98it/s]epoch: 0 loss: 0.0922387 f1: 0.0000000: 9%|▉ | 140/1497 
[01:11<11:24, 1.98it/s]epoch: 0 loss: 0.0922387 f1: 0.0000000: 9%|▉ | 141/1497 [01:11<11:22, 1.99it/s]epoch: 0 loss: 0.1209432 f1: 0.0000000: 9%|▉ | 141/1497 [01:11<11:22, 1.99it/s]epoch: 0 loss: 0.1209432 f1: 0.0000000: 9%|▉ | 142/1497 [01:11<11:20, 1.99it/s]epoch: 0 loss: 0.0723279 f1: 0.0000000: 9%|▉ | 142/1497 [01:12<11:20, 1.99it/s]epoch: 0 loss: 0.0723279 f1: 0.0000000: 10%|▉ | 143/1497 [01:12<11:21, 1.99it/s]epoch: 0 loss: 0.1982922 f1: 0.0000000: 10%|▉ | 143/1497 [01:12<11:21, 1.99it/s]epoch: 0 loss: 0.1982922 f1: 0.0000000: 10%|▉ | 144/1497 [01:12<11:18, 1.99it/s]epoch: 0 loss: 0.3324464 f1: 0.0000000: 10%|▉ | 144/1497 [01:13<11:18, 1.99it/s]epoch: 0 loss: 0.3324464 f1: 0.0000000: 10%|▉ | 145/1497 [01:13<11:18, 1.99it/s]epoch: 0 loss: 0.2933735 f1: 0.0000000: 10%|▉ | 145/1497 [01:13<11:18, 1.99it/s]epoch: 0 loss: 0.2933735 f1: 0.0000000: 10%|▉ | 146/1497 [01:13<11:19, 1.99it/s]epoch: 0 loss: 0.1826224 f1: 0.0000000: 10%|▉ | 146/1497 [01:14<11:19, 1.99it/s]epoch: 0 loss: 0.1826224 f1: 0.0000000: 10%|▉ | 147/1497 [01:14<11:19, 1.99it/s]epoch: 0 loss: 0.1836676 f1: 0.0000000: 10%|▉ | 147/1497 [01:14<11:19, 1.99it/s]epoch: 0 loss: 0.1836676 f1: 0.0000000: 10%|▉ | 148/1497 [01:14<11:20, 1.98it/s]epoch: 0 loss: 0.1445538 f1: 0.0000000: 10%|▉ | 148/1497 [01:15<11:20, 1.98it/s]epoch: 0 loss: 0.1445538 f1: 0.0000000: 10%|▉ | 149/1497 [01:15<11:18, 1.99it/s]epoch: 0 loss: 0.0677355 f1: 0.0000000: 10%|▉ | 149/1497 [01:15<11:18, 1.99it/s]epoch: 0 loss: 0.0677355 f1: 0.0000000: 10%|█ | 150/1497 [01:15<11:10, 2.01it/s]epoch: 0 loss: 0.0932870 f1: 0.0000000: 10%|█ | 150/1497 [01:16<11:10, 2.01it/s]epoch: 0 loss: 0.0932870 f1: 0.0000000: 10%|█ | 151/1497 [01:16<11:09, 2.01it/s]epoch: 0 loss: 0.1047660 f1: 0.0000000: 10%|█ | 151/1497 [01:16<11:09, 2.01it/s]epoch: 0 loss: 0.1047660 f1: 0.0000000: 10%|█ | 152/1497 [01:16<11:04, 2.02it/s]epoch: 0 loss: 0.0674286 f1: 0.0000000: 10%|█ | 152/1497 [01:17<11:04, 2.02it/s]epoch: 0 loss: 0.0674286 f1: 0.0000000: 10%|█ | 153/1497 
[01:17<11:03, 2.03it/s]epoch: 0 loss: 0.1734456 f1: 0.0000000: 10%|█ | 153/1497 [01:17<11:03, 2.03it/s]epoch: 0 loss: 0.1734456 f1: 0.0000000: 10%|█ | 154/1497 [01:17<11:02, 2.03it/s]epoch: 0 loss: 0.1001964 f1: 0.0000000: 10%|█ | 154/1497 [01:18<11:02, 2.03it/s]epoch: 0 loss: 0.1001964 f1: 0.0000000: 10%|█ | 155/1497 [01:18<11:02, 2.03it/s]epoch: 0 loss: 0.1092759 f1: 0.0000000: 10%|█ | 155/1497 [01:18<11:02, 2.03it/s]epoch: 0 loss: 0.1092759 f1: 0.0000000: 10%|█ | 156/1497 [01:18<11:03, 2.02it/s]epoch: 0 loss: 0.0804516 f1: 0.0000000: 10%|█ | 156/1497 [01:19<11:03, 2.02it/s]epoch: 0 loss: 0.0804516 f1: 0.0000000: 10%|█ | 157/1497 [01:19<10:58, 2.03it/s]epoch: 0 loss: 0.1247776 f1: 0.0000000: 10%|█ | 157/1497 [01:19<10:58, 2.03it/s]epoch: 0 loss: 0.1247776 f1: 0.0000000: 11%|█ | 158/1497 [01:19<11:02, 2.02it/s]epoch: 0 loss: 0.1437990 f1: 0.0000000: 11%|█ | 158/1497 [01:20<11:02, 2.02it/s]epoch: 0 loss: 0.1437990 f1: 0.0000000: 11%|█ | 159/1497 [01:20<11:03, 2.02it/s]epoch: 0 loss: 0.0642896 f1: 0.0000000: 11%|█ | 159/1497 [01:20<11:03, 2.02it/s]epoch: 0 loss: 0.0642896 f1: 0.0000000: 11%|█ | 160/1497 [01:20<11:02, 2.02it/s]epoch: 0 loss: 0.0682605 f1: 0.0000000: 11%|█ | 160/1497 [01:21<11:02, 2.02it/s]epoch: 0 loss: 0.0682605 f1: 0.0000000: 11%|█ | 161/1497 [01:21<11:04, 2.01it/s]epoch: 0 loss: 0.2294874 f1: 0.0000000: 11%|█ | 161/1497 [01:21<11:04, 2.01it/s]epoch: 0 loss: 0.2294874 f1: 0.0000000: 11%|█ | 162/1497 [01:21<11:05, 2.01it/s]epoch: 0 loss: 0.2740670 f1: 0.0000000: 11%|█ | 162/1497 [01:22<11:05, 2.01it/s]epoch: 0 loss: 0.2740670 f1: 0.0000000: 11%|█ | 163/1497 [01:22<11:08, 2.00it/s]epoch: 0 loss: 0.1935652 f1: 0.0000000: 11%|█ | 163/1497 [01:22<11:08, 2.00it/s]epoch: 0 loss: 0.1935652 f1: 0.0000000: 11%|█ | 164/1497 [01:22<11:10, 1.99it/s]epoch: 0 loss: 0.1384450 f1: 0.0000000: 11%|█ | 164/1497 [01:23<11:10, 1.99it/s]epoch: 0 loss: 0.1384450 f1: 0.0000000: 11%|█ | 165/1497 [01:23<11:09, 1.99it/s]epoch: 0 loss: 0.1401762 f1: 0.0000000: 11%|█ | 165/1497 
[01:23<11:09, 1.99it/s]epoch: 0 loss: 0.1401762 f1: 0.0000000: 11%|█ | 166/1497 [01:23<11:06, 2.00it/s]epoch: 0 loss: 0.1807749 f1: 0.0000000: 11%|█ | 166/1497 [01:24<11:06, 2.00it/s]epoch: 0 loss: 0.1807749 f1: 0.0000000: 11%|█ | 167/1497 [01:24<11:05, 2.00it/s]epoch: 0 loss: 0.1407316 f1: 0.0000000: 11%|█ | 167/1497 [01:24<11:05, 2.00it/s]epoch: 0 loss: 0.1407316 f1: 0.0000000: 11%|█ | 168/1497 [01:24<11:02, 2.01it/s]epoch: 0 loss: 0.1278411 f1: 0.0000000: 11%|█ | 168/1497 [01:25<11:02, 2.01it/s]epoch: 0 loss: 0.1278411 f1: 0.0000000: 11%|█▏ | 169/1497 [01:25<11:02, 2.00it/s]epoch: 0 loss: 0.0363018 f1: 0.0000000: 11%|█▏ | 169/1497 [01:25<11:02, 2.00it/s]epoch: 0 loss: 0.0363018 f1: 0.0000000: 11%|█▏ | 170/1497 [01:25<10:59, 2.01it/s]epoch: 0 loss: 0.0606760 f1: 0.0000000: 11%|█▏ | 170/1497 [01:26<10:59, 2.01it/s]epoch: 0 loss: 0.0606760 f1: 0.0000000: 11%|█▏ | 171/1497 [01:26<11:03, 2.00it/s]epoch: 0 loss: 0.0703732 f1: 0.0000000: 11%|█▏ | 171/1497 [01:26<11:03, 2.00it/s]epoch: 0 loss: 0.0703732 f1: 0.0000000: 11%|█▏ | 172/1497 [01:26<10:59, 2.01it/s]epoch: 0 loss: 0.0554084 f1: 0.0000000: 11%|█▏ | 172/1497 [01:27<10:59, 2.01it/s]epoch: 0 loss: 0.0554084 f1: 0.0000000: 12%|█▏ | 173/1497 [01:27<11:01, 2.00it/s]epoch: 0 loss: 0.0640387 f1: 0.0000000: 12%|█▏ | 173/1497 [01:27<11:01, 2.00it/s]epoch: 0 loss: 0.0640387 f1: 0.0000000: 12%|█▏ | 174/1497 [01:27<11:01, 2.00it/s]epoch: 0 loss: 0.2030228 f1: 0.0000000: 12%|█▏ | 174/1497 [01:28<11:01, 2.00it/s]epoch: 0 loss: 0.2030228 f1: 0.0000000: 12%|█▏ | 175/1497 [01:28<11:08, 1.98it/s]epoch: 0 loss: 0.2090143 f1: 0.0000000: 12%|█▏ | 175/1497 [01:28<11:08, 1.98it/s]epoch: 0 loss: 0.2090143 f1: 0.0000000: 12%|█▏ | 176/1497 [01:28<11:05, 1.98it/s]epoch: 0 loss: 0.1050888 f1: 0.0000000: 12%|█▏ | 176/1497 [01:29<11:05, 1.98it/s]epoch: 0 loss: 0.1050888 f1: 0.0000000: 12%|█▏ | 177/1497 [01:29<11:07, 1.98it/s]epoch: 0 loss: 0.1481480 f1: 0.0000000: 12%|█▏ | 177/1497 [01:29<11:07, 1.98it/s]epoch: 0 loss: 0.1481480 f1: 
0.0000000: 12%|█▏ | 178/1497 [01:29<11:05, 1.98it/s]epoch: 0 loss: 0.2182374 f1: 0.0000000: 12%|█▏ | 178/1497 [01:30<11:05, 1.98it/s]epoch: 0 loss: 0.2182374 f1: 0.0000000: 12%|█▏ | 179/1497 [01:30<11:02, 1.99it/s]epoch: 0 loss: 0.1006052 f1: 0.0000000: 12%|█▏ | 179/1497 [01:30<11:02, 1.99it/s]epoch: 0 loss: 0.1006052 f1: 0.0000000: 12%|█▏ | 180/1497 [01:30<10:55, 2.01it/s]epoch: 0 loss: 0.2064305 f1: 0.0000000: 12%|█▏ | 180/1497 [01:31<10:55, 2.01it/s]epoch: 0 loss: 0.2064305 f1: 0.0000000: 12%|█▏ | 181/1497 [01:31<10:54, 2.01it/s]epoch: 0 loss: 0.1624128 f1: 0.0000000: 12%|█▏ | 181/1497 [01:31<10:54, 2.01it/s]epoch: 0 loss: 0.1624128 f1: 0.0000000: 12%|█▏ | 182/1497 [01:31<10:53, 2.01it/s]epoch: 0 loss: 0.1224984 f1: 0.0000000: 12%|█▏ | 182/1497 [01:32<10:53, 2.01it/s]epoch: 0 loss: 0.1224984 f1: 0.0000000: 12%|█▏ | 183/1497 [01:32<10:56, 2.00it/s]epoch: 0 loss: 0.0445238 f1: 0.0000000: 12%|█▏ | 183/1497 [01:32<10:56, 2.00it/s]epoch: 0 loss: 0.0445238 f1: 0.0000000: 12%|█▏ | 184/1497 [01:32<10:55, 2.00it/s]epoch: 0 loss: 0.1207968 f1: 0.0000000: 12%|█▏ | 184/1497 [01:33<10:55, 2.00it/s]epoch: 0 loss: 0.1207968 f1: 0.0000000: 12%|█▏ | 185/1497 [01:33<10:55, 2.00it/s]epoch: 0 loss: 0.2137541 f1: 0.0000000: 12%|█▏ | 185/1497 [01:33<10:55, 2.00it/s]epoch: 0 loss: 0.2137541 f1: 0.0000000: 12%|█▏ | 186/1497 [01:33<10:54, 2.00it/s]epoch: 0 loss: 0.0913142 f1: 0.0000000: 12%|█▏ | 186/1497 [01:34<10:54, 2.00it/s]epoch: 0 loss: 0.0913142 f1: 0.0000000: 12%|█▏ | 187/1497 [01:34<10:53, 2.01it/s]epoch: 0 loss: 0.1171499 f1: 0.0000000: 12%|█▏ | 187/1497 [01:34<10:53, 2.01it/s]epoch: 0 loss: 0.1171499 f1: 0.0000000: 13%|█▎ | 188/1497 [01:34<10:53, 2.00it/s]epoch: 0 loss: 0.1815142 f1: 0.0000000: 13%|█▎ | 188/1497 [01:35<10:53, 2.00it/s]epoch: 0 loss: 0.1815142 f1: 0.0000000: 13%|█▎ | 189/1497 [01:35<10:51, 2.01it/s]epoch: 0 loss: 0.1723540 f1: 0.0000000: 13%|█▎ | 189/1497 [01:35<10:51, 2.01it/s]epoch: 0 loss: 0.1723540 f1: 0.0000000: 13%|█▎ | 190/1497 [01:35<10:50, 
2.01it/s]epoch: 0 loss: 0.1793617 f1: 0.0000000: 13%|█▎ | 190/1497 [01:36<10:50, 2.01it/s]epoch: 0 loss: 0.1793617 f1: 0.0000000: 13%|█▎ | 191/1497 [01:36<10:50, 2.01it/s]epoch: 0 loss: 0.1111070 f1: 0.0000000: 13%|█▎ | 191/1497 [01:36<10:50, 2.01it/s]epoch: 0 loss: 0.1111070 f1: 0.0000000: 13%|█▎ | 192/1497 [01:36<10:51, 2.00it/s]epoch: 0 loss: 0.1207217 f1: 0.0000000: 13%|█▎ | 192/1497 [01:37<10:51, 2.00it/s]epoch: 0 loss: 0.1207217 f1: 0.0000000: 13%|█▎ | 193/1497 [01:37<10:52, 2.00it/s]epoch: 0 loss: 0.1325953 f1: 0.0000000: 13%|█▎ | 193/1497 [01:37<10:52, 2.00it/s]epoch: 0 loss: 0.1325953 f1: 0.0000000: 13%|█▎ | 194/1497 [01:37<10:50, 2.00it/s]epoch: 0 loss: 0.1390782 f1: 0.0000000: 13%|█▎ | 194/1497 [01:38<10:50, 2.00it/s]epoch: 0 loss: 0.1390782 f1: 0.0000000: 13%|█▎ | 195/1497 [01:38<10:46, 2.01it/s]epoch: 0 loss: 0.0751760 f1: 0.0000000: 13%|█▎ | 195/1497 [01:38<10:46, 2.01it/s]epoch: 0 loss: 0.0751760 f1: 0.0000000: 13%|█▎ | 196/1497 [01:38<10:42, 2.02it/s]epoch: 0 loss: 0.2235571 f1: 0.0000000: 13%|█▎ | 196/1497 [01:39<10:42, 2.02it/s]epoch: 0 loss: 0.2235571 f1: 0.0000000: 13%|█▎ | 197/1497 [01:39<10:42, 2.02it/s]epoch: 0 loss: 0.1587195 f1: 0.0000000: 13%|█▎ | 197/1497 [01:39<10:42, 2.02it/s]epoch: 0 loss: 0.1587195 f1: 0.0000000: 13%|█▎ | 198/1497 [01:39<10:41, 2.03it/s]epoch: 0 loss: 0.1238014 f1: 0.0000000: 13%|█▎ | 198/1497 [01:40<10:41, 2.03it/s]epoch: 0 loss: 0.1238014 f1: 0.0000000: 13%|█▎ | 199/1497 [01:40<10:38, 2.03it/s]epoch: 0 loss: 0.2333678 f1: 0.0000000: 13%|█▎ | 199/1497 [01:40<10:38, 2.03it/s]epoch: 0 loss: 0.2333678 f1: 0.0000000: 13%|█▎ | 200/1497 [01:40<10:36, 2.04it/s]epoch: 0 loss: 0.0542340 f1: 0.0000000: 13%|█▎ | 200/1497 [01:41<10:36, 2.04it/s]epoch: 0 loss: 0.0542340 f1: 0.0000000: 13%|█▎ | 201/1497 [01:41<10:40, 2.02it/s]epoch: 0 loss: 0.2027061 f1: 0.0000000: 13%|█▎ | 201/1497 [01:41<10:40, 2.02it/s]epoch: 0 loss: 0.2027061 f1: 0.0000000: 13%|█▎ | 202/1497 [01:41<10:39, 2.03it/s]epoch: 0 loss: 0.0748738 f1: 0.0000000: 13%|█▎ 
| 202/1497 [01:42<10:39, 2.03it/s]epoch: 0 loss: 0.0748738 f1: 0.0000000: 14%|█▎ | 203/1497 [01:42<10:38, 2.03it/s]epoch: 0 loss: 0.1436922 f1: 0.0000000: 14%|█▎ | 203/1497 [01:42<10:38, 2.03it/s]epoch: 0 loss: 0.1436922 f1: 0.0000000: 14%|█▎ | 204/1497 [01:42<10:36, 2.03it/s]epoch: 0 loss: 0.1364989 f1: 0.0000000: 14%|█▎ | 204/1497 [01:43<10:36, 2.03it/s]epoch: 0 loss: 0.1364989 f1: 0.0000000: 14%|█▎ | 205/1497 [01:43<10:37, 2.03it/s]epoch: 0 loss: 0.1556441 f1: 0.0000000: 14%|█▎ | 205/1497 [01:43<10:37, 2.03it/s]epoch: 0 loss: 0.1556441 f1: 0.0000000: 14%|█▍ | 206/1497 [01:43<10:40, 2.01it/s]epoch: 0 loss: 0.2226967 f1: 0.0000000: 14%|█▍ | 206/1497 [01:44<10:40, 2.01it/s]epoch: 0 loss: 0.2226967 f1: 0.0000000: 14%|█▍ | 207/1497 [01:44<10:38, 2.02it/s]epoch: 0 loss: 0.0745670 f1: 0.0000000: 14%|█▍ | 207/1497 [01:44<10:38, 2.02it/s]epoch: 0 loss: 0.0745670 f1: 0.0000000: 14%|█▍ | 208/1497 [01:44<10:38, 2.02it/s]epoch: 0 loss: 0.1970018 f1: 0.0000000: 14%|█▍ | 208/1497 [01:45<10:38, 2.02it/s]epoch: 0 loss: 0.1970018 f1: 0.0000000: 14%|█▍ | 209/1497 [01:45<10:32, 2.04it/s]epoch: 0 loss: 0.2273294 f1: 0.0000000: 14%|█▍ | 209/1497 [01:45<10:32, 2.04it/s]epoch: 0 loss: 0.2273294 f1: 0.0000000: 14%|█▍ | 210/1497 [01:45<10:29, 2.04it/s]epoch: 0 loss: 0.0781054 f1: 0.0000000: 14%|█▍ | 210/1497 [01:45<10:29, 2.04it/s]epoch: 0 loss: 0.0781054 f1: 0.0000000: 14%|█▍ | 211/1497 [01:45<10:29, 2.04it/s]epoch: 0 loss: 0.1220121 f1: 0.0000000: 14%|█▍ | 211/1497 [01:46<10:29, 2.04it/s]epoch: 0 loss: 0.1220121 f1: 0.0000000: 14%|█▍ | 212/1497 [01:46<10:28, 2.05it/s]epoch: 0 loss: 0.1602774 f1: 0.0000000: 14%|█▍ | 212/1497 [01:46<10:28, 2.05it/s]epoch: 0 loss: 0.1602774 f1: 0.0000000: 14%|█▍ | 213/1497 [01:46<10:31, 2.03it/s]epoch: 0 loss: 0.1358947 f1: 0.0000000: 14%|█▍ | 213/1497 [01:47<10:31, 2.03it/s]epoch: 0 loss: 0.1358947 f1: 0.0000000: 14%|█▍ | 214/1497 [01:47<10:38, 2.01it/s]epoch: 0 loss: 0.1252152 f1: 0.0000000: 14%|█▍ | 214/1497 [01:47<10:38, 2.01it/s]epoch: 0 loss: 
0.1252152 f1: 0.0000000: 14%|█▍ | 215/1497 [01:47<10:35, 2.02it/s]epoch: 0 loss: 0.1599118 f1: 0.0000000: 14%|█▍ | 215/1497 [01:48<10:35, 2.02it/s]epoch: 0 loss: 0.1599118 f1: 0.0000000: 14%|█▍ | 216/1497 [01:48<10:47, 1.98it/s]epoch: 0 loss: 0.0549209 f1: 0.0000000: 14%|█▍ | 216/1497 [01:49<10:47, 1.98it/s]epoch: 0 loss: 0.0549209 f1: 0.0000000: 14%|█▍ | 217/1497 [01:49<10:52, 1.96it/s]epoch: 0 loss: 0.1409313 f1: 0.0000000: 14%|█▍ | 217/1497 [01:49<10:52, 1.96it/s]epoch: 0 loss: 0.1409313 f1: 0.0000000: 15%|█▍ | 218/1497 [01:49<10:46, 1.98it/s]epoch: 0 loss: 0.0721054 f1: 0.0000000: 15%|█▍ | 218/1497 [01:50<10:46, 1.98it/s]epoch: 0 loss: 0.0721054 f1: 0.0000000: 15%|█▍ | 219/1497 [01:50<10:39, 2.00it/s]epoch: 0 loss: 0.1175719 f1: 0.0000000: 15%|█▍ | 219/1497 [01:50<10:39, 2.00it/s]epoch: 0 loss: 0.1175719 f1: 0.0000000: 15%|█▍ | 220/1497 [01:50<10:37, 2.00it/s]epoch: 0 loss: 0.0565142 f1: 0.0000000: 15%|█▍ | 220/1497 [01:51<10:37, 2.00it/s]epoch: 0 loss: 0.0565142 f1: 0.0000000: 15%|█▍ | 221/1497 [01:51<10:32, 2.02it/s]epoch: 0 loss: 0.0696474 f1: 0.0000000: 15%|█▍ | 221/1497 [01:51<10:32, 2.02it/s]epoch: 0 loss: 0.0696474 f1: 0.0000000: 15%|█▍ | 222/1497 [01:51<10:26, 2.04it/s]epoch: 0 loss: 0.2201471 f1: 0.0000000: 15%|█▍ | 222/1497 [01:51<10:26, 2.04it/s]epoch: 0 loss: 0.2201471 f1: 0.0000000: 15%|█▍ | 223/1497 [01:51<10:28, 2.03it/s]epoch: 0 loss: 0.0548607 f1: 0.0000000: 15%|█▍ | 223/1497 [01:52<10:28, 2.03it/s]epoch: 0 loss: 0.0548607 f1: 0.0000000: 15%|█▍ | 224/1497 [01:52<10:28, 2.03it/s]epoch: 0 loss: 0.2170752 f1: 0.0000000: 15%|█▍ | 224/1497 [01:52<10:28, 2.03it/s]epoch: 0 loss: 0.2170752 f1: 0.0000000: 15%|█▌ | 225/1497 [01:52<10:23, 2.04it/s]epoch: 0 loss: 0.2085129 f1: 0.0000000: 15%|█▌ | 225/1497 [01:53<10:23, 2.04it/s]epoch: 0 loss: 0.2085129 f1: 0.0000000: 15%|█▌ | 226/1497 [01:53<10:23, 2.04it/s]epoch: 0 loss: 0.0416001 f1: 0.0000000: 15%|█▌ | 226/1497 [01:53<10:23, 2.04it/s]epoch: 0 loss: 0.0416001 f1: 0.0000000: 15%|█▌ | 227/1497 
[01:53<10:21, 2.04it/s]epoch: 0 loss: 0.1003876 f1: 0.0000000: 15%|█▌ | 227/1497 [01:54<10:21, 2.04it/s]epoch: 0 loss: 0.1003876 f1: 0.0000000: 15%|█▌ | 228/1497 [01:54<10:18, 2.05it/s]epoch: 0 loss: 0.0730709 f1: 0.0000000: 15%|█▌ | 228/1497 [01:54<10:18, 2.05it/s]epoch: 0 loss: 0.0730709 f1: 0.0000000: 15%|█▌ | 229/1497 [01:54<10:18, 2.05it/s]epoch: 0 loss: 0.0584531 f1: 0.0000000: 15%|█▌ | 229/1497 [01:55<10:18, 2.05it/s]epoch: 0 loss: 0.0584531 f1: 0.0000000: 15%|█▌ | 230/1497 [01:55<10:20, 2.04it/s]epoch: 0 loss: 0.0486321 f1: 0.0000000: 15%|█▌ | 230/1497 [01:55<10:20, 2.04it/s]epoch: 0 loss: 0.0486321 f1: 0.0000000: 15%|█▌ | 231/1497 [01:55<10:24, 2.03it/s]epoch: 0 loss: 0.1134908 f1: 0.0000000: 15%|█▌ | 231/1497 [01:56<10:24, 2.03it/s]epoch: 0 loss: 0.1134908 f1: 0.0000000: 15%|█▌ | 232/1497 [01:56<10:24, 2.03it/s]epoch: 0 loss: 0.1040146 f1: 0.0000000: 15%|█▌ | 232/1497 [01:56<10:24, 2.03it/s]epoch: 0 loss: 0.1040146 f1: 0.0000000: 16%|█▌ | 233/1497 [01:56<10:22, 2.03it/s]epoch: 0 loss: 0.1879240 f1: 0.0000000: 16%|█▌ | 233/1497 [01:57<10:22, 2.03it/s]epoch: 0 loss: 0.1879240 f1: 0.0000000: 16%|█▌ | 234/1497 [01:57<10:28, 2.01it/s]epoch: 0 loss: 0.0945829 f1: 0.0000000: 16%|█▌ | 234/1497 [01:57<10:28, 2.01it/s]epoch: 0 loss: 0.0945829 f1: 0.0000000: 16%|█▌ | 235/1497 [01:57<10:29, 2.00it/s]epoch: 0 loss: 0.1194879 f1: 0.0000000: 16%|█▌ | 235/1497 [01:58<10:29, 2.00it/s]epoch: 0 loss: 0.1194879 f1: 0.0000000: 16%|█▌ | 236/1497 [01:58<10:34, 1.99it/s]epoch: 0 loss: 0.0853996 f1: 0.0000000: 16%|█▌ | 236/1497 [01:58<10:34, 1.99it/s]epoch: 0 loss: 0.0853996 f1: 0.0000000: 16%|█▌ | 237/1497 [01:58<10:34, 1.98it/s]epoch: 0 loss: 0.0750955 f1: 0.0000000: 16%|█▌ | 237/1497 [01:59<10:34, 1.98it/s]epoch: 0 loss: 0.0750955 f1: 0.0000000: 16%|█▌ | 238/1497 [01:59<10:37, 1.97it/s]epoch: 0 loss: 0.0569779 f1: 0.0000000: 16%|█▌ | 238/1497 [01:59<10:37, 1.97it/s]epoch: 0 loss: 0.0569779 f1: 0.0000000: 16%|█▌ | 239/1497 [01:59<10:31, 1.99it/s]epoch: 0 loss: 0.2090297 f1: 
0.0000000: 16%|█▌ | 239/1497 [02:00<10:31, 1.99it/s]epoch: 0 loss: 0.2090297 f1: 0.0000000: 16%|█▌ | 240/1497 [02:00<10:27, 2.00it/s]epoch: 0 loss: 0.0942163 f1: 0.0000000: 16%|█▌ | 240/1497 [02:00<10:27, 2.00it/s]epoch: 0 loss: 0.0942163 f1: 0.0000000: 16%|█▌ | 241/1497 [02:00<10:23, 2.02it/s]epoch: 0 loss: 0.1576418 f1: 0.0000000: 16%|█▌ | 241/1497 [02:01<10:23, 2.02it/s]epoch: 0 loss: 0.1576418 f1: 0.0000000: 16%|█▌ | 242/1497 [02:01<10:18, 2.03it/s]epoch: 0 loss: 0.1052388 f1: 0.0000000: 16%|█▌ | 242/1497 [02:01<10:18, 2.03it/s]epoch: 0 loss: 0.1052388 f1: 0.0000000: 16%|█▌ | 243/1497 [02:01<10:17, 2.03it/s]epoch: 0 loss: 0.0752539 f1: 0.0000000: 16%|█▌ | 243/1497 [02:02<10:17, 2.03it/s]epoch: 0 loss: 0.0752539 f1: 0.0000000: 16%|█▋ | 244/1497 [02:02<10:15, 2.04it/s]epoch: 0 loss: 0.1940305 f1: 0.0000000: 16%|█▋ | 244/1497 [02:02<10:15, 2.04it/s]epoch: 0 loss: 0.1940305 f1: 0.0000000: 16%|█▋ | 245/1497 [02:02<10:17, 2.03it/s]epoch: 0 loss: 0.2200802 f1: 0.0000000: 16%|█▋ | 245/1497 [02:03<10:17, 2.03it/s]epoch: 0 loss: 0.2200802 f1: 0.0000000: 16%|█▋ | 246/1497 [02:03<10:16, 2.03it/s]epoch: 0 loss: 0.1404418 f1: 0.0000000: 16%|█▋ | 246/1497 [02:03<10:16, 2.03it/s]epoch: 0 loss: 0.1404418 f1: 0.0000000: 16%|█▋ | 247/1497 [02:03<10:20, 2.02it/s]epoch: 0 loss: 0.1704356 f1: 0.0000000: 16%|█▋ | 247/1497 [02:04<10:20, 2.02it/s]epoch: 0 loss: 0.1704356 f1: 0.0000000: 17%|█▋ | 248/1497 [02:04<10:17, 2.02it/s]epoch: 0 loss: 0.0841762 f1: 0.0000000: 17%|█▋ | 248/1497 [02:04<10:17, 2.02it/s]epoch: 0 loss: 0.0841762 f1: 0.0000000: 17%|█▋ | 249/1497 [02:04<10:15, 2.03it/s]epoch: 0 loss: 0.1185966 f1: 0.0000000: 17%|█▋ | 249/1497 [02:05<10:15, 2.03it/s]epoch: 0 loss: 0.1185966 f1: 0.0000000: 17%|█▋ | 250/1497 [02:05<10:11, 2.04it/s]epoch: 0 loss: 0.2026729 f1: 0.0000000: 17%|█▋ | 250/1497 [02:05<10:11, 2.04it/s]epoch: 0 loss: 0.2026729 f1: 0.0000000: 17%|█▋ | 251/1497 [02:05<10:10, 2.04it/s]epoch: 0 loss: 0.1962802 f1: 0.0000000: 17%|█▋ | 251/1497 [02:06<10:10, 
2.04it/s]epoch: 0 loss: 0.1962802 f1: 0.0000000: 17%|█▋ | 252/1497 [02:06<10:12, 2.03it/s]epoch: 0 loss: 0.0895673 f1: 0.0000000: 17%|█▋ | 252/1497 [02:06<10:12, 2.03it/s]epoch: 0 loss: 0.0895673 f1: 0.0000000: 17%|█▋ | 253/1497 [02:06<10:08, 2.04it/s]epoch: 0 loss: 0.1108808 f1: 0.0000000: 17%|█▋ | 253/1497 [02:07<10:08, 2.04it/s]epoch: 0 loss: 0.1108808 f1: 0.0000000: 17%|█▋ | 254/1497 [02:07<10:07, 2.04it/s]epoch: 0 loss: 0.0954705 f1: 0.0000000: 17%|█▋ | 254/1497 [02:07<10:07, 2.04it/s]epoch: 0 loss: 0.0954705 f1: 0.0000000: 17%|█▋ | 255/1497 [02:07<10:07, 2.05it/s]epoch: 0 loss: 0.1149973 f1: 0.0000000: 17%|█▋ | 255/1497 [02:08<10:07, 2.05it/s]epoch: 0 loss: 0.1149973 f1: 0.0000000: 17%|█▋ | 256/1497 [02:08<10:05, 2.05it/s]epoch: 0 loss: 0.2082841 f1: 0.0000000: 17%|█▋ | 256/1497 [02:08<10:05, 2.05it/s]epoch: 0 loss: 0.2082841 f1: 0.0000000: 17%|█▋ | 257/1497 [02:08<10:08, 2.04it/s]epoch: 0 loss: 0.0541611 f1: 0.0000000: 17%|█▋ | 257/1497 [02:09<10:08, 2.04it/s]epoch: 0 loss: 0.0541611 f1: 0.0000000: 17%|█▋ | 258/1497 [02:09<10:16, 2.01it/s]epoch: 0 loss: 0.0729708 f1: 0.0000000: 17%|█▋ | 258/1497 [02:09<10:16, 2.01it/s]epoch: 0 loss: 0.0729708 f1: 0.0000000: 17%|█▋ | 259/1497 [02:09<10:18, 2.00it/s]epoch: 0 loss: 0.1995153 f1: 0.0000000: 17%|█▋ | 259/1497 [02:10<10:18, 2.00it/s]epoch: 0 loss: 0.1995153 f1: 0.0000000: 17%|█▋ | 260/1497 [02:10<10:13, 2.02it/s]epoch: 0 loss: 0.1734572 f1: 0.0000000: 17%|█▋ | 260/1497 [02:10<10:13, 2.02it/s]epoch: 0 loss: 0.1734572 f1: 0.0000000: 17%|█▋ | 261/1497 [02:10<10:12, 2.02it/s]epoch: 0 loss: 0.0376809 f1: 0.0000000: 17%|█▋ | 261/1497 [02:11<10:12, 2.02it/s]epoch: 0 loss: 0.0376809 f1: 0.0000000: 18%|█▊ | 262/1497 [02:11<10:09, 2.03it/s]epoch: 0 loss: 0.0429080 f1: 0.0000000: 18%|█▊ | 262/1497 [02:11<10:09, 2.03it/s]epoch: 0 loss: 0.0429080 f1: 0.0000000: 18%|█▊ | 263/1497 [02:11<10:06, 2.03it/s]epoch: 0 loss: 0.1626132 f1: 0.0000000: 18%|█▊ | 263/1497 [02:12<10:06, 2.03it/s]epoch: 0 loss: 0.1626132 f1: 0.0000000: 18%|█▊ 
| 264/1497 [02:12<10:05, 2.04it/s]epoch: 0 loss: 0.1602821 f1: 0.0000000: 18%|█▊ | 264/1497 [02:12<10:05, 2.04it/s]epoch: 0 loss: 0.1602821 f1: 0.0000000: 18%|█▊ | 265/1497 [02:12<10:06, 2.03it/s]epoch: 0 loss: 0.1753288 f1: 0.0000000: 18%|█▊ | 265/1497 [02:13<10:06, 2.03it/s]epoch: 0 loss: 0.1753288 f1: 0.0000000: 18%|█▊ | 266/1497 [02:13<10:04, 2.04it/s]epoch: 0 loss: 0.1196983 f1: 0.0000000: 18%|█▊ | 266/1497 [02:13<10:04, 2.04it/s]epoch: 0 loss: 0.1196983 f1: 0.0000000: 18%|█▊ | 267/1497 [02:13<10:07, 2.03it/s]epoch: 0 loss: 0.0528425 f1: 0.0000000: 18%|█▊ | 267/1497 [02:14<10:07, 2.03it/s]epoch: 0 loss: 0.0528425 f1: 0.0000000: 18%|█▊ | 268/1497 [02:14<10:08, 2.02it/s]epoch: 0 loss: 0.2412401 f1: 0.0000000: 18%|█▊ | 268/1497 [02:14<10:08, 2.02it/s]epoch: 0 loss: 0.2412401 f1: 0.0000000: 18%|█▊ | 269/1497 [02:14<10:11, 2.01it/s]epoch: 0 loss: 0.0472735 f1: 0.0000000: 18%|█▊ | 269/1497 [02:15<10:11, 2.01it/s]epoch: 0 loss: 0.0472735 f1: 0.0000000: 18%|█▊ | 270/1497 [02:15<10:10, 2.01it/s]epoch: 0 loss: 0.0520830 f1: 0.0000000: 18%|█▊ | 270/1497 [02:15<10:10, 2.01it/s]epoch: 0 loss: 0.0520830 f1: 0.0000000: 18%|█▊ | 271/1497 [02:15<10:11, 2.00it/s]epoch: 0 loss: 0.0912910 f1: 0.0000000: 18%|█▊ | 271/1497 [02:16<10:11, 2.00it/s]epoch: 0 loss: 0.0912910 f1: 0.0000000: 18%|█▊ | 272/1497 [02:16<10:13, 2.00it/s]epoch: 0 loss: 0.1377559 f1: 0.0000000: 18%|█▊ | 272/1497 [02:16<10:13, 2.00it/s]epoch: 0 loss: 0.1377559 f1: 0.0000000: 18%|█▊ | 273/1497 [02:16<10:11, 2.00it/s]epoch: 0 loss: 0.2131233 f1: 0.0000000: 18%|█▊ | 273/1497 [02:17<10:11, 2.00it/s]epoch: 0 loss: 0.2131233 f1: 0.0000000: 18%|█▊ | 274/1497 [02:17<10:12, 2.00it/s]epoch: 0 loss: 0.0765667 f1: 0.0000000: 18%|█▊ | 274/1497 [02:17<10:12, 2.00it/s]epoch: 0 loss: 0.0765667 f1: 0.0000000: 18%|█▊ | 275/1497 [02:17<10:10, 2.00it/s]epoch: 0 loss: 0.1350123 f1: 0.0000000: 18%|█▊ | 275/1497 [02:18<10:10, 2.00it/s]epoch: 0 loss: 0.1350123 f1: 0.0000000: 18%|█▊ | 276/1497 [02:18<10:10, 2.00it/s]epoch: 0 loss: 
0.1438650 f1: 0.0000000: 18%|█▊ | 276/1497 [02:18<10:10, 2.00it/s]epoch: 0 loss: 0.1438650 f1: 0.0000000: 19%|█▊ | 277/1497 [02:18<10:08, 2.01it/s]epoch: 0 loss: 0.0946715 f1: 0.0000000: 19%|█▊ | 277/1497 [02:19<10:08, 2.01it/s]epoch: 0 loss: 0.0946715 f1: 0.0000000: 19%|█▊ | 278/1497 [02:19<10:06, 2.01it/s]epoch: 0 loss: 0.1213256 f1: 0.0000000: 19%|█▊ | 278/1497 [02:19<10:06, 2.01it/s]epoch: 0 loss: 0.1213256 f1: 0.0000000: 19%|█▊ | 279/1497 [02:19<10:06, 2.01it/s]epoch: 0 loss: 0.0649065 f1: 0.0000000: 19%|█▊ | 279/1497 [02:20<10:06, 2.01it/s]epoch: 0 loss: 0.0649065 f1: 0.0000000: 19%|█▊ | 280/1497 [02:20<10:05, 2.01it/s]epoch: 0 loss: 0.1360157 f1: 0.0000000: 19%|█▊ | 280/1497 [02:20<10:05, 2.01it/s]epoch: 0 loss: 0.1360157 f1: 0.0000000: 19%|█▉ | 281/1497 [02:20<10:06, 2.00it/s]epoch: 0 loss: 0.0682386 f1: 0.0000000: 19%|█▉ | 281/1497 [02:21<10:06, 2.00it/s]epoch: 0 loss: 0.0682386 f1: 0.0000000: 19%|█▉ | 282/1497 [02:21<10:05, 2.01it/s]epoch: 0 loss: 0.1298203 f1: 0.0000000: 19%|█▉ | 282/1497 [02:21<10:05, 2.01it/s]epoch: 0 loss: 0.1298203 f1: 0.0000000: 19%|█▉ | 283/1497 [02:21<10:05, 2.01it/s]epoch: 0 loss: 0.3159628 f1: 0.0000000: 19%|█▉ | 283/1497 [02:22<10:05, 2.01it/s]epoch: 0 loss: 0.3159628 f1: 0.0000000: 19%|█▉ | 284/1497 [02:22<10:05, 2.00it/s]epoch: 0 loss: 0.1660788 f1: 0.0000000: 19%|█▉ | 284/1497 [02:22<10:05, 2.00it/s]epoch: 0 loss: 0.1660788 f1: 0.0000000: 19%|█▉ | 285/1497 [02:22<10:08, 1.99it/s]epoch: 0 loss: 0.1378157 f1: 0.0000000: 19%|█▉ | 285/1497 [02:23<10:08, 1.99it/s]epoch: 0 loss: 0.1378157 f1: 0.0000000: 19%|█▉ | 286/1497 [02:23<10:08, 1.99it/s]epoch: 0 loss: 0.0618449 f1: 0.0000000: 19%|█▉ | 286/1497 [02:23<10:08, 1.99it/s]epoch: 0 loss: 0.0618449 f1: 0.0000000: 19%|█▉ | 287/1497 [02:23<10:07, 1.99it/s]epoch: 0 loss: 0.1137175 f1: 0.0000000: 19%|█▉ | 287/1497 [02:24<10:07, 1.99it/s]epoch: 0 loss: 0.1137175 f1: 0.0000000: 19%|█▉ | 288/1497 [02:24<10:08, 1.99it/s]epoch: 0 loss: 0.1560687 f1: 0.0000000: 19%|█▉ | 288/1497 
[02:24<10:08, 1.99it/s]epoch: 0 loss: 0.1560687 f1: 0.0000000: 19%|█▉ | 289/1497 [02:24<10:10, 1.98it/s]epoch: 0 loss: 0.0661101 f1: 0.0000000: 19%|█▉ | 289/1497 [02:25<10:10, 1.98it/s]epoch: 0 loss: 0.0661101 f1: 0.0000000: 19%|█▉ | 290/1497 [02:25<10:08, 1.98it/s]epoch: 0 loss: 0.2135462 f1: 0.0000000: 19%|█▉ | 290/1497 [02:25<10:08, 1.98it/s]epoch: 0 loss: 0.2135462 f1: 0.0000000: 19%|█▉ | 291/1497 [02:25<10:06, 1.99it/s]epoch: 0 loss: 0.0491861 f1: 0.0000000: 19%|█▉ | 291/1497 [02:26<10:06, 1.99it/s]epoch: 0 loss: 0.0491861 f1: 0.0000000: 20%|█▉ | 292/1497 [02:26<10:04, 1.99it/s]epoch: 0 loss: 0.0799251 f1: 0.0000000: 20%|█▉ | 292/1497 [02:26<10:04, 1.99it/s]epoch: 0 loss: 0.0799251 f1: 0.0000000: 20%|█▉ | 293/1497 [02:26<10:03, 2.00it/s]epoch: 0 loss: 0.0419074 f1: 0.0000000: 20%|█▉ | 293/1497 [02:27<10:03, 2.00it/s]epoch: 0 loss: 0.0419074 f1: 0.0000000: 20%|█▉ | 294/1497 [02:27<10:03, 1.99it/s]epoch: 0 loss: 0.1104445 f1: 0.0000000: 20%|█▉ | 294/1497 [02:27<10:03, 1.99it/s]epoch: 0 loss: 0.1104445 f1: 0.0000000: 20%|█▉ | 295/1497 [02:27<10:04, 1.99it/s]epoch: 0 loss: 0.0966027 f1: 0.0000000: 20%|█▉ | 295/1497 [02:28<10:04, 1.99it/s]epoch: 0 loss: 0.0966027 f1: 0.0000000: 20%|█▉ | 296/1497 [02:28<10:02, 1.99it/s]epoch: 0 loss: 0.2971023 f1: 0.0000000: 20%|█▉ | 296/1497 [02:28<10:02, 1.99it/s]epoch: 0 loss: 0.2971023 f1: 0.0000000: 20%|█▉ | 297/1497 [02:28<10:01, 1.99it/s]epoch: 0 loss: 0.1047817 f1: 0.0000000: 20%|█▉ | 297/1497 [02:29<10:01, 1.99it/s]epoch: 0 loss: 0.1047817 f1: 0.0000000: 20%|█▉ | 298/1497 [02:29<10:02, 1.99it/s]epoch: 0 loss: 0.0629014 f1: 0.0000000: 20%|█▉ | 298/1497 [02:29<10:02, 1.99it/s]epoch: 0 loss: 0.0629014 f1: 0.0000000: 20%|█▉ | 299/1497 [02:29<10:03, 1.98it/s]epoch: 0 loss: 0.0780111 f1: 0.0000000: 20%|█▉ | 299/1497 [02:30<10:03, 1.98it/s]epoch: 0 loss: 0.0780111 f1: 0.0000000: 20%|██ | 300/1497 [02:30<10:15, 1.94it/s]epoch: 0 loss: 0.1231631 f1: 0.0000000: 20%|██ | 300/1497 [02:30<10:15, 1.94it/s]epoch: 0 loss: 0.1231631 f1: 
0.0000000: 20%|██ | 301/1497 [02:30<10:13, 1.95it/s]epoch: 0 loss: 0.0560187 f1: 0.0000000: 20%|██ | 301/1497 [02:31<10:13, 1.95it/s]epoch: 0 loss: 0.0560187 f1: 0.0000000: 20%|██ | 302/1497 [02:31<10:08, 1.96it/s]epoch: 0 loss: 0.4510982 f1: 0.0000000: 20%|██ | 302/1497 [02:31<10:08, 1.96it/s]epoch: 0 loss: 0.4510982 f1: 0.0000000: 20%|██ | 303/1497 [02:31<10:07, 1.97it/s]epoch: 0 loss: 0.0729192 f1: 0.0000000: 20%|██ | 303/1497 [02:32<10:07, 1.97it/s]epoch: 0 loss: 0.0729192 f1: 0.0000000: 20%|██ | 304/1497 [02:32<10:04, 1.97it/s]epoch: 0 loss: 0.1942685 f1: 0.0000000: 20%|██ | 304/1497 [02:32<10:04, 1.97it/s]epoch: 0 loss: 0.1942685 f1: 0.0000000: 20%|██ | 305/1497 [02:32<10:03, 1.98it/s]epoch: 0 loss: 0.1344225 f1: 0.0000000: 20%|██ | 305/1497 [02:33<10:03, 1.98it/s]epoch: 0 loss: 0.1344225 f1: 0.0000000: 20%|██ | 306/1497 [02:33<10:01, 1.98it/s]epoch: 0 loss: 0.0588397 f1: 0.0000000: 20%|██ | 306/1497 [02:33<10:01, 1.98it/s]epoch: 0 loss: 0.0588397 f1: 0.0000000: 21%|██ | 307/1497 [02:33<10:01, 1.98it/s]epoch: 0 loss: 0.0502447 f1: 0.0000000: 21%|██ | 307/1497 [02:34<10:01, 1.98it/s]epoch: 0 loss: 0.0502447 f1: 0.0000000: 21%|██ | 308/1497 [02:34<09:58, 1.99it/s]epoch: 0 loss: 0.1701620 f1: 0.0000000: 21%|██ | 308/1497 [02:34<09:58, 1.99it/s]epoch: 0 loss: 0.1701620 f1: 0.0000000: 21%|██ | 309/1497 [02:34<09:58, 1.99it/s]epoch: 0 loss: 0.0543061 f1: 0.0000000: 21%|██ | 309/1497 [02:35<09:58, 1.99it/s]epoch: 0 loss: 0.0543061 f1: 0.0000000: 21%|██ | 310/1497 [02:35<09:57, 1.99it/s]epoch: 0 loss: 0.1427117 f1: 0.0000000: 21%|██ | 310/1497 [02:35<09:57, 1.99it/s]epoch: 0 loss: 0.1427117 f1: 0.0000000: 21%|██ | 311/1497 [02:35<09:54, 1.99it/s]epoch: 0 loss: 0.0555226 f1: 0.0000000: 21%|██ | 311/1497 [02:36<09:54, 1.99it/s]epoch: 0 loss: 0.0555226 f1: 0.0000000: 21%|██ | 312/1497 [02:36<09:56, 1.99it/s]epoch: 0 loss: 0.0584191 f1: 0.0000000: 21%|██ | 312/1497 [02:36<09:56, 1.99it/s]epoch: 0 loss: 0.0584191 f1: 0.0000000: 21%|██ | 313/1497 [02:36<09:56, 
1.99it/s]epoch: 0 loss: 0.0482981 f1: 0.0000000: 21%|██ | 313/1497 [02:37<09:56, 1.99it/s]epoch: 0 loss: 0.0482981 f1: 0.0000000: 21%|██ | 314/1497 [02:37<09:56, 1.98it/s]epoch: 0 loss: 0.2176925 f1: 0.0000000: 21%|██ | 314/1497 [02:37<09:56, 1.98it/s]epoch: 0 loss: 0.2176925 f1: 0.0000000: 21%|██ | 315/1497 [02:37<09:56, 1.98it/s]epoch: 0 loss: 0.0962287 f1: 0.0000000: 21%|██ | 315/1497 [02:38<09:56, 1.98it/s]epoch: 0 loss: 0.0962287 f1: 0.0000000: 21%|██ | 316/1497 [02:38<09:56, 1.98it/s]epoch: 0 loss: 0.1175959 f1: 0.0000000: 21%|██ | 316/1497 [02:38<09:56, 1.98it/s]epoch: 0 loss: 0.1175959 f1: 0.0000000: 21%|██ | 317/1497 [02:38<09:57, 1.98it/s]epoch: 0 loss: 0.0849646 f1: 0.0000000: 21%|██ | 317/1497 [02:39<09:57, 1.98it/s]epoch: 0 loss: 0.0849646 f1: 0.0000000: 21%|██ | 318/1497 [02:39<09:57, 1.97it/s]epoch: 0 loss: 0.0429056 f1: 0.0000000: 21%|██ | 318/1497 [02:39<09:57, 1.97it/s]epoch: 0 loss: 0.0429056 f1: 0.0000000: 21%|██▏ | 319/1497 [02:39<09:57, 1.97it/s]epoch: 0 loss: 0.1154575 f1: 0.0000000: 21%|██▏ | 319/1497 [02:40<09:57, 1.97it/s]epoch: 0 loss: 0.1154575 f1: 0.0000000: 21%|██▏ | 320/1497 [02:40<09:56, 1.97it/s]epoch: 0 loss: 0.1422982 f1: 0.0000000: 21%|██▏ | 320/1497 [02:40<09:56, 1.97it/s]epoch: 0 loss: 0.1422982 f1: 0.0000000: 21%|██▏ | 321/1497 [02:40<09:55, 1.97it/s]epoch: 0 loss: 0.0587731 f1: 0.0000000: 21%|██▏ | 321/1497 [02:41<09:55, 1.97it/s]epoch: 0 loss: 0.0587731 f1: 0.0000000: 22%|██▏ | 322/1497 [02:41<09:52, 1.98it/s]epoch: 0 loss: 0.0494086 f1: 0.0000000: 22%|██▏ | 322/1497 [02:41<09:52, 1.98it/s]epoch: 0 loss: 0.0494086 f1: 0.0000000: 22%|██▏ | 323/1497 [02:41<09:50, 1.99it/s]epoch: 0 loss: 0.1400523 f1: 0.0000000: 22%|██▏ | 323/1497 [02:42<09:50, 1.99it/s]epoch: 0 loss: 0.1400523 f1: 0.0000000: 22%|██▏ | 324/1497 [02:42<09:53, 1.98it/s]epoch: 0 loss: 0.1198044 f1: 0.0000000: 22%|██▏ | 324/1497 [02:42<09:53, 1.98it/s]epoch: 0 loss: 0.1198044 f1: 0.0000000: 22%|██▏ | 325/1497 [02:42<09:54, 1.97it/s]epoch: 0 loss: 0.0522992 f1: 
0.0000000: 22%|██▏ | 325/1497 [02:43<09:54, 1.97it/s]epoch: 0 loss: 0.0522992 f1: 0.0000000: 22%|██▏ | 326/1497 [02:43<09:50, 1.98it/s]epoch: 0 loss: 0.1569546 f1: 0.0000000: 22%|██▏ | 326/1497 [02:43<09:50, 1.98it/s]epoch: 0 loss: 0.1569546 f1: 0.0000000: 22%|██▏ | 327/1497 [02:43<09:48, 1.99it/s]epoch: 0 loss: 0.1190811 f1: 0.0000000: 22%|██▏ | 327/1497 [02:44<09:48, 1.99it/s]epoch: 0 loss: 0.1190811 f1: 0.0000000: 22%|██▏ | 328/1497 [02:44<09:48, 1.99it/s]epoch: 0 loss: 0.1808932 f1: 0.0000000: 22%|██▏ | 328/1497 [02:44<09:48, 1.99it/s]epoch: 0 loss: 0.1808932 f1: 0.0000000: 22%|██▏ | 329/1497 [02:44<09:49, 1.98it/s]epoch: 0 loss: 0.0948905 f1: 0.0000000: 22%|██▏ | 329/1497 [02:45<09:49, 1.98it/s]epoch: 0 loss: 0.0948905 f1: 0.0000000: 22%|██▏ | 330/1497 [02:45<09:50, 1.98it/s]epoch: 0 loss: 0.1359536 f1: 0.0000000: 22%|██▏ | 330/1497 [02:45<09:50, 1.98it/s]epoch: 0 loss: 0.1359536 f1: 0.0000000: 22%|██▏ | 331/1497 [02:45<09:47, 1.98it/s]epoch: 0 loss: 0.0756905 f1: 0.0000000: 22%|██▏ | 331/1497 [02:46<09:47, 1.98it/s]epoch: 0 loss: 0.0756905 f1: 0.0000000: 22%|██▏ | 332/1497 [02:46<09:49, 1.98it/s]epoch: 0 loss: 0.1284903 f1: 0.0000000: 22%|██▏ | 332/1497 [02:46<09:49, 1.98it/s]epoch: 0 loss: 0.1284903 f1: 0.0000000: 22%|██▏ | 333/1497 [02:46<09:49, 1.97it/s]epoch: 0 loss: 0.0369404 f1: 0.0000000: 22%|██▏ | 333/1497 [02:47<09:49, 1.97it/s]epoch: 0 loss: 0.0369404 f1: 0.0000000: 22%|██▏ | 334/1497 [02:47<09:48, 1.97it/s]epoch: 0 loss: 0.0793558 f1: 0.0000000: 22%|██▏ | 334/1497 [02:47<09:48, 1.97it/s]epoch: 0 loss: 0.0793558 f1: 0.0000000: 22%|██▏ | 335/1497 [02:47<09:55, 1.95it/s]epoch: 0 loss: 0.0482245 f1: 0.0000000: 22%|██▏ | 335/1497 [02:48<09:55, 1.95it/s]epoch: 0 loss: 0.0482245 f1: 0.0000000: 22%|██▏ | 336/1497 [02:48<09:52, 1.96it/s]epoch: 0 loss: 0.1638341 f1: 0.0000000: 22%|██▏ | 336/1497 [02:48<09:52, 1.96it/s]epoch: 0 loss: 0.1638341 f1: 0.0000000: 23%|██▎ | 337/1497 [02:48<09:51, 1.96it/s]epoch: 0 loss: 0.1964559 f1: 0.0000000: 23%|██▎ | 337/1497 
[02:49<09:51, 1.96it/s]epoch: 0 loss: 0.1964559 f1: 0.0000000: 23%|██▎ | 338/1497 [02:49<09:53, 1.95it/s]epoch: 0 loss: 0.0636460 f1: 0.0000000: 23%|██▎ | 338/1497 [02:50<09:53, 1.95it/s]epoch: 0 loss: 0.0636460 f1: 0.0000000: 23%|██▎ | 339/1497 [02:50<09:52, 1.95it/s]epoch: 0 loss: 0.0425003 f1: 0.0000000: 23%|██▎ | 339/1497 [02:50<09:52, 1.95it/s]epoch: 0 loss: 0.0425003 f1: 0.0000000: 23%|██▎ | 340/1497 [02:50<09:58, 1.93it/s]epoch: 0 loss: 0.0811990 f1: 0.0000000: 23%|██▎ | 340/1497 [02:51<09:58, 1.93it/s]epoch: 0 loss: 0.0811990 f1: 0.0000000: 23%|██▎ | 341/1497 [02:51<10:05, 1.91it/s]epoch: 0 loss: 0.0786221 f1: 0.0000000: 23%|██▎ | 341/1497 [02:51<10:05, 1.91it/s]epoch: 0 loss: 0.0786221 f1: 0.0000000: 23%|██▎ | 342/1497 [02:51<09:59, 1.93it/s]epoch: 0 loss: 0.1523899 f1: 0.0000000: 23%|██▎ | 342/1497 [02:52<09:59, 1.93it/s]epoch: 0 loss: 0.1523899 f1: 0.0000000: 23%|██▎ | 343/1497 [02:52<09:54, 1.94it/s]epoch: 0 loss: 0.1222536 f1: 0.0000000: 23%|██▎ | 343/1497 [02:52<09:54, 1.94it/s]epoch: 0 loss: 0.1222536 f1: 0.0000000: 23%|██▎ | 344/1497 [02:52<09:51, 1.95it/s]epoch: 0 loss: 0.1774729 f1: 0.0000000: 23%|██▎ | 344/1497 [02:53<09:51, 1.95it/s]epoch: 0 loss: 0.1774729 f1: 0.0000000: 23%|██▎ | 345/1497 [02:53<09:56, 1.93it/s]epoch: 0 loss: 0.0396418 f1: 0.0000000: 23%|██▎ | 345/1497 [02:53<09:56, 1.93it/s]epoch: 0 loss: 0.0396418 f1: 0.0000000: 23%|██▎ | 346/1497 [02:53<09:51, 1.95it/s]epoch: 0 loss: 0.0683564 f1: 0.0000000: 23%|██▎ | 346/1497 [02:54<09:51, 1.95it/s]epoch: 0 loss: 0.0683564 f1: 0.0000000: 23%|██▎ | 347/1497 [02:54<09:46, 1.96it/s]epoch: 0 loss: 0.0250911 f1: 0.0000000: 23%|██▎ | 347/1497 [02:54<09:46, 1.96it/s]epoch: 0 loss: 0.0250911 f1: 0.0000000: 23%|██▎ | 348/1497 [02:54<09:43, 1.97it/s]epoch: 0 loss: 0.0348987 f1: 0.0000000: 23%|██▎ | 348/1497 [02:55<09:43, 1.97it/s]epoch: 0 loss: 0.0348987 f1: 0.0000000: 23%|██▎ | 349/1497 [02:55<09:42, 1.97it/s]epoch: 0 loss: 0.1284224 f1: 0.0000000: 23%|██▎ | 349/1497 [02:55<09:42, 1.97it/s]epoch: 0 
loss: 0.1284224 f1: 0.0000000: 23%|██▎ | 350/1497 [02:55<09:43, 1.97it/s]epoch: 0 loss: 0.2249579 f1: 0.0000000: 23%|██▎ | 350/1497 [02:56<09:43, 1.97it/s]epoch: 0 loss: 0.2249579 f1: 0.0000000: 23%|██▎ | 351/1497 [02:56<09:41, 1.97it/s]epoch: 0 loss: 0.0534442 f1: 0.0000000: 23%|██▎ | 351/1497 [02:56<09:41, 1.97it/s]epoch: 0 loss: 0.0534442 f1: 0.0000000: 24%|██▎ | 352/1497 [02:56<09:39, 1.97it/s]epoch: 0 loss: 0.0589874 f1: 0.0000000: 24%|██▎ | 352/1497 [02:57<09:39, 1.97it/s]epoch: 0 loss: 0.0589874 f1: 0.0000000: 24%|██▎ | 353/1497 [02:57<09:37, 1.98it/s]epoch: 0 loss: 0.1800921 f1: 0.0000000: 24%|██▎ | 353/1497 [02:57<09:37, 1.98it/s]epoch: 0 loss: 0.1800921 f1: 0.0000000: 24%|██▎ | 354/1497 [02:57<09:41, 1.97it/s]epoch: 0 loss: 0.0548950 f1: 0.0000000: 24%|██▎ | 354/1497 [02:58<09:41, 1.97it/s]epoch: 0 loss: 0.0548950 f1: 0.0000000: 24%|██▎ | 355/1497 [02:58<09:38, 1.97it/s]epoch: 0 loss: 0.0924238 f1: 0.0000000: 24%|██▎ | 355/1497 [02:58<09:38, 1.97it/s]epoch: 0 loss: 0.0924238 f1: 0.0000000: 24%|██▍ | 356/1497 [02:58<09:35, 1.98it/s]epoch: 0 loss: 0.0504232 f1: 0.0000000: 24%|██▍ | 356/1497 [02:59<09:35, 1.98it/s]epoch: 0 loss: 0.0504232 f1: 0.0000000: 24%|██▍ | 357/1497 [02:59<09:33, 1.99it/s]epoch: 0 loss: 0.0232028 f1: 0.0000000: 24%|██▍ | 357/1497 [02:59<09:33, 1.99it/s]epoch: 0 loss: 0.0232028 f1: 0.0000000: 24%|██▍ | 358/1497 [02:59<09:31, 1.99it/s]epoch: 0 loss: 0.0758156 f1: 0.0000000: 24%|██▍ | 358/1497 [03:00<09:31, 1.99it/s]epoch: 0 loss: 0.0758156 f1: 0.0000000: 24%|██▍ | 359/1497 [03:00<09:31, 1.99it/s]epoch: 0 loss: 0.1333962 f1: 0.0000000: 24%|██▍ | 359/1497 [03:00<09:31, 1.99it/s]epoch: 0 loss: 0.1333962 f1: 0.0000000: 24%|██▍ | 360/1497 [03:00<09:32, 1.99it/s]epoch: 0 loss: 0.0924908 f1: 0.0000000: 24%|██▍ | 360/1497 [03:01<09:32, 1.99it/s]epoch: 0 loss: 0.0924908 f1: 0.0000000: 24%|██▍ | 361/1497 [03:01<09:38, 1.96it/s]epoch: 0 loss: 0.0945389 f1: 0.0000000: 24%|██▍ | 361/1497 [03:01<09:38, 1.96it/s]epoch: 0 loss: 0.0945389 f1: 0.0000000: 
24%|██▍ | 362/1497 [03:01<09:39, 1.96it/s]epoch: 0 loss: 0.1264846 f1: 0.0000000: 24%|██▍ | 362/1497 [03:02<09:39, 1.96it/s]epoch: 0 loss: 0.1264846 f1: 0.0000000: 24%|██▍ | 363/1497 [03:02<09:42, 1.95it/s]epoch: 0 loss: 0.0228805 f1: 0.0000000: 24%|██▍ | 363/1497 [03:02<09:42, 1.95it/s]epoch: 0 loss: 0.0228805 f1: 0.0000000: 24%|██▍ | 364/1497 [03:02<09:42, 1.94it/s]epoch: 0 loss: 0.2041564 f1: 0.0000000: 24%|██▍ | 364/1497 [03:03<09:42, 1.94it/s]epoch: 0 loss: 0.2041564 f1: 0.0000000: 24%|██▍ | 365/1497 [03:03<09:39, 1.95it/s]epoch: 0 loss: 0.1661517 f1: 0.0000000: 24%|██▍ | 365/1497 [03:03<09:39, 1.95it/s]epoch: 0 loss: 0.1661517 f1: 0.0000000: 24%|██▍ | 366/1497 [03:03<09:34, 1.97it/s]epoch: 0 loss: 0.0651597 f1: 0.0000000: 24%|██▍ | 366/1497 [03:04<09:34, 1.97it/s]epoch: 0 loss: 0.0651597 f1: 0.0000000: 25%|██▍ | 367/1497 [03:04<09:32, 1.98it/s]epoch: 0 loss: 0.0209745 f1: 0.0000000: 25%|██▍ | 367/1497 [03:04<09:32, 1.98it/s]epoch: 0 loss: 0.0209745 f1: 0.0000000: 25%|██▍ | 368/1497 [03:04<09:32, 1.97it/s]epoch: 0 loss: 0.0159901 f1: 0.0000000: 25%|██▍ | 368/1497 [03:05<09:32, 1.97it/s]epoch: 0 loss: 0.0159901 f1: 0.0000000: 25%|██▍ | 369/1497 [03:05<09:32, 1.97it/s]epoch: 0 loss: 0.1203888 f1: 0.0000000: 25%|██▍ | 369/1497 [03:05<09:32, 1.97it/s]epoch: 0 loss: 0.1203888 f1: 0.0000000: 25%|██▍ | 370/1497 [03:05<09:33, 1.96it/s]epoch: 0 loss: 0.0550803 f1: 0.0000000: 25%|██▍ | 370/1497 [03:06<09:33, 1.96it/s]epoch: 0 loss: 0.0550803 f1: 0.0000000: 25%|██▍ | 371/1497 [03:06<09:30, 1.97it/s]epoch: 0 loss: 0.0342024 f1: 0.0000000: 25%|██▍ | 371/1497 [03:06<09:30, 1.97it/s]epoch: 0 loss: 0.0342024 f1: 0.0000000: 25%|██▍ | 372/1497 [03:06<09:25, 1.99it/s]epoch: 0 loss: 0.0276786 f1: 0.0000000: 25%|██▍ | 372/1497 [03:07<09:25, 1.99it/s]epoch: 0 loss: 0.0276786 f1: 0.0000000: 25%|██▍ | 373/1497 [03:07<09:22, 2.00it/s]epoch: 0 loss: 0.0506784 f1: 0.0000000: 25%|██▍ | 373/1497 [03:07<09:22, 2.00it/s]epoch: 0 loss: 0.0506784 f1: 0.0000000: 25%|██▍ | 374/1497 
[03:07<09:18, 2.01it/s]epoch: 0 loss: 0.1151103 f1: 0.0000000: 25%|██▍ | 374/1497 [03:08<09:18, 2.01it/s]epoch: 0 loss: 0.1151103 f1: 0.0000000: 25%|██▌ | 375/1497 [03:08<09:16, 2.02it/s]epoch: 0 loss: 0.1658085 f1: 0.0000000: 25%|██▌ | 375/1497 [03:08<09:16, 2.02it/s]epoch: 0 loss: 0.1658085 f1: 0.0000000: 25%|██▌ | 376/1497 [03:08<09:14, 2.02it/s]epoch: 0 loss: 0.1055539 f1: 0.0000000: 25%|██▌ | 376/1497 [03:09<09:14, 2.02it/s]epoch: 0 loss: 0.1055539 f1: 0.0000000: 25%|██▌ | 377/1497 [03:09<09:13, 2.02it/s]epoch: 0 loss: 0.0303463 f1: 0.0000000: 25%|██▌ | 377/1497 [03:09<09:13, 2.02it/s]epoch: 0 loss: 0.0303463 f1: 0.0000000: 25%|██▌ | 378/1497 [03:09<09:12, 2.03it/s]epoch: 0 loss: 0.0595727 f1: 0.0000000: 25%|██▌ | 378/1497 [03:10<09:12, 2.03it/s]epoch: 0 loss: 0.0595727 f1: 0.0000000: 25%|██▌ | 379/1497 [03:10<09:10, 2.03it/s]epoch: 0 loss: 0.0782632 f1: 0.0000000: 25%|██▌ | 379/1497 [03:10<09:10, 2.03it/s]epoch: 0 loss: 0.0782632 f1: 0.0000000: 25%|██▌ | 380/1497 [03:10<09:10, 2.03it/s]epoch: 0 loss: 0.1466913 f1: 0.0000000: 25%|██▌ | 380/1497 [03:11<09:10, 2.03it/s]epoch: 0 loss: 0.1466913 f1: 0.0000000: 25%|██▌ | 381/1497 [03:11<09:14, 2.01it/s]epoch: 0 loss: 0.1051869 f1: 0.0000000: 25%|██▌ | 381/1497 [03:11<09:14, 2.01it/s]epoch: 0 loss: 0.1051869 f1: 0.0000000: 26%|██▌ | 382/1497 [03:11<09:21, 1.99it/s]epoch: 0 loss: 0.1257417 f1: 0.0000000: 26%|██▌ | 382/1497 [03:12<09:21, 1.99it/s]epoch: 0 loss: 0.1257417 f1: 0.0000000: 26%|██▌ | 383/1497 [03:12<09:19, 1.99it/s]epoch: 0 loss: 0.1064788 f1: 0.0000000: 26%|██▌ | 383/1497 [03:12<09:19, 1.99it/s]epoch: 0 loss: 0.1064788 f1: 0.0000000: 26%|██▌ | 384/1497 [03:12<09:20, 1.98it/s]epoch: 0 loss: 0.0848993 f1: 0.0000000: 26%|██▌ | 384/1497 [03:13<09:20, 1.98it/s]epoch: 0 loss: 0.0848993 f1: 0.0000000: 26%|██▌ | 385/1497 [03:13<09:21, 1.98it/s]epoch: 0 loss: 0.1720764 f1: 0.0000000: 26%|██▌ | 385/1497 [03:13<09:21, 1.98it/s]epoch: 0 loss: 0.1720764 f1: 0.0000000: 26%|██▌ | 386/1497 [03:13<09:18, 1.99it/s]epoch: 0 
loss: 0.0908954 f1: 0.0000000: 26%|██▌ | 386/1497 [03:14<09:18, 1.99it/s]epoch: 0 loss: 0.0908954 f1: 0.0000000: 26%|██▌ | 387/1497 [03:14<09:16, 1.99it/s]epoch: 0 loss: 0.1666478 f1: 0.0000000: 26%|██▌ | 387/1497 [03:14<09:16, 1.99it/s]epoch: 0 loss: 0.1666478 f1: 0.0000000: 26%|██▌ | 388/1497 [03:14<09:18, 1.99it/s]epoch: 0 loss: 0.0489222 f1: 0.0000000: 26%|██▌ | 388/1497 [03:15<09:18, 1.99it/s]epoch: 0 loss: 0.0489222 f1: 0.0000000: 26%|██▌ | 389/1497 [03:15<09:17, 1.99it/s]epoch: 0 loss: 0.1230112 f1: 0.0000000: 26%|██▌ | 389/1497 [03:15<09:17, 1.99it/s]epoch: 0 loss: 0.1230112 f1: 0.0000000: 26%|██▌ | 390/1497 [03:15<09:16, 1.99it/s]epoch: 0 loss: 0.0468770 f1: 0.0000000: 26%|██▌ | 390/1497 [03:16<09:16, 1.99it/s]epoch: 0 loss: 0.0468770 f1: 0.0000000: 26%|██▌ | 391/1497 [03:16<09:18, 1.98it/s]epoch: 0 loss: 0.1922848 f1: 0.0000000: 26%|██▌ | 391/1497 [03:16<09:18, 1.98it/s]epoch: 0 loss: 0.1922848 f1: 0.0000000: 26%|██▌ | 392/1497 [03:16<09:17, 1.98it/s]epoch: 0 loss: 0.2958025 f1: 0.0000000: 26%|██▌ | 392/1497 [03:17<09:17, 1.98it/s]epoch: 0 loss: 0.2958025 f1: 0.0000000: 26%|██▋ | 393/1497 [03:17<09:18, 1.98it/s]epoch: 0 loss: 0.1053138 f1: 0.0000000: 26%|██▋ | 393/1497 [03:17<09:18, 1.98it/s]epoch: 0 loss: 0.1053138 f1: 0.0000000: 26%|██▋ | 394/1497 [03:17<09:16, 1.98it/s]epoch: 0 loss: 0.0250833 f1: 0.0000000: 26%|██▋ | 394/1497 [03:18<09:16, 1.98it/s]epoch: 0 loss: 0.0250833 f1: 0.0000000: 26%|██▋ | 395/1497 [03:18<09:15, 1.98it/s]epoch: 0 loss: 0.1664651 f1: 0.0000000: 26%|██▋ | 395/1497 [03:18<09:15, 1.98it/s]epoch: 0 loss: 0.1664651 f1: 0.0000000: 26%|██▋ | 396/1497 [03:18<09:13, 1.99it/s]epoch: 0 loss: 0.2768899 f1: 0.0000000: 26%|██▋ | 396/1497 [03:19<09:13, 1.99it/s]epoch: 0 loss: 0.2768899 f1: 0.0000000: 27%|██▋ | 397/1497 [03:19<09:12, 1.99it/s]epoch: 0 loss: 0.0971593 f1: 0.0000000: 27%|██▋ | 397/1497 [03:19<09:12, 1.99it/s]epoch: 0 loss: 0.0971593 f1: 0.0000000: 27%|██▋ | 398/1497 [03:19<09:13, 1.99it/s]epoch: 0 loss: 0.0492652 f1: 0.0000000: 
27%|██▋ | 398/1497 [03:20<09:13, 1.99it/s]epoch: 0 loss: 0.0492652 f1: 0.0000000: 27%|██▋ | 399/1497 [03:20<09:11, 1.99it/s]epoch: 0 loss: 0.2618496 f1: 0.0000000: 27%|██▋ | 399/1497 [03:20<09:11, 1.99it/s]epoch: 0 loss: 0.2618496 f1: 0.0000000: 27%|██▋ | 400/1497 [03:20<09:10, 1.99it/s]epoch: 0 loss: 0.0167114 f1: 0.0000000: 27%|██▋ | 400/1497 [03:21<09:10, 1.99it/s]epoch: 0 loss: 0.0167114 f1: 0.0000000: 27%|██▋ | 401/1497 [03:21<09:09, 1.99it/s]epoch: 0 loss: 0.0930685 f1: 0.0000000: 27%|██▋ | 401/1497 [03:21<09:09, 1.99it/s]epoch: 0 loss: 0.0930685 f1: 0.0000000: 27%|██▋ | 402/1497 [03:21<09:09, 1.99it/s]epoch: 0 loss: 0.0451929 f1: 0.0000000: 27%|██▋ | 402/1497 [03:22<09:09, 1.99it/s]epoch: 0 loss: 0.0451929 f1: 0.0000000: 27%|██▋ | 403/1497 [03:22<09:09, 1.99it/s]epoch: 0 loss: 0.0204398 f1: 0.0000000: 27%|██▋ | 403/1497 [03:22<09:09, 1.99it/s]epoch: 0 loss: 0.0204398 f1: 0.0000000: 27%|██▋ | 404/1497 [03:22<09:09, 1.99it/s]epoch: 0 loss: 0.0126261 f1: 0.0000000: 27%|██▋ | 404/1497 [03:23<09:09, 1.99it/s]epoch: 0 loss: 0.0126261 f1: 0.0000000: 27%|██▋ | 405/1497 [03:23<09:11, 1.98it/s]epoch: 0 loss: 0.0693464 f1: 0.0000000: 27%|██▋ | 405/1497 [03:23<09:11, 1.98it/s]epoch: 0 loss: 0.0693464 f1: 0.0000000: 27%|██▋ | 406/1497 [03:23<09:10, 1.98it/s]epoch: 0 loss: 0.2144806 f1: 0.0000000: 27%|██▋ | 406/1497 [03:24<09:10, 1.98it/s]epoch: 0 loss: 0.2144806 f1: 0.0000000: 27%|██▋ | 407/1497 [03:24<09:05, 2.00it/s]epoch: 0 loss: 0.0162647 f1: 0.0000000: 27%|██▋ | 407/1497 [03:24<09:05, 2.00it/s]epoch: 0 loss: 0.0162647 f1: 0.0000000: 27%|██▋ | 408/1497 [03:24<09:01, 2.01it/s]epoch: 0 loss: 0.0282906 f1: 0.0000000: 27%|██▋ | 408/1497 [03:25<09:01, 2.01it/s]epoch: 0 loss: 0.0282906 f1: 0.0000000: 27%|██▋ | 409/1497 [03:25<08:58, 2.02it/s]epoch: 0 loss: 0.1764271 f1: 0.0000000: 27%|██▋ | 409/1497 [03:25<08:58, 2.02it/s]epoch: 0 loss: 0.1764271 f1: 0.0000000: 27%|██▋ | 410/1497 [03:25<08:56, 2.03it/s]epoch: 0 loss: 0.1138566 f1: 0.0000000: 27%|██▋ | 410/1497 
[03:26<08:56, 2.03it/s]epoch: 0 loss: 0.1138566 f1: 0.0000000: 27%|██▋ | 411/1497 [03:26<08:59, 2.01it/s]epoch: 0 loss: 0.2422675 f1: 0.0000000: 27%|██▋ | 411/1497 [03:26<08:59, 2.01it/s]epoch: 0 loss: 0.2422675 f1: 0.0000000: 28%|██▊ | 412/1497 [03:26<08:57, 2.02it/s]epoch: 0 loss: 0.1064706 f1: 0.0000000: 28%|██▊ | 412/1497 [03:27<08:57, 2.02it/s]epoch: 0 loss: 0.1064706 f1: 0.0000000: 28%|██▊ | 413/1497 [03:27<08:59, 2.01it/s]epoch: 0 loss: 0.0221662 f1: 0.0000000: 28%|██▊ | 413/1497 [03:27<08:59, 2.01it/s]epoch: 0 loss: 0.0221662 f1: 0.0000000: 28%|██▊ | 414/1497 [03:27<09:00, 2.00it/s]epoch: 0 loss: 0.0318645 f1: 0.0000000: 28%|██▊ | 414/1497 [03:28<09:00, 2.00it/s]epoch: 0 loss: 0.0318645 f1: 0.0000000: 28%|██▊ | 415/1497 [03:28<09:04, 1.99it/s]epoch: 0 loss: 0.0215515 f1: 0.0000000: 28%|██▊ | 415/1497 [03:28<09:04, 1.99it/s]epoch: 0 loss: 0.0215515 f1: 0.0000000: 28%|██▊ | 416/1497 [03:28<09:06, 1.98it/s]epoch: 0 loss: 0.1296414 f1: 0.0000000: 28%|██▊ | 416/1497 [03:29<09:06, 1.98it/s]epoch: 0 loss: 0.1296414 f1: 0.0000000: 28%|██▊ | 417/1497 [03:29<09:06, 1.98it/s]epoch: 0 loss: 0.0797098 f1: 0.0000000: 28%|██▊ | 417/1497 [03:29<09:06, 1.98it/s]epoch: 0 loss: 0.0797098 f1: 0.0000000: 28%|██▊ | 418/1497 [03:29<09:06, 1.97it/s]epoch: 0 loss: 0.0678626 f1: 0.0000000: 28%|██▊ | 418/1497 [03:30<09:06, 1.97it/s]epoch: 0 loss: 0.0678626 f1: 0.0000000: 28%|██▊ | 419/1497 [03:30<09:05, 1.98it/s]epoch: 0 loss: 0.0556527 f1: 0.0000000: 28%|██▊ | 419/1497 [03:30<09:05, 1.98it/s]epoch: 0 loss: 0.0556527 f1: 0.0000000: 28%|██▊ | 420/1497 [03:30<09:02, 1.99it/s]epoch: 0 loss: 0.1068517 f1: 0.0000000: 28%|██▊ | 420/1497 [03:31<09:02, 1.99it/s]epoch: 0 loss: 0.1068517 f1: 0.0000000: 28%|██▊ | 421/1497 [03:31<09:02, 1.98it/s]epoch: 0 loss: 0.0891187 f1: 0.0000000: 28%|██▊ | 421/1497 [03:31<09:02, 1.98it/s]epoch: 0 loss: 0.0891187 f1: 0.0000000: 28%|██▊ | 422/1497 [03:31<09:04, 1.97it/s]epoch: 0 loss: 0.1602325 f1: 0.0000000: 28%|██▊ | 422/1497 [03:32<09:04, 1.97it/s]epoch: 0 
loss: 0.1602325 f1: 0.0000000: 28%|██▊ | 423/1497 [03:32<09:18, 1.92it/s]epoch: 0 loss: 0.1083709 f1: 0.0000000: 28%|██▊ | 423/1497 [03:32<09:18, 1.92it/s]epoch: 0 loss: 0.1083709 f1: 0.0000000: 28%|██▊ | 424/1497 [03:32<09:16, 1.93it/s]epoch: 0 loss: 0.0694547 f1: 0.0000000: 28%|██▊ | 424/1497 [03:33<09:16, 1.93it/s]epoch: 0 loss: 0.0694547 f1: 0.0000000: 28%|██▊ | 425/1497 [03:33<09:11, 1.94it/s]epoch: 0 loss: 0.1003408 f1: 0.0000000: 28%|██▊ | 425/1497 [03:33<09:11, 1.94it/s]epoch: 0 loss: 0.1003408 f1: 0.0000000: 28%|██▊ | 426/1497 [03:33<09:08, 1.95it/s]epoch: 0 loss: 0.0548751 f1: 0.0000000: 28%|██▊ | 426/1497 [03:34<09:08, 1.95it/s]epoch: 0 loss: 0.0548751 f1: 0.0000000: 29%|██▊ | 427/1497 [03:34<09:05, 1.96it/s]epoch: 0 loss: 0.0525652 f1: 0.0000000: 29%|██▊ | 427/1497 [03:34<09:05, 1.96it/s]epoch: 0 loss: 0.0525652 f1: 0.0000000: 29%|██▊ | 428/1497 [03:34<09:03, 1.97it/s]epoch: 0 loss: 0.0312681 f1: 0.0000000: 29%|██▊ | 428/1497 [03:35<09:03, 1.97it/s]epoch: 0 loss: 0.0312681 f1: 0.0000000: 29%|██▊ | 429/1497 [03:35<09:01, 1.97it/s]epoch: 0 loss: 0.0810004 f1: 0.0000000: 29%|██▊ | 429/1497 [03:35<09:01, 1.97it/s]epoch: 0 loss: 0.0810004 f1: 0.0000000: 29%|██▊ | 430/1497 [03:35<08:59, 1.98it/s]epoch: 0 loss: 0.0490766 f1: 0.0000000: 29%|██▊ | 430/1497 [03:36<08:59, 1.98it/s]epoch: 0 loss: 0.0490766 f1: 0.0000000: 29%|██▉ | 431/1497 [03:36<08:58, 1.98it/s]epoch: 0 loss: 0.1474469 f1: 0.0000000: 29%|██▉ | 431/1497 [03:36<08:58, 1.98it/s]epoch: 0 loss: 0.1474469 f1: 0.0000000: 29%|██▉ | 432/1497 [03:36<08:56, 1.98it/s]epoch: 0 loss: 0.0319660 f1: 0.0000000: 29%|██▉ | 432/1497 [03:37<08:56, 1.98it/s]epoch: 0 loss: 0.0319660 f1: 0.0000000: 29%|██▉ | 433/1497 [03:37<08:55, 1.99it/s]epoch: 0 loss: 0.1213701 f1: 0.0000000: 29%|██▉ | 433/1497 [03:37<08:55, 1.99it/s]epoch: 0 loss: 0.1213701 f1: 0.0000000: 29%|██▉ | 434/1497 [03:37<08:57, 1.98it/s]epoch: 0 loss: 0.1091412 f1: 0.0000000: 29%|██▉ | 434/1497 [03:38<08:57, 1.98it/s]epoch: 0 loss: 0.1091412 f1: 0.0000000: 
29%|██▉ | 435/1497 [03:38<08:55, 1.98it/s]epoch: 0 loss: 0.0482651 f1: 0.0000000: 29%|██▉ | 435/1497 [03:39<08:55, 1.98it/s]epoch: 0 loss: 0.0482651 f1: 0.0000000: 29%|██▉ | 436/1497 [03:39<08:56, 1.98it/s]epoch: 0 loss: 0.0462394 f1: 0.0000000: 29%|██▉ | 436/1497 [03:39<08:56, 1.98it/s]epoch: 0 loss: 0.0462394 f1: 0.0000000: 29%|██▉ | 437/1497 [03:39<08:51, 2.00it/s]epoch: 0 loss: 0.0382715 f1: 0.0000000: 29%|██▉ | 437/1497 [03:39<08:51, 2.00it/s]epoch: 0 loss: 0.0382715 f1: 0.0000000: 29%|██▉ | 438/1497 [03:39<08:52, 1.99it/s]epoch: 0 loss: 0.0996739 f1: 0.0000000: 29%|██▉ | 438/1497 [03:40<08:52, 1.99it/s]epoch: 0 loss: 0.0996739 f1: 0.0000000: 29%|██▉ | 439/1497 [03:40<08:55, 1.98it/s]epoch: 0 loss: 0.0563190 f1: 0.0000000: 29%|██▉ | 439/1497 [03:41<08:55, 1.98it/s]epoch: 0 loss: 0.0563190 f1: 0.0000000: 29%|██▉ | 440/1497 [03:41<08:49, 2.00it/s]epoch: 0 loss: 0.0771948 f1: 0.0000000: 29%|██▉ | 440/1497 [03:41<08:49, 2.00it/s]epoch: 0 loss: 0.0771948 f1: 0.0000000: 29%|██▉ | 441/1497 [03:41<08:43, 2.02it/s]epoch: 0 loss: 0.0959267 f1: 0.0000000: 29%|██▉ | 441/1497 [03:41<08:43, 2.02it/s]epoch: 0 loss: 0.0959267 f1: 0.0000000: 30%|██▉ | 442/1497 [03:41<08:42, 2.02it/s]epoch: 0 loss: 0.0739617 f1: 0.0000000: 30%|██▉ | 442/1497 [03:42<08:42, 2.02it/s]epoch: 0 loss: 0.0739617 f1: 0.0000000: 30%|██▉ | 443/1497 [03:42<08:38, 2.03it/s]epoch: 0 loss: 0.1921984 f1: 0.0000000: 30%|██▉ | 443/1497 [03:42<08:38, 2.03it/s]epoch: 0 loss: 0.1921984 f1: 0.0000000: 30%|██▉ | 444/1497 [03:42<08:39, 2.03it/s]epoch: 0 loss: 0.0429978 f1: 0.0000000: 30%|██▉ | 444/1497 [03:43<08:39, 2.03it/s]epoch: 0 loss: 0.0429978 f1: 0.0000000: 30%|██▉ | 445/1497 [03:43<08:42, 2.01it/s]epoch: 0 loss: 0.0975183 f1: 0.0000000: 30%|██▉ | 445/1497 [03:43<08:42, 2.01it/s]epoch: 0 loss: 0.0975183 f1: 0.0000000: 30%|██▉ | 446/1497 [03:43<08:43, 2.01it/s]epoch: 0 loss: 0.0873610 f1: 0.0000000: 30%|██▉ | 446/1497 [03:44<08:43, 2.01it/s]epoch: 0 loss: 0.0873610 f1: 0.0000000: 30%|██▉ | 447/1497 
[03:44<08:42, 2.01it/s]epoch: 0 loss: 0.2448678 f1: 0.0000000: 30%|██▉ | 447/1497 [03:44<08:42, 2.01it/s]epoch: 0 loss: 0.2448678 f1: 0.0000000: 30%|██▉ | 448/1497 [03:44<08:42, 2.01it/s]epoch: 0 loss: 0.1906148 f1: 0.0000000: 30%|██▉ | 448/1497 [03:45<08:42, 2.01it/s]epoch: 0 loss: 0.1906148 f1: 0.0000000: 30%|██▉ | 449/1497 [03:45<08:41, 2.01it/s]epoch: 0 loss: 0.1073392 f1: 0.0000000: 30%|██▉ | 449/1497 [03:45<08:41, 2.01it/s]epoch: 0 loss: 0.1073392 f1: 0.0000000: 30%|███ | 450/1497 [03:45<08:39, 2.02it/s]epoch: 0 loss: 0.0954520 f1: 0.0000000: 30%|███ | 450/1497 [03:46<08:39, 2.02it/s]epoch: 0 loss: 0.0954520 f1: 0.0000000: 30%|███ | 451/1497 [03:46<08:38, 2.02it/s]epoch: 0 loss: 0.0617131 f1: 0.0000000: 30%|███ | 451/1497 [03:46<08:38, 2.02it/s]epoch: 0 loss: 0.0617131 f1: 0.0000000: 30%|███ | 452/1497 [03:46<08:42, 2.00it/s]epoch: 0 loss: 0.1629283 f1: 0.0000000: 30%|███ | 452/1497 [03:47<08:42, 2.00it/s]epoch: 0 loss: 0.1629283 f1: 0.0000000: 30%|███ | 453/1497 [03:47<08:43, 1.99it/s]epoch: 0 loss: 0.0593389 f1: 0.0000000: 30%|███ | 453/1497 [03:47<08:43, 1.99it/s]epoch: 0 loss: 0.0593389 f1: 0.0000000: 30%|███ | 454/1497 [03:47<08:43, 1.99it/s]epoch: 0 loss: 0.0973011 f1: 0.0000000: 30%|███ | 454/1497 [03:48<08:43, 1.99it/s]epoch: 0 loss: 0.0973011 f1: 0.0000000: 30%|███ | 455/1497 [03:48<08:44, 1.99it/s]epoch: 0 loss: 0.0516048 f1: 0.0000000: 30%|███ | 455/1497 [03:48<08:44, 1.99it/s]epoch: 0 loss: 0.0516048 f1: 0.0000000: 30%|███ | 456/1497 [03:48<08:44, 1.98it/s]epoch: 0 loss: 0.0612235 f1: 0.0000000: 30%|███ | 456/1497 [03:49<08:44, 1.98it/s]epoch: 0 loss: 0.0612235 f1: 0.0000000: 31%|███ | 457/1497 [03:49<08:46, 1.97it/s]epoch: 0 loss: 0.0296635 f1: 0.0000000: 31%|███ | 457/1497 [03:49<08:46, 1.97it/s]epoch: 0 loss: 0.0296635 f1: 0.0000000: 31%|███ | 458/1497 [03:49<08:48, 1.97it/s]epoch: 0 loss: 0.0423046 f1: 0.0000000: 31%|███ | 458/1497 [03:50<08:48, 1.97it/s]epoch: 0 loss: 0.0423046 f1: 0.0000000: 31%|███ | 459/1497 [03:50<08:47, 1.97it/s]epoch: 0 
loss: 0.0957997 f1: 0.0000000: 31%|███ | 459/1497 [03:51<08:47, 1.97it/s]epoch: 0 loss: 0.0957997 f1: 0.0000000: 31%|███ | 460/1497 [03:51<08:47, 1.97it/s]epoch: 0 loss: 0.1030393 f1: 0.0000000: 31%|███ | 460/1497 [03:51<08:47, 1.97it/s]epoch: 0 loss: 0.1030393 f1: 0.0000000: 31%|███ | 461/1497 [03:51<08:43, 1.98it/s]epoch: 0 loss: 0.0472263 f1: 0.0000000: 31%|███ | 461/1497 [03:52<08:43, 1.98it/s]epoch: 0 loss: 0.0472263 f1: 0.0000000: 31%|███ | 462/1497 [03:52<08:42, 1.98it/s]epoch: 0 loss: 0.0264763 f1: 0.0000000: 31%|███ | 462/1497 [03:52<08:42, 1.98it/s]epoch: 0 loss: 0.0264763 f1: 0.0000000: 31%|███ | 463/1497 [03:52<08:40, 1.99it/s]epoch: 0 loss: 0.1760914 f1: 0.0000000: 31%|███ | 463/1497 [03:53<08:40, 1.99it/s]epoch: 0 loss: 0.1760914 f1: 0.0000000: 31%|███ | 464/1497 [03:53<08:49, 1.95it/s]epoch: 0 loss: 0.0316643 f1: 0.0000000: 31%|███ | 464/1497 [03:53<08:49, 1.95it/s]epoch: 0 loss: 0.0316643 f1: 0.0000000: 31%|███ | 465/1497 [03:53<08:47, 1.96it/s]epoch: 0 loss: 0.0251689 f1: 0.0000000: 31%|███ | 465/1497 [03:54<08:47, 1.96it/s]epoch: 0 loss: 0.0251689 f1: 0.0000000: 31%|███ | 466/1497 [03:54<08:45, 1.96it/s]epoch: 0 loss: 0.1246737 f1: 0.0000000: 31%|███ | 466/1497 [03:54<08:45, 1.96it/s]epoch: 0 loss: 0.1246737 f1: 0.0000000: 31%|███ | 467/1497 [03:54<08:40, 1.98it/s]epoch: 0 loss: 0.0959556 f1: 0.0000000: 31%|███ | 467/1497 [03:55<08:40, 1.98it/s]epoch: 0 loss: 0.0959556 f1: 0.0000000: 31%|███▏ | 468/1497 [03:55<08:38, 1.98it/s]epoch: 0 loss: 0.0121581 f1: 0.0000000: 31%|███▏ | 468/1497 [03:55<08:38, 1.98it/s]epoch: 0 loss: 0.0121581 f1: 0.0000000: 31%|███▏ | 469/1497 [03:55<08:36, 1.99it/s]epoch: 0 loss: 0.0442190 f1: 0.0000000: 31%|███▏ | 469/1497 [03:56<08:36, 1.99it/s]epoch: 0 loss: 0.0442190 f1: 0.0000000: 31%|███▏ | 470/1497 [03:56<08:39, 1.98it/s]epoch: 0 loss: 0.0649952 f1: 0.0000000: 31%|███▏ | 470/1497 [03:56<08:39, 1.98it/s]epoch: 0 loss: 0.0649952 f1: 0.0000000: 31%|███▏ | 471/1497 [03:56<08:39, 1.98it/s]epoch: 0 loss: 0.0949889 f1: 
0.0000000: 31%|███▏ | 471/1497 [03:57<08:39, 1.98it/s]epoch: 0 loss: 0.0949889 f1: 0.0000000: 32%|███▏ | 472/1497 [03:57<08:39, 1.97it/s]epoch: 0 loss: 0.1105877 f1: 0.0000000: 32%|███▏ | 472/1497 [03:57<08:39, 1.97it/s]epoch: 0 loss: 0.1105877 f1: 0.0000000: 32%|███▏ | 473/1497 [03:57<08:38, 1.97it/s]epoch: 0 loss: 0.0665559 f1: 0.0000000: 32%|███▏ | 473/1497 [03:58<08:38, 1.97it/s]epoch: 0 loss: 0.0665559 f1: 0.0000000: 32%|███▏ | 474/1497 [03:58<08:38, 1.97it/s]epoch: 0 loss: 0.0697029 f1: 0.0000000: 32%|███▏ | 474/1497 [03:58<08:38, 1.97it/s]epoch: 0 loss: 0.0697029 f1: 0.0000000: 32%|███▏ | 475/1497 [03:58<08:38, 1.97it/s]epoch: 0 loss: 0.0909009 f1: 0.0000000: 32%|███▏ | 475/1497 [03:59<08:38, 1.97it/s]epoch: 0 loss: 0.0909009 f1: 0.0000000: 32%|███▏ | 476/1497 [03:59<08:40, 1.96it/s]epoch: 0 loss: 0.0868696 f1: 0.0000000: 32%|███▏ | 476/1497 [03:59<08:40, 1.96it/s]epoch: 0 loss: 0.0868696 f1: 0.0000000: 32%|███▏ | 477/1497 [03:59<08:37, 1.97it/s]epoch: 0 loss: 0.1617257 f1: 0.0000000: 32%|███▏ | 477/1497 [04:00<08:37, 1.97it/s]epoch: 0 loss: 0.1617257 f1: 0.0000000: 32%|███▏ | 478/1497 [04:00<08:34, 1.98it/s]epoch: 0 loss: 0.0333356 f1: 0.0000000: 32%|███▏ | 478/1497 [04:00<08:34, 1.98it/s]epoch: 0 loss: 0.0333356 f1: 0.0000000: 32%|███▏ | 479/1497 [04:00<08:33, 1.98it/s]epoch: 0 loss: 0.0511851 f1: 0.0000000: 32%|███▏ | 479/1497 [04:01<08:33, 1.98it/s]epoch: 0 loss: 0.0511851 f1: 0.0000000: 32%|███▏ | 480/1497 [04:01<08:32, 1.99it/s]epoch: 0 loss: 0.0436439 f1: 0.0000000: 32%|███▏ | 480/1497 [04:01<08:32, 1.99it/s]epoch: 0 loss: 0.0436439 f1: 0.0000000: 32%|███▏ | 481/1497 [04:01<08:29, 1.99it/s]epoch: 0 loss: 0.0257884 f1: 0.0000000: 32%|███▏ | 481/1497 [04:02<08:29, 1.99it/s]epoch: 0 loss: 0.0257884 f1: 0.0000000: 32%|███▏ | 482/1497 [04:02<08:30, 1.99it/s]epoch: 0 loss: 0.0257908 f1: 0.0000000: 32%|███▏ | 482/1497 [04:02<08:30, 1.99it/s]epoch: 0 loss: 0.0257908 f1: 0.0000000: 32%|███▏ | 483/1497 [04:02<08:30, 1.99it/s]epoch: 0 loss: 0.0815082 f1: 
0.0000000: 32%|███▏ | 483/1497 [04:03<08:30, 1.99it/s]epoch: 0 loss: 0.0815082 f1: 0.0000000: 32%|███▏ | 484/1497 [04:03<08:29, 1.99it/s]epoch: 0 loss: 0.1281471 f1: 0.0000000: 32%|███▏ | 484/1497 [04:03<08:29, 1.99it/s]epoch: 0 loss: 0.1281471 f1: 0.0000000: 32%|███▏ | 485/1497 [04:03<08:31, 1.98it/s]epoch: 0 loss: 0.0083161 f1: 0.0000000: 32%|███▏ | 485/1497 [04:04<08:31, 1.98it/s]epoch: 0 loss: 0.0083161 f1: 0.0000000: 32%|███▏ | 486/1497 [04:04<08:34, 1.97it/s]epoch: 0 loss: 0.0131326 f1: 0.0000000: 32%|███▏ | 486/1497 [04:04<08:34, 1.97it/s]epoch: 0 loss: 0.0131326 f1: 0.0000000: 33%|███▎ | 487/1497 [04:04<08:30, 1.98it/s]epoch: 0 loss: 0.0229702 f1: 0.0000000: 33%|███▎ | 487/1497 [04:05<08:30, 1.98it/s]epoch: 0 loss: 0.0229702 f1: 0.0000000: 33%|███▎ | 488/1497 [04:05<08:26, 1.99it/s]epoch: 0 loss: 0.0120318 f1: 0.0000000: 33%|███▎ | 488/1497 [04:05<08:26, 1.99it/s]epoch: 0 loss: 0.0120318 f1: 0.0000000: 33%|███▎ | 489/1497 [04:05<08:24, 2.00it/s]epoch: 0 loss: 0.1150650 f1: 0.0000000: 33%|███▎ | 489/1497 [04:06<08:24, 2.00it/s]epoch: 0 loss: 0.1150650 f1: 0.0000000: 33%|███▎ | 490/1497 [04:06<08:20, 2.01it/s]epoch: 0 loss: 0.1525822 f1: 0.0000000: 33%|███▎ | 490/1497 [04:06<08:20, 2.01it/s]epoch: 0 loss: 0.1525822 f1: 0.0000000: 33%|███▎ | 491/1497 [04:06<08:14, 2.03it/s]epoch: 0 loss: 0.0710109 f1: 0.0000000: 33%|███▎ | 491/1497 [04:07<08:14, 2.03it/s]epoch: 0 loss: 0.0710109 f1: 0.0000000: 33%|███▎ | 492/1497 [04:07<08:14, 2.03it/s]epoch: 0 loss: 0.0553544 f1: 0.0000000: 33%|███▎ | 492/1497 [04:07<08:14, 2.03it/s]epoch: 0 loss: 0.0553544 f1: 0.0000000: 33%|███▎ | 493/1497 [04:07<08:17, 2.02it/s]epoch: 0 loss: 0.0476435 f1: 0.0000000: 33%|███▎ | 493/1497 [04:08<08:17, 2.02it/s]epoch: 0 loss: 0.0476435 f1: 0.0000000: 33%|███▎ | 494/1497 [04:08<08:16, 2.02it/s]epoch: 0 loss: 0.1772809 f1: 0.0000000: 33%|███▎ | 494/1497 [04:08<08:16, 2.02it/s]epoch: 0 loss: 0.1772809 f1: 0.0000000: 33%|███▎ | 495/1497 [04:08<08:15, 2.02it/s]epoch: 0 loss: 0.0971408 f1: 
0.0000000: 33%|███▎ | 495/1497 [04:09<08:15, 2.02it/s]epoch: 0 loss: 0.0971408 f1: 0.0000000: 33%|███▎ | 496/1497 [04:09<08:15, 2.02it/s]epoch: 0 loss: 0.0157709 f1: 0.0000000: 33%|███▎ | 496/1497 [04:09<08:15, 2.02it/s]epoch: 0 loss: 0.0157709 f1: 0.0000000: 33%|███▎ | 497/1497 [04:09<08:14, 2.02it/s]epoch: 0 loss: 0.0730150 f1: 0.0000000: 33%|███▎ | 497/1497 [04:10<08:14, 2.02it/s]epoch: 0 loss: 0.0730150 f1: 0.0000000: 33%|███▎ | 498/1497 [04:10<08:15, 2.02it/s]
0%| | 0/1998 [00:00<?, ?it/s]
13%|█▎ | 250/1998 [00:00<00:00, 2493.24it/s]
26%|██▌ | 520/1998 [00:00<00:00, 2550.39it/s]
39%|███▉ | 785/1998 [00:00<00:00, 2576.79it/s]
53%|█████▎ | 1062/1998 [00:00<00:00, 2629.12it/s]
66%|██████▌ | 1320/1998 [00:00<00:00, 2613.68it/s]
80%|████████ | 1604/1998 [00:00<00:00, 2676.74it/s]
94%|█████████▍| 1883/1998 [00:00<00:00, 2708.58it/s]
100%|██████████| 1998/1998 [00:00<00:00, 2672.11it/s]
test: 0%| | 0/63 [00:00<?, ?it/s]
test: 2%|▏ | 1/63 [00:00<00:14, 4.18it/s]
test: 3%|▎ | 2/63 [00:00<00:12, 4.80it/s]
test: 5%|▍ | 3/63 [00:00<00:11, 5.11it/s]
test: 6%|▋ | 4/63 [00:00<00:11, 5.34it/s]
test: 8%|▊ | 5/63 [00:00<00:10, 5.49it/s]
test: 10%|▉ | 6/63 [00:01<00:09, 5.94it/s]
test: 11%|█ | 7/63 [00:01<00:09, 6.03it/s]
test: 13%|█▎ | 8/63 [00:01<00:09, 5.77it/s]
test: 14%|█▍ | 9/63 [00:01<00:08, 6.02it/s]
test: 16%|█▌ | 10/63 [00:01<00:08, 6.36it/s]
test: 17%|█▋ | 11/63 [00:01<00:08, 6.36it/s]
test: 19%|█▉ | 12/63 [00:02<00:08, 5.98it/s]
test: 21%|██ | 13/63 [00:02<00:07, 6.36it/s]
test: 22%|██▏ | 14/63 [00:02<00:07, 6.29it/s]
test: 24%|██▍ | 15/63 [00:02<00:08, 5.92it/s]
test: 25%|██▌ | 16/63 [00:02<00:07, 6.10it/s]
test: 27%|██▋ | 17/63 [00:02<00:07, 6.45it/s]
test: 29%|██▊ | 18/63 [00:02<00:07, 6.24it/s]
test: 30%|███ | 19/63 [00:03<00:07, 5.68it/s]
test: 32%|███▏ | 20/63 [00:03<00:07, 6.08it/s]
test: 33%|███▎ | 21/63 [00:03<00:06, 6.05it/s]
test: 35%|███▍ | 22/63 [00:03<00:07, 5.67it/s]
test: 37%|███▋ | 23/63 [00:03<00:06, 5.85it/s]
test: 38%|███▊ | 24/63 [00:03<00:06, 5.88it/s]
test: 40%|███▉ | 25/63 [00:04<00:06, 6.28it/s]
test: 41%|████▏ | 26/63 [00:04<00:05, 6.17it/s]
test: 43%|████▎ | 27/63 [00:04<00:06, 5.54it/s]
test: 44%|████▍ | 28/63 [00:04<00:05, 5.99it/s]
test: 46%|████▌ | 29/63 [00:04<00:05, 6.19it/s]
test: 48%|████▊ | 30/63 [00:04<00:05, 6.12it/s]
test: 49%|████▉ | 31/63 [00:05<00:05, 6.04it/s]
test: 51%|█████ | 32/63 [00:05<00:04, 6.37it/s]
test: 52%|█████▏ | 33/63 [00:05<00:05, 5.91it/s]
test: 54%|█████▍ | 34/63 [00:05<00:04, 6.06it/s]
test: 56%|█████▌ | 35/63 [00:05<00:04, 6.15it/s]
test: 57%|█████▋ | 36/63 [00:05<00:04, 6.16it/s]
test: 59%|█████▊ | 37/63 [00:06<00:04, 6.25it/s]
test: 60%|██████ | 38/63 [00:06<00:03, 6.40it/s]
test: 62%|██████▏ | 39/63 [00:06<00:03, 6.38it/s]
test: 63%|██████▎ | 40/63 [00:06<00:03, 6.36it/s]
test: 65%|██████▌ | 41/63 [00:06<00:03, 5.94it/s]
test: 67%|██████▋ | 42/63 [00:06<00:03, 6.20it/s]
test: 68%|██████▊ | 43/63 [00:07<00:03, 6.48it/s]
test: 70%|██████▉ | 44/63 [00:07<00:02, 6.48it/s]
test: 71%|███████▏ | 45/63 [00:07<00:02, 6.43it/s]
test: 73%|███████▎ | 46/63 [00:07<00:02, 6.58it/s]
test: 75%|███████▍ | 47/63 [00:07<00:02, 6.33it/s]
test: 76%|███████▌ | 48/63 [00:07<00:02, 6.19it/s]
test: 78%|███████▊ | 49/63 [00:07<00:02, 6.40it/s]
test: 79%|███████▉ | 50/63 [00:08<00:01, 6.51it/s]
test: 81%|████████ | 51/63 [00:08<00:01, 6.31it/s]
test: 83%|████████▎ | 52/63 [00:08<00:01, 6.61it/s]
test: 84%|████████▍ | 53/63 [00:08<00:01, 6.58it/s]
test: 86%|████████▌ | 54/63 [00:08<00:01, 6.78it/s]
test: 87%|████████▋ | 55/63 [00:08<00:01, 6.23it/s]
test: 89%|████████▉ | 56/63 [00:09<00:01, 5.63it/s]
test: 90%|█████████ | 57/63 [00:09<00:01, 5.47it/s]
test: 92%|█████████▏| 58/63 [00:09<00:00, 5.54it/s]
test: 94%|█████████▎| 59/63 [00:09<00:00, 5.85it/s]
test: 95%|█████████▌| 60/63 [00:09<00:00, 5.87it/s]
test: 97%|█████████▋| 61/63 [00:09<00:00, 5.87it/s]
test: 98%|█████████▊| 62/63 [00:10<00:00, 6.16it/s]
test: 100%|██████████| 63/63 [00:10<00:00, 6.85it/s]
[Aepoch: 0 loss: 0.0107419 f1: 0.7262888: 33%|███▎ | 498/1497 [04:38<08:15, 2.02it/s]epoch: 0 loss: 0.0107419 f1: 0.7262888: 33%|███▎ | 499/1497 [04:38<2:26:54, 8.83s/it]epoch: 1 loss: 0.0580775 f1: 0.7262888: 33%|███▎ | 499/1497 [04:39<2:26:54, 8.83s/it]epoch: 1 loss: 0.0580775 f1: 0.7262888: 33%|███▎ | 500/1497 [04:39<1:46:13, 6.39s/it]epoch: 1 loss: 0.3586280 f1: 0.7262888: 33%|███▎ | 500/1497 [04:39<1:46:13, 6.39s/it]epoch: 1 loss: 0.3586280 f1: 0.7262888: 33%|███▎ | 501/1497 [04:39<1:16:48, 4.63s/it]epoch: 1 loss: 0.0988361 f1: 0.7262888: 33%|███▎ | 501/1497 [04:40<1:16:48, 4.63s/it]epoch: 1 loss: 0.0988361 f1: 0.7262888: 34%|███▎ | 502/1497 [04:40<56:13, 3.39s/it] epoch: 1 loss: 0.0268514 f1: 0.7262888: 34%|███▎ | 502/1497 [04:40<56:13, 3.39s/it]epoch: 1 loss: 0.0268514 f1: 0.7262888: 34%|███▎ | 503/1497 [04:40<41:52, 2.53s/it]epoch: 1 loss: 0.1103790 f1: 0.7262888: 34%|███▎ | 503/1497 [04:41<41:52, 2.53s/it]epoch: 1 loss: 0.1103790 f1: 0.7262888: 34%|███▎ | 504/1497 [04:41<31:49, 1.92s/it]epoch: 1 loss: 0.0398400 f1: 0.7262888: 34%|███▎ | 504/1497 [04:41<31:49, 1.92s/it]epoch: 1 loss: 0.0398400 f1: 0.7262888: 34%|███▎ | 505/1497 [04:41<24:47, 1.50s/it]epoch: 1 loss: 0.0105549 f1: 0.7262888: 34%|███▎ | 505/1497 [04:42<24:47, 1.50s/it]epoch: 1 loss: 0.0105549 f1: 0.7262888: 34%|███▍ | 506/1497 [04:42<19:52, 1.20s/it]epoch: 1 loss: 0.0342064 f1: 0.7262888: 34%|███▍ | 506/1497 [04:42<19:52, 1.20s/it]epoch: 1 loss: 0.0342064 f1: 0.7262888: 34%|███▍ | 507/1497 [04:42<16:24, 1.01it/s]epoch: 1 loss: 0.0635941 f1: 0.7262888: 34%|███▍ | 507/1497 [04:43<16:24, 1.01it/s]epoch: 1 loss: 0.0635941 f1: 0.7262888: 34%|███▍ | 508/1497 [04:43<14:01, 1.18it/s]epoch: 1 loss: 0.1071262 f1: 0.7262888: 34%|███▍ | 508/1497 [04:43<14:01, 1.18it/s]epoch: 1 loss: 0.1071262 f1: 0.7262888: 34%|███▍ | 509/1497 [04:43<12:17, 1.34it/s]epoch: 1 loss: 0.0430867 f1: 0.7262888: 34%|███▍ | 509/1497 [04:44<12:17, 1.34it/s]epoch: 1 loss: 0.0430867 f1: 0.7262888: 34%|███▍ | 510/1497 [04:44<11:03, 
1.49it/s]epoch: 1 loss: 0.0618348 f1: 0.7262888: 34%|███▍ | 510/1497 [04:44<11:03, 1.49it/s]epoch: 1 loss: 0.0618348 f1: 0.7262888: 34%|███▍ | 511/1497 [04:44<10:11, 1.61it/s]epoch: 1 loss: 0.0709197 f1: 0.7262888: 34%|███▍ | 511/1497 [04:45<10:11, 1.61it/s]epoch: 1 loss: 0.0709197 f1: 0.7262888: 34%|███▍ | 512/1497 [04:45<09:37, 1.70it/s]epoch: 1 loss: 0.1001101 f1: 0.7262888: 34%|███▍ | 512/1497 [04:45<09:37, 1.70it/s]epoch: 1 loss: 0.1001101 f1: 0.7262888: 34%|███▍ | 513/1497 [04:45<09:14, 1.78it/s]epoch: 1 loss: 0.0771911 f1: 0.7262888: 34%|███▍ | 513/1497 [04:46<09:14, 1.78it/s]epoch: 1 loss: 0.0771911 f1: 0.7262888: 34%|███▍ | 514/1497 [04:46<08:56, 1.83it/s]epoch: 1 loss: 0.1503434 f1: 0.7262888: 34%|███▍ | 514/1497 [04:46<08:56, 1.83it/s]epoch: 1 loss: 0.1503434 f1: 0.7262888: 34%|███▍ | 515/1497 [04:46<08:40, 1.89it/s]epoch: 1 loss: 0.0222649 f1: 0.7262888: 34%|███▍ | 515/1497 [04:47<08:40, 1.89it/s]epoch: 1 loss: 0.0222649 f1: 0.7262888: 34%|███▍ | 516/1497 [04:47<08:32, 1.91it/s]epoch: 1 loss: 0.1111188 f1: 0.7262888: 34%|███▍ | 516/1497 [04:47<08:32, 1.91it/s]epoch: 1 loss: 0.1111188 f1: 0.7262888: 35%|███▍ | 517/1497 [04:47<08:32, 1.91it/s]epoch: 1 loss: 0.0305585 f1: 0.7262888: 35%|███▍ | 517/1497 [04:48<08:32, 1.91it/s]epoch: 1 loss: 0.0305585 f1: 0.7262888: 35%|███▍ | 518/1497 [04:48<08:27, 1.93it/s]epoch: 1 loss: 0.0474285 f1: 0.7262888: 35%|███▍ | 518/1497 [04:48<08:27, 1.93it/s]epoch: 1 loss: 0.0474285 f1: 0.7262888: 35%|███▍ | 519/1497 [04:48<08:24, 1.94it/s]epoch: 1 loss: 0.0569628 f1: 0.7262888: 35%|███▍ | 519/1497 [04:49<08:24, 1.94it/s]epoch: 1 loss: 0.0569628 f1: 0.7262888: 35%|███▍ | 520/1497 [04:49<08:19, 1.96it/s]epoch: 1 loss: 0.0604851 f1: 0.7262888: 35%|███▍ | 520/1497 [04:49<08:19, 1.96it/s]epoch: 1 loss: 0.0604851 f1: 0.7262888: 35%|███▍ | 521/1497 [04:49<08:18, 1.96it/s]epoch: 1 loss: 0.0180062 f1: 0.7262888: 35%|███▍ | 521/1497 [04:50<08:18, 1.96it/s]epoch: 1 loss: 0.0180062 f1: 0.7262888: 35%|███▍ | 522/1497 [04:50<08:16, 
1.97it/s]epoch: 1 loss: 0.0441549 f1: 0.7262888: 35%|███▍ | 522/1497 [04:50<08:16, 1.97it/s]epoch: 1 loss: 0.0441549 f1: 0.7262888: 35%|███▍ | 523/1497 [04:50<08:17, 1.96it/s]epoch: 1 loss: 0.0401628 f1: 0.7262888: 35%|███▍ | 523/1497 [04:51<08:17, 1.96it/s]epoch: 1 loss: 0.0401628 f1: 0.7262888: 35%|███▌ | 524/1497 [04:51<08:18, 1.95it/s]epoch: 1 loss: 0.0520687 f1: 0.7262888: 35%|███▌ | 524/1497 [04:51<08:18, 1.95it/s]epoch: 1 loss: 0.0520687 f1: 0.7262888: 35%|███▌ | 525/1497 [04:51<08:15, 1.96it/s]epoch: 1 loss: 0.0535306 f1: 0.7262888: 35%|███▌ | 525/1497 [04:52<08:15, 1.96it/s]epoch: 1 loss: 0.0535306 f1: 0.7262888: 35%|███▌ | 526/1497 [04:52<08:15, 1.96it/s]epoch: 1 loss: 0.1316015 f1: 0.7262888: 35%|███▌ | 526/1497 [04:52<08:15, 1.96it/s]epoch: 1 loss: 0.1316015 f1: 0.7262888: 35%|███▌ | 527/1497 [04:52<08:13, 1.96it/s]epoch: 1 loss: 0.0632498 f1: 0.7262888: 35%|███▌ | 527/1497 [04:53<08:13, 1.96it/s]epoch: 1 loss: 0.0632498 f1: 0.7262888: 35%|███▌ | 528/1497 [04:53<08:12, 1.97it/s]epoch: 1 loss: 0.2688051 f1: 0.7262888: 35%|███▌ | 528/1497 [04:53<08:12, 1.97it/s]epoch: 1 loss: 0.2688051 f1: 0.7262888: 35%|███▌ | 529/1497 [04:53<08:11, 1.97it/s]epoch: 1 loss: 0.0273816 f1: 0.7262888: 35%|███▌ | 529/1497 [04:54<08:11, 1.97it/s]epoch: 1 loss: 0.0273816 f1: 0.7262888: 35%|███▌ | 530/1497 [04:54<08:12, 1.96it/s]epoch: 1 loss: 0.0233826 f1: 0.7262888: 35%|███▌ | 530/1497 [04:54<08:12, 1.96it/s]epoch: 1 loss: 0.0233826 f1: 0.7262888: 35%|███▌ | 531/1497 [04:54<08:21, 1.92it/s]epoch: 1 loss: 0.0706633 f1: 0.7262888: 35%|███▌ | 531/1497 [04:55<08:21, 1.92it/s]epoch: 1 loss: 0.0706633 f1: 0.7262888: 36%|███▌ | 532/1497 [04:55<08:18, 1.94it/s]epoch: 1 loss: 0.0547495 f1: 0.7262888: 36%|███▌ | 532/1497 [04:55<08:18, 1.94it/s]epoch: 1 loss: 0.0547495 f1: 0.7262888: 36%|███▌ | 533/1497 [04:55<08:15, 1.95it/s]epoch: 1 loss: 0.3871397 f1: 0.7262888: 36%|███▌ | 533/1497 [04:56<08:15, 1.95it/s]epoch: 1 loss: 0.3871397 f1: 0.7262888: 36%|███▌ | 534/1497 [04:56<08:11, 
1.96it/s]epoch: 1 loss: 0.0602433 f1: 0.7262888: 36%|███▌ | 534/1497 [04:56<08:11, 1.96it/s]epoch: 1 loss: 0.0602433 f1: 0.7262888: 36%|███▌ | 535/1497 [04:56<08:09, 1.97it/s]epoch: 1 loss: 0.0313996 f1: 0.7262888: 36%|███▌ | 535/1497 [04:57<08:09, 1.97it/s]epoch: 1 loss: 0.0313996 f1: 0.7262888: 36%|███▌ | 536/1497 [04:57<08:10, 1.96it/s]epoch: 1 loss: 0.1220800 f1: 0.7262888: 36%|███▌ | 536/1497 [04:57<08:10, 1.96it/s]epoch: 1 loss: 0.1220800 f1: 0.7262888: 36%|███▌ | 537/1497 [04:57<08:09, 1.96it/s]epoch: 1 loss: 0.0451662 f1: 0.7262888: 36%|███▌ | 537/1497 [04:58<08:09, 1.96it/s]epoch: 1 loss: 0.0451662 f1: 0.7262888: 36%|███▌ | 538/1497 [04:58<08:06, 1.97it/s]epoch: 1 loss: 0.0197817 f1: 0.7262888: 36%|███▌ | 538/1497 [04:58<08:06, 1.97it/s]epoch: 1 loss: 0.0197817 f1: 0.7262888: 36%|███▌ | 539/1497 [04:58<08:06, 1.97it/s]epoch: 1 loss: 0.0440556 f1: 0.7262888: 36%|███▌ | 539/1497 [04:59<08:06, 1.97it/s]epoch: 1 loss: 0.0440556 f1: 0.7262888: 36%|███▌ | 540/1497 [04:59<08:04, 1.97it/s]epoch: 1 loss: 0.0701036 f1: 0.7262888: 36%|███▌ | 540/1497 [04:59<08:04, 1.97it/s]epoch: 1 loss: 0.0701036 f1: 0.7262888: 36%|███▌ | 541/1497 [04:59<08:04, 1.97it/s]epoch: 1 loss: 0.0147542 f1: 0.7262888: 36%|███▌ | 541/1497 [05:00<08:04, 1.97it/s]epoch: 1 loss: 0.0147542 f1: 0.7262888: 36%|███▌ | 542/1497 [05:00<08:04, 1.97it/s]epoch: 1 loss: 0.1174324 f1: 0.7262888: 36%|███▌ | 542/1497 [05:00<08:04, 1.97it/s]epoch: 1 loss: 0.1174324 f1: 0.7262888: 36%|███▋ | 543/1497 [05:00<08:03, 1.97it/s]epoch: 1 loss: 0.0745073 f1: 0.7262888: 36%|███▋ | 543/1497 [05:01<08:03, 1.97it/s]epoch: 1 loss: 0.0745073 f1: 0.7262888: 36%|███▋ | 544/1497 [05:01<08:04, 1.97it/s]epoch: 1 loss: 0.2391148 f1: 0.7262888: 36%|███▋ | 544/1497 [05:01<08:04, 1.97it/s]epoch: 1 loss: 0.2391148 f1: 0.7262888: 36%|███▋ | 545/1497 [05:01<08:02, 1.97it/s]epoch: 1 loss: 0.0292535 f1: 0.7262888: 36%|███▋ | 545/1497 [05:02<08:02, 1.97it/s]epoch: 1 loss: 0.0292535 f1: 0.7262888: 36%|███▋ | 546/1497 [05:02<08:02, 
1.97it/s]epoch: 1 loss: 0.0499889 f1: 0.7262888: 36%|███▋ | 546/1497 [05:02<08:02, 1.97it/s]epoch: 1 loss: 0.0499889 f1: 0.7262888: 37%|███▋ | 547/1497 [05:02<08:01, 1.97it/s]epoch: 1 loss: 0.0335544 f1: 0.7262888: 37%|███▋ | 547/1497 [05:03<08:01, 1.97it/s]epoch: 1 loss: 0.0335544 f1: 0.7262888: 37%|███▋ | 548/1497 [05:03<08:01, 1.97it/s]epoch: 1 loss: 0.0630557 f1: 0.7262888: 37%|███▋ | 548/1497 [05:04<08:01, 1.97it/s]epoch: 1 loss: 0.0630557 f1: 0.7262888: 37%|███▋ | 549/1497 [05:04<08:04, 1.96it/s]epoch: 1 loss: 0.0487120 f1: 0.7262888: 37%|███▋ | 549/1497 [05:04<08:04, 1.96it/s]epoch: 1 loss: 0.0487120 f1: 0.7262888: 37%|███▋ | 550/1497 [05:04<08:03, 1.96it/s]epoch: 1 loss: 0.0856882 f1: 0.7262888: 37%|███▋ | 550/1497 [05:05<08:03, 1.96it/s]epoch: 1 loss: 0.0856882 f1: 0.7262888: 37%|███▋ | 551/1497 [05:05<08:01, 1.97it/s]epoch: 1 loss: 0.1021181 f1: 0.7262888: 37%|███▋ | 551/1497 [05:05<08:01, 1.97it/s]epoch: 1 loss: 0.1021181 f1: 0.7262888: 37%|███▋ | 552/1497 [05:05<08:00, 1.97it/s]epoch: 1 loss: 0.1056844 f1: 0.7262888: 37%|███▋ | 552/1497 [05:06<08:00, 1.97it/s]epoch: 1 loss: 0.1056844 f1: 0.7262888: 37%|███▋ | 553/1497 [05:06<08:01, 1.96it/s]epoch: 1 loss: 0.1550541 f1: 0.7262888: 37%|███▋ | 553/1497 [05:06<08:01, 1.96it/s]epoch: 1 loss: 0.1550541 f1: 0.7262888: 37%|███▋ | 554/1497 [05:06<08:01, 1.96it/s]epoch: 1 loss: 0.0446448 f1: 0.7262888: 37%|███▋ | 554/1497 [05:07<08:01, 1.96it/s]epoch: 1 loss: 0.0446448 f1: 0.7262888: 37%|███▋ | 555/1497 [05:07<07:58, 1.97it/s]epoch: 1 loss: 0.0108735 f1: 0.7262888: 37%|███▋ | 555/1497 [05:07<07:58, 1.97it/s]epoch: 1 loss: 0.0108735 f1: 0.7262888: 37%|███▋ | 556/1497 [05:07<07:56, 1.97it/s]epoch: 1 loss: 0.0199259 f1: 0.7262888: 37%|███▋ | 556/1497 [05:08<07:56, 1.97it/s]epoch: 1 loss: 0.0199259 f1: 0.7262888: 37%|███▋ | 557/1497 [05:08<07:55, 1.98it/s]epoch: 1 loss: 0.0488419 f1: 0.7262888: 37%|███▋ | 557/1497 [05:08<07:55, 1.98it/s]epoch: 1 loss: 0.0488419 f1: 0.7262888: 37%|███▋ | 558/1497 [05:08<07:54, 
1.98it/s]epoch: 1 loss: 0.0349348 f1: 0.7262888: 37%|███▋ | 558/1497 [05:09<07:54, 1.98it/s]epoch: 1 loss: 0.0349348 f1: 0.7262888: 37%|███▋ | 559/1497 [05:09<07:54, 1.98it/s]epoch: 1 loss: 0.0137145 f1: 0.7262888: 37%|███▋ | 559/1497 [05:09<07:54, 1.98it/s]epoch: 1 loss: 0.0137145 f1: 0.7262888: 37%|███▋ | 560/1497 [05:09<07:53, 1.98it/s]epoch: 1 loss: 0.0324691 f1: 0.7262888: 37%|███▋ | 560/1497 [05:10<07:53, 1.98it/s]epoch: 1 loss: 0.0324691 f1: 0.7262888: 37%|███▋ | 561/1497 [05:10<07:54, 1.97it/s]epoch: 1 loss: 0.0718456 f1: 0.7262888: 37%|███▋ | 561/1497 [05:10<07:54, 1.97it/s]epoch: 1 loss: 0.0718456 f1: 0.7262888: 38%|███▊ | 562/1497 [05:10<07:52, 1.98it/s]epoch: 1 loss: 0.0507948 f1: 0.7262888: 38%|███▊ | 562/1497 [05:11<07:52, 1.98it/s]epoch: 1 loss: 0.0507948 f1: 0.7262888: 38%|███▊ | 563/1497 [05:11<07:53, 1.97it/s]epoch: 1 loss: 0.0848016 f1: 0.7262888: 38%|███▊ | 563/1497 [05:11<07:53, 1.97it/s]epoch: 1 loss: 0.0848016 f1: 0.7262888: 38%|███▊ | 564/1497 [05:11<07:55, 1.96it/s]epoch: 1 loss: 0.0071229 f1: 0.7262888: 38%|███▊ | 564/1497 [05:12<07:55, 1.96it/s]epoch: 1 loss: 0.0071229 f1: 0.7262888: 38%|███▊ | 565/1497 [05:12<07:55, 1.96it/s]epoch: 1 loss: 0.0450744 f1: 0.7262888: 38%|███▊ | 565/1497 [05:12<07:55, 1.96it/s]epoch: 1 loss: 0.0450744 f1: 0.7262888: 38%|███▊ | 566/1497 [05:12<07:53, 1.97it/s]epoch: 1 loss: 0.0203848 f1: 0.7262888: 38%|███▊ | 566/1497 [05:13<07:53, 1.97it/s]epoch: 1 loss: 0.0203848 f1: 0.7262888: 38%|███▊ | 567/1497 [05:13<07:51, 1.97it/s]epoch: 1 loss: 0.0839875 f1: 0.7262888: 38%|███▊ | 567/1497 [05:13<07:51, 1.97it/s]epoch: 1 loss: 0.0839875 f1: 0.7262888: 38%|███▊ | 568/1497 [05:13<07:50, 1.97it/s]epoch: 1 loss: 0.1391359 f1: 0.7262888: 38%|███▊ | 568/1497 [05:14<07:50, 1.97it/s]epoch: 1 loss: 0.1391359 f1: 0.7262888: 38%|███▊ | 569/1497 [05:14<07:48, 1.98it/s]epoch: 1 loss: 0.1016536 f1: 0.7262888: 38%|███▊ | 569/1497 [05:14<07:48, 1.98it/s]epoch: 1 loss: 0.1016536 f1: 0.7262888: 38%|███▊ | 570/1497 [05:14<07:47, 
1.98it/s]epoch: 1 loss: 0.0382071 f1: 0.7262888: 38%|███▊ | 570/1497 [05:15<07:47, 1.98it/s]epoch: 1 loss: 0.0382071 f1: 0.7262888: 38%|███▊ | 571/1497 [05:15<07:50, 1.97it/s]epoch: 1 loss: 0.0482412 f1: 0.7262888: 38%|███▊ | 571/1497 [05:15<07:50, 1.97it/s]epoch: 1 loss: 0.0482412 f1: 0.7262888: 38%|███▊ | 572/1497 [05:15<08:00, 1.93it/s]epoch: 1 loss: 0.1256371 f1: 0.7262888: 38%|███▊ | 572/1497 [05:16<08:00, 1.93it/s]epoch: 1 loss: 0.1256371 f1: 0.7262888: 38%|███▊ | 573/1497 [05:16<08:00, 1.92it/s]epoch: 1 loss: 0.0267708 f1: 0.7262888: 38%|███▊ | 573/1497 [05:16<08:00, 1.92it/s]epoch: 1 loss: 0.0267708 f1: 0.7262888: 38%|███▊ | 574/1497 [05:16<08:00, 1.92it/s]epoch: 1 loss: 0.0180858 f1: 0.7262888: 38%|███▊ | 574/1497 [05:17<08:00, 1.92it/s]epoch: 1 loss: 0.0180858 f1: 0.7262888: 38%|███▊ | 575/1497 [05:17<07:58, 1.93it/s]epoch: 1 loss: 0.0644917 f1: 0.7262888: 38%|███▊ | 575/1497 [05:17<07:58, 1.93it/s]epoch: 1 loss: 0.0644917 f1: 0.7262888: 38%|███▊ | 576/1497 [05:17<08:01, 1.91it/s]epoch: 1 loss: 0.0492978 f1: 0.7262888: 38%|███▊ | 576/1497 [05:18<08:01, 1.91it/s]epoch: 1 loss: 0.0492978 f1: 0.7262888: 39%|███▊ | 577/1497 [05:18<07:58, 1.92it/s]epoch: 1 loss: 0.0346397 f1: 0.7262888: 39%|███▊ | 577/1497 [05:18<07:58, 1.92it/s]epoch: 1 loss: 0.0346397 f1: 0.7262888: 39%|███▊ | 578/1497 [05:18<07:54, 1.94it/s]epoch: 1 loss: 0.0291326 f1: 0.7262888: 39%|███▊ | 578/1497 [05:19<07:54, 1.94it/s]epoch: 1 loss: 0.0291326 f1: 0.7262888: 39%|███▊ | 579/1497 [05:19<07:51, 1.95it/s]epoch: 1 loss: 0.1457260 f1: 0.7262888: 39%|███▊ | 579/1497 [05:19<07:51, 1.95it/s]epoch: 1 loss: 0.1457260 f1: 0.7262888: 39%|███▊ | 580/1497 [05:19<07:49, 1.95it/s]epoch: 1 loss: 0.0356986 f1: 0.7262888: 39%|███▊ | 580/1497 [05:20<07:49, 1.95it/s]epoch: 1 loss: 0.0356986 f1: 0.7262888: 39%|███▉ | 581/1497 [05:20<07:47, 1.96it/s]epoch: 1 loss: 0.0836907 f1: 0.7262888: 39%|███▉ | 581/1497 [05:20<07:47, 1.96it/s]epoch: 1 loss: 0.0836907 f1: 0.7262888: 39%|███▉ | 582/1497 [05:20<07:47, 
1.96it/s]epoch: 1 loss: 0.0666379 f1: 0.7262888: 39%|███▉ | 582/1497 [05:21<07:47, 1.96it/s]epoch: 1 loss: 0.0666379 f1: 0.7262888: 39%|███▉ | 583/1497 [05:21<07:51, 1.94it/s]epoch: 1 loss: 0.0207570 f1: 0.7262888: 39%|███▉ | 583/1497 [05:21<07:51, 1.94it/s]epoch: 1 loss: 0.0207570 f1: 0.7262888: 39%|███▉ | 584/1497 [05:21<07:49, 1.95it/s]epoch: 1 loss: 0.0089174 f1: 0.7262888: 39%|███▉ | 584/1497 [05:22<07:49, 1.95it/s]epoch: 1 loss: 0.0089174 f1: 0.7262888: 39%|███▉ | 585/1497 [05:22<07:49, 1.94it/s]epoch: 1 loss: 0.1036387 f1: 0.7262888: 39%|███▉ | 585/1497 [05:22<07:49, 1.94it/s]epoch: 1 loss: 0.1036387 f1: 0.7262888: 39%|███▉ | 586/1497 [05:22<07:47, 1.95it/s]epoch: 1 loss: 0.1136752 f1: 0.7262888: 39%|███▉ | 586/1497 [05:23<07:47, 1.95it/s]epoch: 1 loss: 0.1136752 f1: 0.7262888: 39%|███▉ | 587/1497 [05:23<07:43, 1.96it/s]epoch: 1 loss: 0.0822177 f1: 0.7262888: 39%|███▉ | 587/1497 [05:23<07:43, 1.96it/s]epoch: 1 loss: 0.0822177 f1: 0.7262888: 39%|███▉ | 588/1497 [05:23<07:40, 1.97it/s]epoch: 1 loss: 0.0264228 f1: 0.7262888: 39%|███▉ | 588/1497 [05:24<07:40, 1.97it/s]epoch: 1 loss: 0.0264228 f1: 0.7262888: 39%|███▉ | 589/1497 [05:24<07:41, 1.97it/s]epoch: 1 loss: 0.1264771 f1: 0.7262888: 39%|███▉ | 589/1497 [05:24<07:41, 1.97it/s]epoch: 1 loss: 0.1264771 f1: 0.7262888: 39%|███▉ | 590/1497 [05:24<07:40, 1.97it/s]epoch: 1 loss: 0.0077261 f1: 0.7262888: 39%|███▉ | 590/1497 [05:25<07:40, 1.97it/s]epoch: 1 loss: 0.0077261 f1: 0.7262888: 39%|███▉ | 591/1497 [05:25<07:37, 1.98it/s]epoch: 1 loss: 0.1512692 f1: 0.7262888: 39%|███▉ | 591/1497 [05:25<07:37, 1.98it/s]epoch: 1 loss: 0.1512692 f1: 0.7262888: 40%|███▉ | 592/1497 [05:25<07:36, 1.98it/s]epoch: 1 loss: 0.0147935 f1: 0.7262888: 40%|███▉ | 592/1497 [05:26<07:36, 1.98it/s]epoch: 1 loss: 0.0147935 f1: 0.7262888: 40%|███▉ | 593/1497 [05:26<07:35, 1.99it/s]epoch: 1 loss: 0.0263700 f1: 0.7262888: 40%|███▉ | 593/1497 [05:26<07:35, 1.99it/s]epoch: 1 loss: 0.0263700 f1: 0.7262888: 40%|███▉ | 594/1497 [05:26<07:35, 
1.98it/s]epoch: 1 loss: 0.2477663 f1: 0.7262888: 40%|███▉ | 594/1497 [05:27<07:35, 1.98it/s]epoch: 1 loss: 0.2477663 f1: 0.7262888: 40%|███▉ | 595/1497 [05:27<07:36, 1.98it/s]epoch: 1 loss: 0.1539031 f1: 0.7262888: 40%|███▉ | 595/1497 [05:27<07:36, 1.98it/s]epoch: 1 loss: 0.1539031 f1: 0.7262888: 40%|███▉ | 596/1497 [05:27<07:35, 1.98it/s]epoch: 1 loss: 0.0437970 f1: 0.7262888: 40%|███▉ | 596/1497 [05:28<07:35, 1.98it/s]epoch: 1 loss: 0.0437970 f1: 0.7262888: 40%|███▉ | 597/1497 [05:28<07:33, 1.98it/s]epoch: 1 loss: 0.0506824 f1: 0.7262888: 40%|███▉ | 597/1497 [05:28<07:33, 1.98it/s]epoch: 1 loss: 0.0506824 f1: 0.7262888: 40%|███▉ | 598/1497 [05:28<07:32, 1.99it/s]epoch: 1 loss: 0.0054459 f1: 0.7262888: 40%|███▉ | 598/1497 [05:29<07:32, 1.99it/s]epoch: 1 loss: 0.0054459 f1: 0.7262888: 40%|████ | 599/1497 [05:29<07:31, 1.99it/s]epoch: 1 loss: 0.0687020 f1: 0.7262888: 40%|████ | 599/1497 [05:29<07:31, 1.99it/s]epoch: 1 loss: 0.0687020 f1: 0.7262888: 40%|████ | 600/1497 [05:29<07:31, 1.98it/s]epoch: 1 loss: 0.0343248 f1: 0.7262888: 40%|████ | 600/1497 [05:30<07:31, 1.98it/s]epoch: 1 loss: 0.0343248 f1: 0.7262888: 40%|████ | 601/1497 [05:30<07:29, 1.99it/s]epoch: 1 loss: 0.0267744 f1: 0.7262888: 40%|████ | 601/1497 [05:30<07:29, 1.99it/s]epoch: 1 loss: 0.0267744 f1: 0.7262888: 40%|████ | 602/1497 [05:30<07:28, 1.99it/s]epoch: 1 loss: 0.0647468 f1: 0.7262888: 40%|████ | 602/1497 [05:31<07:28, 1.99it/s]epoch: 1 loss: 0.0647468 f1: 0.7262888: 40%|████ | 603/1497 [05:31<07:28, 1.99it/s]epoch: 1 loss: 0.0199501 f1: 0.7262888: 40%|████ | 603/1497 [05:31<07:28, 1.99it/s]epoch: 1 loss: 0.0199501 f1: 0.7262888: 40%|████ | 604/1497 [05:31<07:29, 1.99it/s]epoch: 1 loss: 0.0314011 f1: 0.7262888: 40%|████ | 604/1497 [05:32<07:29, 1.99it/s]epoch: 1 loss: 0.0314011 f1: 0.7262888: 40%|████ | 605/1497 [05:32<07:29, 1.98it/s]epoch: 1 loss: 0.0365740 f1: 0.7262888: 40%|████ | 605/1497 [05:32<07:29, 1.98it/s]epoch: 1 loss: 0.0365740 f1: 0.7262888: 40%|████ | 606/1497 [05:32<07:27, 
1.99it/s]epoch: 1 loss: 0.0280381 f1: 0.7262888: 40%|████ | 606/1497 [05:33<07:27, 1.99it/s]epoch: 1 loss: 0.0280381 f1: 0.7262888: 41%|████ | 607/1497 [05:33<07:24, 2.00it/s]epoch: 1 loss: 0.1917022 f1: 0.7262888: 41%|████ | 607/1497 [05:33<07:24, 2.00it/s]epoch: 1 loss: 0.1917022 f1: 0.7262888: 41%|████ | 608/1497 [05:33<07:25, 1.99it/s]epoch: 1 loss: 0.0325642 f1: 0.7262888: 41%|████ | 608/1497 [05:34<07:25, 1.99it/s]epoch: 1 loss: 0.0325642 f1: 0.7262888: 41%|████ | 609/1497 [05:34<07:20, 2.01it/s]epoch: 1 loss: 0.0474659 f1: 0.7262888: 41%|████ | 609/1497 [05:34<07:20, 2.01it/s]epoch: 1 loss: 0.0474659 f1: 0.7262888: 41%|████ | 610/1497 [05:34<07:20, 2.02it/s]epoch: 1 loss: 0.1245134 f1: 0.7262888: 41%|████ | 610/1497 [05:35<07:20, 2.02it/s]epoch: 1 loss: 0.1245134 f1: 0.7262888: 41%|████ | 611/1497 [05:35<07:22, 2.00it/s]epoch: 1 loss: 0.0260619 f1: 0.7262888: 41%|████ | 611/1497 [05:36<07:22, 2.00it/s]epoch: 1 loss: 0.0260619 f1: 0.7262888: 41%|████ | 612/1497 [05:36<07:31, 1.96it/s]epoch: 1 loss: 0.0510930 f1: 0.7262888: 41%|████ | 612/1497 [05:36<07:31, 1.96it/s]epoch: 1 loss: 0.0510930 f1: 0.7262888: 41%|████ | 613/1497 [05:36<07:33, 1.95it/s]epoch: 1 loss: 0.0072450 f1: 0.7262888: 41%|████ | 613/1497 [05:37<07:33, 1.95it/s]epoch: 1 loss: 0.0072450 f1: 0.7262888: 41%|████ | 614/1497 [05:37<07:30, 1.96it/s]epoch: 1 loss: 0.0231807 f1: 0.7262888: 41%|████ | 614/1497 [05:37<07:30, 1.96it/s]epoch: 1 loss: 0.0231807 f1: 0.7262888: 41%|████ | 615/1497 [05:37<07:25, 1.98it/s]epoch: 1 loss: 0.0339084 f1: 0.7262888: 41%|████ | 615/1497 [05:38<07:25, 1.98it/s]epoch: 1 loss: 0.0339084 f1: 0.7262888: 41%|████ | 616/1497 [05:38<07:22, 1.99it/s]epoch: 1 loss: 0.0591475 f1: 0.7262888: 41%|████ | 616/1497 [05:38<07:22, 1.99it/s]epoch: 1 loss: 0.0591475 f1: 0.7262888: 41%|████ | 617/1497 [05:38<07:18, 2.01it/s]epoch: 1 loss: 0.0193607 f1: 0.7262888: 41%|████ | 617/1497 [05:38<07:18, 2.01it/s]epoch: 1 loss: 0.0193607 f1: 0.7262888: 41%|████▏ | 618/1497 [05:38<07:14, 
2.02it/s]epoch: 1 loss: 0.0871508 f1: 0.7262888: 41%|████▏ | 618/1497 [05:39<07:14, 2.02it/s]epoch: 1 loss: 0.0871508 f1: 0.7262888: 41%|████▏ | 619/1497 [05:39<07:13, 2.03it/s]epoch: 1 loss: 0.0099005 f1: 0.7262888: 41%|████▏ | 619/1497 [05:39<07:13, 2.03it/s]epoch: 1 loss: 0.0099005 f1: 0.7262888: 41%|████▏ | 620/1497 [05:39<07:11, 2.03it/s]epoch: 1 loss: 0.0060270 f1: 0.7262888: 41%|████▏ | 620/1497 [05:40<07:11, 2.03it/s]epoch: 1 loss: 0.0060270 f1: 0.7262888: 41%|████▏ | 621/1497 [05:40<07:10, 2.03it/s]epoch: 1 loss: 0.0294023 f1: 0.7262888: 41%|████▏ | 621/1497 [05:40<07:10, 2.03it/s]epoch: 1 loss: 0.0294023 f1: 0.7262888: 42%|████▏ | 622/1497 [05:40<07:10, 2.03it/s]epoch: 1 loss: 0.1204829 f1: 0.7262888: 42%|████▏ | 622/1497 [05:41<07:10, 2.03it/s]epoch: 1 loss: 0.1204829 f1: 0.7262888: 42%|████▏ | 623/1497 [05:41<07:15, 2.01it/s]epoch: 1 loss: 0.0927581 f1: 0.7262888: 42%|████▏ | 623/1497 [05:41<07:15, 2.01it/s]epoch: 1 loss: 0.0927581 f1: 0.7262888: 42%|████▏ | 624/1497 [05:41<07:17, 2.00it/s]epoch: 1 loss: 0.0115782 f1: 0.7262888: 42%|████▏ | 624/1497 [05:42<07:17, 2.00it/s]epoch: 1 loss: 0.0115782 f1: 0.7262888: 42%|████▏ | 625/1497 [05:42<07:18, 1.99it/s]epoch: 1 loss: 0.0250724 f1: 0.7262888: 42%|████▏ | 625/1497 [05:42<07:18, 1.99it/s]epoch: 1 loss: 0.0250724 f1: 0.7262888: 42%|████▏ | 626/1497 [05:42<07:21, 1.97it/s]epoch: 1 loss: 0.1558072 f1: 0.7262888: 42%|████▏ | 626/1497 [05:43<07:21, 1.97it/s]epoch: 1 loss: 0.1558072 f1: 0.7262888: 42%|████▏ | 627/1497 [05:43<07:19, 1.98it/s]epoch: 1 loss: 0.0695201 f1: 0.7262888: 42%|████▏ | 627/1497 [05:44<07:19, 1.98it/s]epoch: 1 loss: 0.0695201 f1: 0.7262888: 42%|████▏ | 628/1497 [05:44<07:17, 1.99it/s]epoch: 1 loss: 0.0043690 f1: 0.7262888: 42%|████▏ | 628/1497 [05:44<07:17, 1.99it/s]epoch: 1 loss: 0.0043690 f1: 0.7262888: 42%|████▏ | 629/1497 [05:44<07:13, 2.00it/s]epoch: 1 loss: 0.0176552 f1: 0.7262888: 42%|████▏ | 629/1497 [05:44<07:13, 2.00it/s]epoch: 1 loss: 0.0176552 f1: 0.7262888: 42%|████▏ | 
630/1497 [05:44<07:13, 2.00it/s]epoch: 1 loss: 0.1905605 f1: 0.7262888: 42%|████▏ | 630/1497 [05:45<07:13, 2.00it/s]epoch: 1 loss: 0.1905605 f1: 0.7262888: 42%|████▏ | 631/1497 [05:45<07:15, 1.99it/s]epoch: 1 loss: 0.0679271 f1: 0.7262888: 42%|████▏ | 631/1497 [05:46<07:15, 1.99it/s]epoch: 1 loss: 0.0679271 f1: 0.7262888: 42%|████▏ | 632/1497 [05:46<07:17, 1.98it/s]epoch: 1 loss: 0.1866532 f1: 0.7262888: 42%|████▏ | 632/1497 [05:46<07:17, 1.98it/s]epoch: 1 loss: 0.1866532 f1: 0.7262888: 42%|████▏ | 633/1497 [05:46<07:15, 1.98it/s]epoch: 1 loss: 0.0311680 f1: 0.7262888: 42%|████▏ | 633/1497 [05:47<07:15, 1.98it/s]epoch: 1 loss: 0.0311680 f1: 0.7262888: 42%|████▏ | 634/1497 [05:47<07:17, 1.97it/s]epoch: 1 loss: 0.0269590 f1: 0.7262888: 42%|████▏ | 634/1497 [05:47<07:17, 1.97it/s]epoch: 1 loss: 0.0269590 f1: 0.7262888: 42%|████▏ | 635/1497 [05:47<07:19, 1.96it/s]epoch: 1 loss: 0.0886506 f1: 0.7262888: 42%|████▏ | 635/1497 [05:48<07:19, 1.96it/s]epoch: 1 loss: 0.0886506 f1: 0.7262888: 42%|████▏ | 636/1497 [05:48<07:19, 1.96it/s]epoch: 1 loss: 0.1380357 f1: 0.7262888: 42%|████▏ | 636/1497 [05:48<07:19, 1.96it/s]epoch: 1 loss: 0.1380357 f1: 0.7262888: 43%|████▎ | 637/1497 [05:48<07:18, 1.96it/s]epoch: 1 loss: 0.0327084 f1: 0.7262888: 43%|████▎ | 637/1497 [05:49<07:18, 1.96it/s]epoch: 1 loss: 0.0327084 f1: 0.7262888: 43%|████▎ | 638/1497 [05:49<07:15, 1.97it/s]epoch: 1 loss: 0.0934362 f1: 0.7262888: 43%|████▎ | 638/1497 [05:49<07:15, 1.97it/s]epoch: 1 loss: 0.0934362 f1: 0.7262888: 43%|████▎ | 639/1497 [05:49<07:13, 1.98it/s]epoch: 1 loss: 0.0093295 f1: 0.7262888: 43%|████▎ | 639/1497 [05:50<07:13, 1.98it/s]epoch: 1 loss: 0.0093295 f1: 0.7262888: 43%|████▎ | 640/1497 [05:50<07:13, 1.98it/s]epoch: 1 loss: 0.0374725 f1: 0.7262888: 43%|████▎ | 640/1497 [05:50<07:13, 1.98it/s]epoch: 1 loss: 0.0374725 f1: 0.7262888: 43%|████▎ | 641/1497 [05:50<07:14, 1.97it/s]epoch: 1 loss: 0.1014992 f1: 0.7262888: 43%|████▎ | 641/1497 [05:51<07:14, 1.97it/s]epoch: 1 loss: 0.1014992 f1: 
0.7262888: 43%|████▎ | 642/1497 [05:51<07:13, 1.97it/s]epoch: 1 loss: 0.0425339 f1: 0.7262888: 43%|████▎ | 642/1497 [05:51<07:13, 1.97it/s]epoch: 1 loss: 0.0425339 f1: 0.7262888: 43%|████▎ | 643/1497 [05:51<07:11, 1.98it/s]epoch: 1 loss: 0.0247751 f1: 0.7262888: 43%|████▎ | 643/1497 [05:52<07:11, 1.98it/s]epoch: 1 loss: 0.0247751 f1: 0.7262888: 43%|████▎ | 644/1497 [05:52<07:12, 1.97it/s]epoch: 1 loss: 0.0278196 f1: 0.7262888: 43%|████▎ | 644/1497 [05:52<07:12, 1.97it/s]epoch: 1 loss: 0.0278196 f1: 0.7262888: 43%|████▎ | 645/1497 [05:52<07:13, 1.97it/s]epoch: 1 loss: 0.0888340 f1: 0.7262888: 43%|████▎ | 645/1497 [05:53<07:13, 1.97it/s]epoch: 1 loss: 0.0888340 f1: 0.7262888: 43%|████▎ | 646/1497 [05:53<07:13, 1.96it/s]epoch: 1 loss: 0.0618406 f1: 0.7262888: 43%|████▎ | 646/1497 [05:53<07:13, 1.96it/s]epoch: 1 loss: 0.0618406 f1: 0.7262888: 43%|████▎ | 647/1497 [05:53<07:13, 1.96it/s]epoch: 1 loss: 0.0150875 f1: 0.7262888: 43%|████▎ | 647/1497 [05:54<07:13, 1.96it/s]epoch: 1 loss: 0.0150875 f1: 0.7262888: 43%|████▎ | 648/1497 [05:54<07:09, 1.97it/s]epoch: 1 loss: 0.0126645 f1: 0.7262888: 43%|████▎ | 648/1497 [05:54<07:09, 1.97it/s]epoch: 1 loss: 0.0126645 f1: 0.7262888: 43%|████▎ | 649/1497 [05:54<07:09, 1.97it/s]epoch: 1 loss: 0.0292008 f1: 0.7262888: 43%|████▎ | 649/1497 [05:55<07:09, 1.97it/s]epoch: 1 loss: 0.0292008 f1: 0.7262888: 43%|████▎ | 650/1497 [05:55<07:07, 1.98it/s]epoch: 1 loss: 0.0058494 f1: 0.7262888: 43%|████▎ | 650/1497 [05:55<07:07, 1.98it/s]epoch: 1 loss: 0.0058494 f1: 0.7262888: 43%|████▎ | 651/1497 [05:55<07:06, 1.98it/s]epoch: 1 loss: 0.0776870 f1: 0.7262888: 43%|████▎ | 651/1497 [05:56<07:06, 1.98it/s]epoch: 1 loss: 0.0776870 f1: 0.7262888: 44%|████▎ | 652/1497 [05:56<07:06, 1.98it/s]epoch: 1 loss: 0.0979683 f1: 0.7262888: 44%|████▎ | 652/1497 [05:56<07:06, 1.98it/s]epoch: 1 loss: 0.0979683 f1: 0.7262888: 44%|████▎ | 653/1497 [05:56<07:13, 1.95it/s]epoch: 1 loss: 0.0474525 f1: 0.7262888: 44%|████▎ | 653/1497 [05:57<07:13, 1.95it/s]epoch: 1 
loss: 0.0474525 f1: 0.7262888: 44%|████▎ | 654/1497 [05:57<07:13, 1.94it/s]epoch: 1 loss: 0.0799750 f1: 0.7262888: 44%|████▎ | 654/1497 [05:57<07:13, 1.94it/s]epoch: 1 loss: 0.0799750 f1: 0.7262888: 44%|████▍ | 655/1497 [05:57<07:15, 1.93it/s]epoch: 1 loss: 0.0837850 f1: 0.7262888: 44%|████▍ | 655/1497 [05:58<07:15, 1.93it/s]epoch: 1 loss: 0.0837850 f1: 0.7262888: 44%|████▍ | 656/1497 [05:58<07:11, 1.95it/s]epoch: 1 loss: 0.0252580 f1: 0.7262888: 44%|████▍ | 656/1497 [05:58<07:11, 1.95it/s]epoch: 1 loss: 0.0252580 f1: 0.7262888: 44%|████▍ | 657/1497 [05:58<07:09, 1.95it/s]epoch: 1 loss: 0.0087663 f1: 0.7262888: 44%|████▍ | 657/1497 [05:59<07:09, 1.95it/s]epoch: 1 loss: 0.0087663 f1: 0.7262888: 44%|████▍ | 658/1497 [05:59<07:07, 1.96it/s]epoch: 1 loss: 0.0443812 f1: 0.7262888: 44%|████▍ | 658/1497 [05:59<07:07, 1.96it/s]epoch: 1 loss: 0.0443812 f1: 0.7262888: 44%|████▍ | 659/1497 [05:59<07:07, 1.96it/s]epoch: 1 loss: 0.0272282 f1: 0.7262888: 44%|████▍ | 659/1497 [06:00<07:07, 1.96it/s]epoch: 1 loss: 0.0272282 f1: 0.7262888: 44%|████▍ | 660/1497 [06:00<07:04, 1.97it/s]epoch: 1 loss: 0.0171153 f1: 0.7262888: 44%|████▍ | 660/1497 [06:00<07:04, 1.97it/s]epoch: 1 loss: 0.0171153 f1: 0.7262888: 44%|████▍ | 661/1497 [06:00<07:03, 1.97it/s]epoch: 1 loss: 0.0704733 f1: 0.7262888: 44%|████▍ | 661/1497 [06:01<07:03, 1.97it/s]epoch: 1 loss: 0.0704733 f1: 0.7262888: 44%|████▍ | 662/1497 [06:01<07:03, 1.97it/s]epoch: 1 loss: 0.1604182 f1: 0.7262888: 44%|████▍ | 662/1497 [06:01<07:03, 1.97it/s]epoch: 1 loss: 0.1604182 f1: 0.7262888: 44%|████▍ | 663/1497 [06:01<06:59, 1.99it/s]epoch: 1 loss: 0.0223976 f1: 0.7262888: 44%|████▍ | 663/1497 [06:02<06:59, 1.99it/s]epoch: 1 loss: 0.0223976 f1: 0.7262888: 44%|████▍ | 664/1497 [06:02<06:58, 1.99it/s]epoch: 1 loss: 0.0169374 f1: 0.7262888: 44%|████▍ | 664/1497 [06:02<06:58, 1.99it/s]epoch: 1 loss: 0.0169374 f1: 0.7262888: 44%|████▍ | 665/1497 [06:02<06:56, 2.00it/s]epoch: 1 loss: 0.0378629 f1: 0.7262888: 44%|████▍ | 665/1497 [06:03<06:56, 
2.00it/s]epoch: 1 loss: 0.0378629 f1: 0.7262888: 44%|████▍ | 666/1497 [06:03<06:52, 2.01it/s]epoch: 1 loss: 0.0128859 f1: 0.7262888: 44%|████▍ | 666/1497 [06:03<06:52, 2.01it/s]epoch: 1 loss: 0.0128859 f1: 0.7262888: 45%|████▍ | 667/1497 [06:03<06:52, 2.01it/s]epoch: 1 loss: 0.1690345 f1: 0.7262888: 45%|████▍ | 667/1497 [06:04<06:52, 2.01it/s]epoch: 1 loss: 0.1690345 f1: 0.7262888: 45%|████▍ | 668/1497 [06:04<06:52, 2.01it/s]epoch: 1 loss: 0.0947335 f1: 0.7262888: 45%|████▍ | 668/1497 [06:04<06:52, 2.01it/s]epoch: 1 loss: 0.0947335 f1: 0.7262888: 45%|████▍ | 669/1497 [06:04<06:49, 2.02it/s]epoch: 1 loss: 0.0159318 f1: 0.7262888: 45%|████▍ | 669/1497 [06:05<06:49, 2.02it/s]epoch: 1 loss: 0.0159318 f1: 0.7262888: 45%|████▍ | 670/1497 [06:05<06:46, 2.03it/s]epoch: 1 loss: 0.0560452 f1: 0.7262888: 45%|████▍ | 670/1497 [06:05<06:46, 2.03it/s]epoch: 1 loss: 0.0560452 f1: 0.7262888: 45%|████▍ | 671/1497 [06:05<06:46, 2.03it/s]epoch: 1 loss: 0.0196349 f1: 0.7262888: 45%|████▍ | 671/1497 [06:06<06:46, 2.03it/s]epoch: 1 loss: 0.0196349 f1: 0.7262888: 45%|████▍ | 672/1497 [06:06<06:45, 2.04it/s]epoch: 1 loss: 0.0303896 f1: 0.7262888: 45%|████▍ | 672/1497 [06:06<06:45, 2.04it/s]epoch: 1 loss: 0.0303896 f1: 0.7262888: 45%|████▍ | 673/1497 [06:06<06:43, 2.04it/s]epoch: 1 loss: 0.0424394 f1: 0.7262888: 45%|████▍ | 673/1497 [06:07<06:43, 2.04it/s]epoch: 1 loss: 0.0424394 f1: 0.7262888: 45%|████▌ | 674/1497 [06:07<06:46, 2.02it/s]epoch: 1 loss: 0.1339061 f1: 0.7262888: 45%|████▌ | 674/1497 [06:07<06:46, 2.02it/s]epoch: 1 loss: 0.1339061 f1: 0.7262888: 45%|████▌ | 675/1497 [06:07<06:47, 2.02it/s]epoch: 1 loss: 0.1112338 f1: 0.7262888: 45%|████▌ | 675/1497 [06:08<06:47, 2.02it/s]epoch: 1 loss: 0.1112338 f1: 0.7262888: 45%|████▌ | 676/1497 [06:08<06:46, 2.02it/s]epoch: 1 loss: 0.0450322 f1: 0.7262888: 45%|████▌ | 676/1497 [06:08<06:46, 2.02it/s]epoch: 1 loss: 0.0450322 f1: 0.7262888: 45%|████▌ | 677/1497 [06:08<06:47, 2.01it/s]epoch: 1 loss: 0.0628653 f1: 0.7262888: 45%|████▌ | 
677/1497 [06:09<06:47, 2.01it/s]epoch: 1 loss: 0.0628653 f1: 0.7262888: 45%|████▌ | 678/1497 [06:09<06:46, 2.02it/s]epoch: 1 loss: 0.1191080 f1: 0.7262888: 45%|████▌ | 678/1497 [06:09<06:46, 2.02it/s]epoch: 1 loss: 0.1191080 f1: 0.7262888: 45%|████▌ | 679/1497 [06:09<06:45, 2.02it/s]epoch: 1 loss: 0.0221117 f1: 0.7262888: 45%|████▌ | 679/1497 [06:10<06:45, 2.02it/s]epoch: 1 loss: 0.0221117 f1: 0.7262888: 45%|████▌ | 680/1497 [06:10<06:45, 2.01it/s]epoch: 1 loss: 0.0565078 f1: 0.7262888: 45%|████▌ | 680/1497 [06:10<06:45, 2.01it/s]epoch: 1 loss: 0.0565078 f1: 0.7262888: 45%|████▌ | 681/1497 [06:10<06:45, 2.01it/s]epoch: 1 loss: 0.1776972 f1: 0.7262888: 45%|████▌ | 681/1497 [06:11<06:45, 2.01it/s]epoch: 1 loss: 0.1776972 f1: 0.7262888: 46%|████▌ | 682/1497 [06:11<06:43, 2.02it/s]epoch: 1 loss: 0.0229937 f1: 0.7262888: 46%|████▌ | 682/1497 [06:11<06:43, 2.02it/s]epoch: 1 loss: 0.0229937 f1: 0.7262888: 46%|████▌ | 683/1497 [06:11<06:41, 2.03it/s]epoch: 1 loss: 0.0216301 f1: 0.7262888: 46%|████▌ | 683/1497 [06:12<06:41, 2.03it/s]epoch: 1 loss: 0.0216301 f1: 0.7262888: 46%|████▌ | 684/1497 [06:12<06:42, 2.02it/s]epoch: 1 loss: 0.1822248 f1: 0.7262888: 46%|████▌ | 684/1497 [06:12<06:42, 2.02it/s]epoch: 1 loss: 0.1822248 f1: 0.7262888: 46%|████▌ | 685/1497 [06:12<06:41, 2.02it/s]epoch: 1 loss: 0.0871536 f1: 0.7262888: 46%|████▌ | 685/1497 [06:13<06:41, 2.02it/s]epoch: 1 loss: 0.0871536 f1: 0.7262888: 46%|████▌ | 686/1497 [06:13<06:42, 2.01it/s]epoch: 1 loss: 0.1203638 f1: 0.7262888: 46%|████▌ | 686/1497 [06:13<06:42, 2.01it/s]epoch: 1 loss: 0.1203638 f1: 0.7262888: 46%|████▌ | 687/1497 [06:13<06:44, 2.00it/s]epoch: 1 loss: 0.0404658 f1: 0.7262888: 46%|████▌ | 687/1497 [06:14<06:44, 2.00it/s]epoch: 1 loss: 0.0404658 f1: 0.7262888: 46%|████▌ | 688/1497 [06:14<06:40, 2.02it/s]epoch: 1 loss: 0.0159485 f1: 0.7262888: 46%|████▌ | 688/1497 [06:14<06:40, 2.02it/s]epoch: 1 loss: 0.0159485 f1: 0.7262888: 46%|████▌ | 689/1497 [06:14<06:41, 2.01it/s]epoch: 1 loss: 0.0265631 f1: 
0.7262888: 46%|████▌ | 689/1497 [06:15<06:41, 2.01it/s]epoch: 1 loss: 0.0265631 f1: 0.7262888: 46%|████▌ | 690/1497 [06:15<06:39, 2.02it/s]epoch: 1 loss: 0.0896455 f1: 0.7262888: 46%|████▌ | 690/1497 [06:15<06:39, 2.02it/s]epoch: 1 loss: 0.0896455 f1: 0.7262888: 46%|████▌ | 691/1497 [06:15<06:37, 2.03it/s]epoch: 1 loss: 0.0721167 f1: 0.7262888: 46%|████▌ | 691/1497 [06:16<06:37, 2.03it/s]epoch: 1 loss: 0.0721167 f1: 0.7262888: 46%|████▌ | 692/1497 [06:16<06:36, 2.03it/s]epoch: 1 loss: 0.0621186 f1: 0.7262888: 46%|████▌ | 692/1497 [06:16<06:36, 2.03it/s]epoch: 1 loss: 0.0621186 f1: 0.7262888: 46%|████▋ | 693/1497 [06:16<06:36, 2.03it/s]epoch: 1 loss: 0.0812144 f1: 0.7262888: 46%|████▋ | 693/1497 [06:17<06:36, 2.03it/s]epoch: 1 loss: 0.0812144 f1: 0.7262888: 46%|████▋ | 694/1497 [06:17<06:41, 2.00it/s]epoch: 1 loss: 0.0436191 f1: 0.7262888: 46%|████▋ | 694/1497 [06:17<06:41, 2.00it/s]epoch: 1 loss: 0.0436191 f1: 0.7262888: 46%|████▋ | 695/1497 [06:17<06:48, 1.96it/s]epoch: 1 loss: 0.0547329 f1: 0.7262888: 46%|████▋ | 695/1497 [06:18<06:48, 1.96it/s]epoch: 1 loss: 0.0547329 f1: 0.7262888: 46%|████▋ | 696/1497 [06:18<06:47, 1.96it/s]epoch: 1 loss: 0.0765920 f1: 0.7262888: 46%|████▋ | 696/1497 [06:18<06:47, 1.96it/s]epoch: 1 loss: 0.0765920 f1: 0.7262888: 47%|████▋ | 697/1497 [06:18<06:44, 1.98it/s]epoch: 1 loss: 0.0944570 f1: 0.7262888: 47%|████▋ | 697/1497 [06:19<06:44, 1.98it/s]epoch: 1 loss: 0.0944570 f1: 0.7262888: 47%|████▋ | 698/1497 [06:19<06:45, 1.97it/s]epoch: 1 loss: 0.0189153 f1: 0.7262888: 47%|████▋ | 698/1497 [06:19<06:45, 1.97it/s]epoch: 1 loss: 0.0189153 f1: 0.7262888: 47%|████▋ | 699/1497 [06:19<06:49, 1.95it/s]epoch: 1 loss: 0.0964656 f1: 0.7262888: 47%|████▋ | 699/1497 [06:20<06:49, 1.95it/s]epoch: 1 loss: 0.0964656 f1: 0.7262888: 47%|████▋ | 700/1497 [06:20<06:54, 1.93it/s]epoch: 1 loss: 0.0862814 f1: 0.7262888: 47%|████▋ | 700/1497 [06:20<06:54, 1.93it/s]epoch: 1 loss: 0.0862814 f1: 0.7262888: 47%|████▋ | 701/1497 [06:20<06:51, 1.94it/s]epoch: 1 
loss: 0.0330438 f1: 0.7262888: 47%|████▋ | 701/1497 [06:21<06:51, 1.94it/s]epoch: 1 loss: 0.0330438 f1: 0.7262888: 47%|████▋ | 702/1497 [06:21<06:48, 1.95it/s]epoch: 1 loss: 0.0300904 f1: 0.7262888: 47%|████▋ | 702/1497 [06:21<06:48, 1.95it/s]epoch: 1 loss: 0.0300904 f1: 0.7262888: 47%|████▋ | 703/1497 [06:21<06:46, 1.95it/s]epoch: 1 loss: 0.0302466 f1: 0.7262888: 47%|████▋ | 703/1497 [06:22<06:46, 1.95it/s]epoch: 1 loss: 0.0302466 f1: 0.7262888: 47%|████▋ | 704/1497 [06:22<06:43, 1.96it/s]epoch: 1 loss: 0.0270341 f1: 0.7262888: 47%|████▋ | 704/1497 [06:22<06:43, 1.96it/s]epoch: 1 loss: 0.0270341 f1: 0.7262888: 47%|████▋ | 705/1497 [06:22<06:37, 1.99it/s]epoch: 1 loss: 0.1060501 f1: 0.7262888: 47%|████▋ | 705/1497 [06:23<06:37, 1.99it/s]epoch: 1 loss: 0.1060501 f1: 0.7262888: 47%|████▋ | 706/1497 [06:23<06:32, 2.02it/s]epoch: 1 loss: 0.0629966 f1: 0.7262888: 47%|████▋ | 706/1497 [06:23<06:32, 2.02it/s]epoch: 1 loss: 0.0629966 f1: 0.7262888: 47%|████▋ | 707/1497 [06:23<06:32, 2.01it/s]epoch: 1 loss: 0.0276112 f1: 0.7262888: 47%|████▋ | 707/1497 [06:24<06:32, 2.01it/s]epoch: 1 loss: 0.0276112 f1: 0.7262888: 47%|████▋ | 708/1497 [06:24<06:28, 2.03it/s]epoch: 1 loss: 0.1241714 f1: 0.7262888: 47%|████▋ | 708/1497 [06:24<06:28, 2.03it/s]epoch: 1 loss: 0.1241714 f1: 0.7262888: 47%|████▋ | 709/1497 [06:24<06:27, 2.04it/s]epoch: 1 loss: 0.0126243 f1: 0.7262888: 47%|████▋ | 709/1497 [06:25<06:27, 2.04it/s]epoch: 1 loss: 0.0126243 f1: 0.7262888: 47%|████▋ | 710/1497 [06:25<06:29, 2.02it/s]epoch: 1 loss: 0.0868064 f1: 0.7262888: 47%|████▋ | 710/1497 [06:25<06:29, 2.02it/s]epoch: 1 loss: 0.0868064 f1: 0.7262888: 47%|████▋ | 711/1497 [06:25<06:27, 2.03it/s]epoch: 1 loss: 0.0609245 f1: 0.7262888: 47%|████▋ | 711/1497 [06:26<06:27, 2.03it/s]epoch: 1 loss: 0.0609245 f1: 0.7262888: 48%|████▊ | 712/1497 [06:26<06:24, 2.04it/s]epoch: 1 loss: 0.0527017 f1: 0.7262888: 48%|████▊ | 712/1497 [06:26<06:24, 2.04it/s]epoch: 1 loss: 0.0527017 f1: 0.7262888: 48%|████▊ | 713/1497 [06:26<06:21, 
2.05it/s]epoch: 1 loss: 0.1427086 f1: 0.7262888: 48%|████▊ | 713/1497 [06:27<06:21, 2.05it/s]epoch: 1 loss: 0.1427086 f1: 0.7262888: 48%|████▊ | 714/1497 [06:27<06:25, 2.03it/s]epoch: 1 loss: 0.0077038 f1: 0.7262888: 48%|████▊ | 714/1497 [06:27<06:25, 2.03it/s]epoch: 1 loss: 0.0077038 f1: 0.7262888: 48%|████▊ | 715/1497 [06:27<06:25, 2.03it/s]epoch: 1 loss: 0.1176259 f1: 0.7262888: 48%|████▊ | 715/1497 [06:28<06:25, 2.03it/s]epoch: 1 loss: 0.1176259 f1: 0.7262888: 48%|████▊ | 716/1497 [06:28<06:25, 2.03it/s]epoch: 1 loss: 0.1882425 f1: 0.7262888: 48%|████▊ | 716/1497 [06:28<06:25, 2.03it/s]epoch: 1 loss: 0.1882425 f1: 0.7262888: 48%|████▊ | 717/1497 [06:28<06:24, 2.03it/s]epoch: 1 loss: 0.0910026 f1: 0.7262888: 48%|████▊ | 717/1497 [06:29<06:24, 2.03it/s]epoch: 1 loss: 0.0910026 f1: 0.7262888: 48%|████▊ | 718/1497 [06:29<06:24, 2.02it/s]epoch: 1 loss: 0.0516676 f1: 0.7262888: 48%|████▊ | 718/1497 [06:29<06:24, 2.02it/s]epoch: 1 loss: 0.0516676 f1: 0.7262888: 48%|████▊ | 719/1497 [06:29<06:25, 2.02it/s]epoch: 1 loss: 0.2109561 f1: 0.7262888: 48%|████▊ | 719/1497 [06:30<06:25, 2.02it/s]epoch: 1 loss: 0.2109561 f1: 0.7262888: 48%|████▊ | 720/1497 [06:30<06:25, 2.02it/s]epoch: 1 loss: 0.0052638 f1: 0.7262888: 48%|████▊ | 720/1497 [06:30<06:25, 2.02it/s]epoch: 1 loss: 0.0052638 f1: 0.7262888: 48%|████▊ | 721/1497 [06:30<06:24, 2.02it/s]epoch: 1 loss: 0.0357361 f1: 0.7262888: 48%|████▊ | 721/1497 [06:31<06:24, 2.02it/s]epoch: 1 loss: 0.0357361 f1: 0.7262888: 48%|████▊ | 722/1497 [06:31<06:23, 2.02it/s]epoch: 1 loss: 0.0323831 f1: 0.7262888: 48%|████▊ | 722/1497 [06:31<06:23, 2.02it/s]epoch: 1 loss: 0.0323831 f1: 0.7262888: 48%|████▊ | 723/1497 [06:31<06:23, 2.02it/s]epoch: 1 loss: 0.0834704 f1: 0.7262888: 48%|████▊ | 723/1497 [06:32<06:23, 2.02it/s]epoch: 1 loss: 0.0834704 f1: 0.7262888: 48%|████▊ | 724/1497 [06:32<06:22, 2.02it/s]epoch: 1 loss: 0.0336617 f1: 0.7262888: 48%|████▊ | 724/1497 [06:32<06:22, 2.02it/s]epoch: 1 loss: 0.0336617 f1: 0.7262888: 48%|████▊ | 
725/1497 [06:32<06:24, 2.01it/s]epoch: 1 loss: 0.0397279 f1: 0.7262888: 48%|████▊ | 725/1497 [06:33<06:24, 2.01it/s]epoch: 1 loss: 0.0397279 f1: 0.7262888: 48%|████▊ | 726/1497 [06:33<06:23, 2.01it/s]epoch: 1 loss: 0.1003051 f1: 0.7262888: 48%|████▊ | 726/1497 [06:33<06:23, 2.01it/s]epoch: 1 loss: 0.1003051 f1: 0.7262888: 49%|████▊ | 727/1497 [06:33<06:23, 2.01it/s]epoch: 1 loss: 0.1139026 f1: 0.7262888: 49%|████▊ | 727/1497 [06:34<06:23, 2.01it/s]epoch: 1 loss: 0.1139026 f1: 0.7262888: 49%|████▊ | 728/1497 [06:34<06:25, 1.99it/s]epoch: 1 loss: 0.0640077 f1: 0.7262888: 49%|████▊ | 728/1497 [06:34<06:25, 1.99it/s]epoch: 1 loss: 0.0640077 f1: 0.7262888: 49%|████▊ | 729/1497 [06:34<06:24, 2.00it/s]epoch: 1 loss: 0.0917511 f1: 0.7262888: 49%|████▊ | 729/1497 [06:35<06:24, 2.00it/s]epoch: 1 loss: 0.0917511 f1: 0.7262888: 49%|████▉ | 730/1497 [06:35<06:23, 2.00it/s]epoch: 1 loss: 0.0909615 f1: 0.7262888: 49%|████▉ | 730/1497 [06:35<06:23, 2.00it/s]epoch: 1 loss: 0.0909615 f1: 0.7262888: 49%|████▉ | 731/1497 [06:35<06:23, 2.00it/s]epoch: 1 loss: 0.0300680 f1: 0.7262888: 49%|████▉ | 731/1497 [06:36<06:23, 2.00it/s]epoch: 1 loss: 0.0300680 f1: 0.7262888: 49%|████▉ | 732/1497 [06:36<06:25, 1.99it/s]epoch: 1 loss: 0.0632954 f1: 0.7262888: 49%|████▉ | 732/1497 [06:36<06:25, 1.99it/s]epoch: 1 loss: 0.0632954 f1: 0.7262888: 49%|████▉ | 733/1497 [06:36<06:23, 1.99it/s]epoch: 1 loss: 0.0047511 f1: 0.7262888: 49%|████▉ | 733/1497 [06:37<06:23, 1.99it/s]epoch: 1 loss: 0.0047511 f1: 0.7262888: 49%|████▉ | 734/1497 [06:37<06:27, 1.97it/s]epoch: 1 loss: 0.0896793 f1: 0.7262888: 49%|████▉ | 734/1497 [06:37<06:27, 1.97it/s]epoch: 1 loss: 0.0896793 f1: 0.7262888: 49%|████▉ | 735/1497 [06:37<06:28, 1.96it/s]epoch: 1 loss: 0.0462334 f1: 0.7262888: 49%|████▉ | 735/1497 [06:38<06:28, 1.96it/s]epoch: 1 loss: 0.0462334 f1: 0.7262888: 49%|████▉ | 736/1497 [06:38<06:35, 1.93it/s]epoch: 1 loss: 0.0171076 f1: 0.7262888: 49%|████▉ | 736/1497 [06:38<06:35, 1.93it/s]epoch: 1 loss: 0.0171076 f1: 
0.7262888: 49%|████▉ | 737/1497 [06:38<06:33, 1.93it/s]epoch: 1 loss: 0.0493342 f1: 0.7262888: 49%|████▉ | 737/1497 [06:39<06:33, 1.93it/s]epoch: 1 loss: 0.0493342 f1: 0.7262888: 49%|████▉ | 738/1497 [06:39<06:33, 1.93it/s]epoch: 1 loss: 0.1596767 f1: 0.7262888: 49%|████▉ | 738/1497 [06:39<06:33, 1.93it/s]epoch: 1 loss: 0.1596767 f1: 0.7262888: 49%|████▉ | 739/1497 [06:39<06:30, 1.94it/s]epoch: 1 loss: 0.1339857 f1: 0.7262888: 49%|████▉ | 739/1497 [06:40<06:30, 1.94it/s]epoch: 1 loss: 0.1339857 f1: 0.7262888: 49%|████▉ | 740/1497 [06:40<06:28, 1.95it/s]epoch: 1 loss: 0.0427365 f1: 0.7262888: 49%|████▉ | 740/1497 [06:40<06:28, 1.95it/s]epoch: 1 loss: 0.0427365 f1: 0.7262888: 49%|████▉ | 741/1497 [06:40<06:25, 1.96it/s]epoch: 1 loss: 0.0429815 f1: 0.7262888: 49%|████▉ | 741/1497 [06:41<06:25, 1.96it/s]epoch: 1 loss: 0.0429815 f1: 0.7262888: 50%|████▉ | 742/1497 [06:41<06:25, 1.96it/s]epoch: 1 loss: 0.1314987 f1: 0.7262888: 50%|████▉ | 742/1497 [06:41<06:25, 1.96it/s]epoch: 1 loss: 0.1314987 f1: 0.7262888: 50%|████▉ | 743/1497 [06:41<06:24, 1.96it/s]epoch: 1 loss: 0.0152492 f1: 0.7262888: 50%|████▉ | 743/1497 [06:42<06:24, 1.96it/s]epoch: 1 loss: 0.0152492 f1: 0.7262888: 50%|████▉ | 744/1497 [06:42<06:22, 1.97it/s]epoch: 1 loss: 0.0115731 f1: 0.7262888: 50%|████▉ | 744/1497 [06:42<06:22, 1.97it/s]epoch: 1 loss: 0.0115731 f1: 0.7262888: 50%|████▉ | 745/1497 [06:42<06:22, 1.97it/s]epoch: 1 loss: 0.0643192 f1: 0.7262888: 50%|████▉ | 745/1497 [06:43<06:22, 1.97it/s]epoch: 1 loss: 0.0643192 f1: 0.7262888: 50%|████▉ | 746/1497 [06:43<06:24, 1.95it/s]epoch: 1 loss: 0.1197276 f1: 0.7262888: 50%|████▉ | 746/1497 [06:43<06:24, 1.95it/s]epoch: 1 loss: 0.1197276 f1: 0.7262888: 50%|████▉ | 747/1497 [06:43<06:24, 1.95it/s]epoch: 1 loss: 0.0385953 f1: 0.7262888: 50%|████▉ | 747/1497 [06:44<06:24, 1.95it/s]epoch: 1 loss: 0.0385953 f1: 0.7262888: 50%|████▉ | 748/1497 [06:44<06:21, 1.96it/s]epoch: 1 loss: 0.0142119 f1: 0.7262888: 50%|████▉ | 748/1497 [06:44<06:21, 1.96it/s]epoch: 1 
loss: 0.0142119 f1: 0.7262888: 50%|█████ | 749/1497 [06:44<06:20, 1.97it/s]epoch: 1 loss: 0.0126206 f1: 0.7262888: 50%|█████ | 749/1497 [06:45<06:20, 1.97it/s]epoch: 1 loss: 0.0126206 f1: 0.7262888: 50%|█████ | 750/1497 [06:45<06:21, 1.96it/s]epoch: 1 loss: 0.1487538 f1: 0.7262888: 50%|█████ | 750/1497 [06:45<06:21, 1.96it/s]epoch: 1 loss: 0.1487538 f1: 0.7262888: 50%|█████ | 751/1497 [06:45<06:23, 1.95it/s]epoch: 1 loss: 0.0636611 f1: 0.7262888: 50%|█████ | 751/1497 [06:46<06:23, 1.95it/s]epoch: 1 loss: 0.0636611 f1: 0.7262888: 50%|█████ | 752/1497 [06:46<06:21, 1.95it/s]epoch: 1 loss: 0.1063360 f1: 0.7262888: 50%|█████ | 752/1497 [06:46<06:21, 1.95it/s]epoch: 1 loss: 0.1063360 f1: 0.7262888: 50%|█████ | 753/1497 [06:46<06:20, 1.96it/s]epoch: 1 loss: 0.0141517 f1: 0.7262888: 50%|█████ | 753/1497 [06:47<06:20, 1.96it/s]epoch: 1 loss: 0.0141517 f1: 0.7262888: 50%|█████ | 754/1497 [06:47<06:20, 1.95it/s]epoch: 1 loss: 0.0055813 f1: 0.7262888: 50%|█████ | 754/1497 [06:47<06:20, 1.95it/s]epoch: 1 loss: 0.0055813 f1: 0.7262888: 50%|█████ | 755/1497 [06:47<06:19, 1.95it/s]epoch: 1 loss: 0.0208851 f1: 0.7262888: 50%|█████ | 755/1497 [06:48<06:19, 1.95it/s]epoch: 1 loss: 0.0208851 f1: 0.7262888: 51%|█████ | 756/1497 [06:48<06:16, 1.97it/s]epoch: 1 loss: 0.1272922 f1: 0.7262888: 51%|█████ | 756/1497 [06:48<06:16, 1.97it/s]epoch: 1 loss: 0.1272922 f1: 0.7262888: 51%|█████ | 757/1497 [06:48<06:12, 1.99it/s]epoch: 1 loss: 0.0077195 f1: 0.7262888: 51%|█████ | 757/1497 [06:49<06:12, 1.99it/s]epoch: 1 loss: 0.0077195 f1: 0.7262888: 51%|█████ | 758/1497 [06:49<06:12, 1.98it/s]epoch: 1 loss: 0.0216689 f1: 0.7262888: 51%|█████ | 758/1497 [06:49<06:12, 1.98it/s]epoch: 1 loss: 0.0216689 f1: 0.7262888: 51%|█████ | 759/1497 [06:49<06:12, 1.98it/s]epoch: 1 loss: 0.0050533 f1: 0.7262888: 51%|█████ | 759/1497 [06:50<06:12, 1.98it/s]epoch: 1 loss: 0.0050533 f1: 0.7262888: 51%|█████ | 760/1497 [06:50<06:13, 1.98it/s]epoch: 1 loss: 0.0401724 f1: 0.7262888: 51%|█████ | 760/1497 [06:50<06:13, 
1.98it/s]epoch: 1 loss: 0.0401724 f1: 0.7262888: 51%|█████ | 761/1497 [06:50<06:14, 1.96it/s]epoch: 1 loss: 0.0629239 f1: 0.7262888: 51%|█████ | 761/1497 [06:51<06:14, 1.96it/s]epoch: 1 loss: 0.0629239 f1: 0.7262888: 51%|█████ | 762/1497 [06:51<06:14, 1.96it/s]epoch: 1 loss: 0.0818115 f1: 0.7262888: 51%|█████ | 762/1497 [06:51<06:14, 1.96it/s]epoch: 1 loss: 0.0818115 f1: 0.7262888: 51%|█████ | 763/1497 [06:51<06:13, 1.96it/s]epoch: 1 loss: 0.0051552 f1: 0.7262888: 51%|█████ | 763/1497 [06:52<06:13, 1.96it/s]epoch: 1 loss: 0.0051552 f1: 0.7262888: 51%|█████ | 764/1497 [06:52<06:11, 1.97it/s]epoch: 1 loss: 0.0285808 f1: 0.7262888: 51%|█████ | 764/1497 [06:52<06:11, 1.97it/s]epoch: 1 loss: 0.0285808 f1: 0.7262888: 51%|█████ | 765/1497 [06:52<06:12, 1.97it/s]epoch: 1 loss: 0.0174531 f1: 0.7262888: 51%|█████ | 765/1497 [06:53<06:12, 1.97it/s]epoch: 1 loss: 0.0174531 f1: 0.7262888: 51%|█████ | 766/1497 [06:53<06:09, 1.98it/s]epoch: 1 loss: 0.0312080 f1: 0.7262888: 51%|█████ | 766/1497 [06:53<06:09, 1.98it/s]epoch: 1 loss: 0.0312080 f1: 0.7262888: 51%|█████ | 767/1497 [06:53<06:09, 1.97it/s]epoch: 1 loss: 0.0281998 f1: 0.7262888: 51%|█████ | 767/1497 [06:54<06:09, 1.97it/s]epoch: 1 loss: 0.0281998 f1: 0.7262888: 51%|█████▏ | 768/1497 [06:54<06:09, 1.97it/s]epoch: 1 loss: 0.0722351 f1: 0.7262888: 51%|█████▏ | 768/1497 [06:54<06:09, 1.97it/s]epoch: 1 loss: 0.0722351 f1: 0.7262888: 51%|█████▏ | 769/1497 [06:54<06:10, 1.96it/s]epoch: 1 loss: 0.0271545 f1: 0.7262888: 51%|█████▏ | 769/1497 [06:55<06:10, 1.96it/s]epoch: 1 loss: 0.0271545 f1: 0.7262888: 51%|█████▏ | 770/1497 [06:55<06:10, 1.96it/s]epoch: 1 loss: 0.0058709 f1: 0.7262888: 51%|█████▏ | 770/1497 [06:56<06:10, 1.96it/s]epoch: 1 loss: 0.0058709 f1: 0.7262888: 52%|█████▏ | 771/1497 [06:56<06:09, 1.96it/s]epoch: 1 loss: 0.0464121 f1: 0.7262888: 52%|█████▏ | 771/1497 [06:56<06:09, 1.96it/s]epoch: 1 loss: 0.0464121 f1: 0.7262888: 52%|█████▏ | 772/1497 [06:56<06:12, 1.95it/s]epoch: 1 loss: 0.0204287 f1: 0.7262888: 
52%|█████▏ | 772/1497 [06:57<06:12, 1.95it/s]epoch: 1 loss: 0.0204287 f1: 0.7262888: 52%|█████▏ | 773/1497 [06:57<06:12, 1.95it/s]epoch: 1 loss: 0.0554792 f1: 0.7262888: 52%|█████▏ | 773/1497 [06:57<06:12, 1.95it/s]epoch: 1 loss: 0.0554792 f1: 0.7262888: 52%|█████▏ | 774/1497 [06:57<06:10, 1.95it/s]epoch: 1 loss: 0.0333875 f1: 0.7262888: 52%|█████▏ | 774/1497 [06:58<06:10, 1.95it/s]epoch: 1 loss: 0.0333875 f1: 0.7262888: 52%|█████▏ | 775/1497 [06:58<06:09, 1.95it/s]epoch: 1 loss: 0.0940263 f1: 0.7262888: 52%|█████▏ | 775/1497 [06:58<06:09, 1.95it/s]epoch: 1 loss: 0.0940263 f1: 0.7262888: 52%|█████▏ | 776/1497 [06:58<06:15, 1.92it/s]epoch: 1 loss: 0.0571273 f1: 0.7262888: 52%|█████▏ | 776/1497 [06:59<06:15, 1.92it/s]epoch: 1 loss: 0.0571273 f1: 0.7262888: 52%|█████▏ | 777/1497 [06:59<06:16, 1.91it/s]epoch: 1 loss: 0.0044047 f1: 0.7262888: 52%|█████▏ | 777/1497 [06:59<06:16, 1.91it/s]epoch: 1 loss: 0.0044047 f1: 0.7262888: 52%|█████▏ | 778/1497 [06:59<06:12, 1.93it/s]epoch: 1 loss: 0.0765484 f1: 0.7262888: 52%|█████▏ | 778/1497 [07:00<06:12, 1.93it/s]epoch: 1 loss: 0.0765484 f1: 0.7262888: 52%|█████▏ | 779/1497 [07:00<06:08, 1.95it/s]epoch: 1 loss: 0.0037719 f1: 0.7262888: 52%|█████▏ | 779/1497 [07:00<06:08, 1.95it/s]epoch: 1 loss: 0.0037719 f1: 0.7262888: 52%|█████▏ | 780/1497 [07:00<06:05, 1.96it/s]epoch: 1 loss: 0.0887827 f1: 0.7262888: 52%|█████▏ | 780/1497 [07:01<06:05, 1.96it/s]epoch: 1 loss: 0.0887827 f1: 0.7262888: 52%|█████▏ | 781/1497 [07:01<06:03, 1.97it/s]epoch: 1 loss: 0.0042872 f1: 0.7262888: 52%|█████▏ | 781/1497 [07:01<06:03, 1.97it/s]epoch: 1 loss: 0.0042872 f1: 0.7262888: 52%|█████▏ | 782/1497 [07:01<06:03, 1.97it/s]epoch: 1 loss: 0.3058083 f1: 0.7262888: 52%|█████▏ | 782/1497 [07:02<06:03, 1.97it/s]epoch: 1 loss: 0.3058083 f1: 0.7262888: 52%|█████▏ | 783/1497 [07:02<06:01, 1.97it/s]epoch: 1 loss: 0.0361580 f1: 0.7262888: 52%|█████▏ | 783/1497 [07:02<06:01, 1.97it/s]epoch: 1 loss: 0.0361580 f1: 0.7262888: 52%|█████▏ | 784/1497 [07:02<06:02, 
1.97it/s]epoch: 1 loss: 0.0491599 f1: 0.7262888: 52%|█████▏ | 784/1497 [07:03<06:02, 1.97it/s]epoch: 1 loss: 0.0491599 f1: 0.7262888: 52%|█████▏ | 785/1497 [07:03<06:01, 1.97it/s]epoch: 1 loss: 0.0126539 f1: 0.7262888: 52%|█████▏ | 785/1497 [07:03<06:01, 1.97it/s]epoch: 1 loss: 0.0126539 f1: 0.7262888: 53%|█████▎ | 786/1497 [07:03<06:01, 1.96it/s]epoch: 1 loss: 0.1505143 f1: 0.7262888: 53%|█████▎ | 786/1497 [07:04<06:01, 1.96it/s]epoch: 1 loss: 0.1505143 f1: 0.7262888: 53%|█████▎ | 787/1497 [07:04<06:01, 1.96it/s]epoch: 1 loss: 0.0638044 f1: 0.7262888: 53%|█████▎ | 787/1497 [07:04<06:01, 1.96it/s]epoch: 1 loss: 0.0638044 f1: 0.7262888: 53%|█████▎ | 788/1497 [07:04<06:01, 1.96it/s]epoch: 1 loss: 0.0151928 f1: 0.7262888: 53%|█████▎ | 788/1497 [07:05<06:01, 1.96it/s]epoch: 1 loss: 0.0151928 f1: 0.7262888: 53%|█████▎ | 789/1497 [07:05<06:00, 1.96it/s]epoch: 1 loss: 0.0387300 f1: 0.7262888: 53%|█████▎ | 789/1497 [07:05<06:00, 1.96it/s]epoch: 1 loss: 0.0387300 f1: 0.7262888: 53%|█████▎ | 790/1497 [07:05<06:00, 1.96it/s]epoch: 1 loss: 0.0371168 f1: 0.7262888: 53%|█████▎ | 790/1497 [07:06<06:00, 1.96it/s]epoch: 1 loss: 0.0371168 f1: 0.7262888: 53%|█████▎ | 791/1497 [07:06<06:00, 1.96it/s]epoch: 1 loss: 0.0444743 f1: 0.7262888: 53%|█████▎ | 791/1497 [07:06<06:00, 1.96it/s]epoch: 1 loss: 0.0444743 f1: 0.7262888: 53%|█████▎ | 792/1497 [07:06<06:00, 1.96it/s]epoch: 1 loss: 0.0100050 f1: 0.7262888: 53%|█████▎ | 792/1497 [07:07<06:00, 1.96it/s]epoch: 1 loss: 0.0100050 f1: 0.7262888: 53%|█████▎ | 793/1497 [07:07<05:58, 1.96it/s]epoch: 1 loss: 0.0540966 f1: 0.7262888: 53%|█████▎ | 793/1497 [07:07<05:58, 1.96it/s]epoch: 1 loss: 0.0540966 f1: 0.7262888: 53%|█████▎ | 794/1497 [07:07<05:56, 1.97it/s]epoch: 1 loss: 0.0503451 f1: 0.7262888: 53%|█████▎ | 794/1497 [07:08<05:56, 1.97it/s]epoch: 1 loss: 0.0503451 f1: 0.7262888: 53%|█████▎ | 795/1497 [07:08<05:53, 1.98it/s]epoch: 1 loss: 0.1211543 f1: 0.7262888: 53%|█████▎ | 795/1497 [07:08<05:53, 1.98it/s]epoch: 1 loss: 0.1211543 f1: 
0.7262888: 53%|█████▎ | 796/1497 [07:08<05:52, 1.99it/s]epoch: 1 loss: 0.0317844 f1: 0.7262888: 53%|█████▎ | 796/1497 [07:09<05:52, 1.99it/s]epoch: 1 loss: 0.0317844 f1: 0.7262888: 53%|█████▎ | 797/1497 [07:09<05:50, 2.00it/s]epoch: 1 loss: 0.0143516 f1: 0.7262888: 53%|█████▎ | 797/1497 [07:09<05:50, 2.00it/s]epoch: 1 loss: 0.0143516 f1: 0.7262888: 53%|█████▎ | 798/1497 [07:09<05:49, 2.00it/s]epoch: 1 loss: 0.0200744 f1: 0.7262888: 53%|█████▎ | 798/1497 [07:10<05:49, 2.00it/s]epoch: 1 loss: 0.0200744 f1: 0.7262888: 53%|█████▎ | 799/1497 [07:10<05:48, 2.00it/s]epoch: 1 loss: 0.1093694 f1: 0.7262888: 53%|█████▎ | 799/1497 [07:10<05:48, 2.00it/s]epoch: 1 loss: 0.1093694 f1: 0.7262888: 53%|█████▎ | 800/1497 [07:10<05:48, 2.00it/s]epoch: 1 loss: 0.1294711 f1: 0.7262888: 53%|█████▎ | 800/1497 [07:11<05:48, 2.00it/s]epoch: 1 loss: 0.1294711 f1: 0.7262888: 54%|█████▎ | 801/1497 [07:11<05:47, 2.00it/s]epoch: 1 loss: 0.0314469 f1: 0.7262888: 54%|█████▎ | 801/1497 [07:11<05:47, 2.00it/s]epoch: 1 loss: 0.0314469 f1: 0.7262888: 54%|█████▎ | 802/1497 [07:11<05:45, 2.01it/s]epoch: 1 loss: 0.1416593 f1: 0.7262888: 54%|█████▎ | 802/1497 [07:12<05:45, 2.01it/s]epoch: 1 loss: 0.1416593 f1: 0.7262888: 54%|█████▎ | 803/1497 [07:12<05:43, 2.02it/s]epoch: 1 loss: 0.0163938 f1: 0.7262888: 54%|█████▎ | 803/1497 [07:12<05:43, 2.02it/s]epoch: 1 loss: 0.0163938 f1: 0.7262888: 54%|█████▎ | 804/1497 [07:12<05:43, 2.02it/s]epoch: 1 loss: 0.2424218 f1: 0.7262888: 54%|█████▎ | 804/1497 [07:13<05:43, 2.02it/s]epoch: 1 loss: 0.2424218 f1: 0.7262888: 54%|█████▍ | 805/1497 [07:13<05:42, 2.02it/s]epoch: 1 loss: 0.0399393 f1: 0.7262888: 54%|█████▍ | 805/1497 [07:13<05:42, 2.02it/s]epoch: 1 loss: 0.0399393 f1: 0.7262888: 54%|█████▍ | 806/1497 [07:13<05:43, 2.01it/s]epoch: 1 loss: 0.2726666 f1: 0.7262888: 54%|█████▍ | 806/1497 [07:14<05:43, 2.01it/s]epoch: 1 loss: 0.2726666 f1: 0.7262888: 54%|█████▍ | 807/1497 [07:14<05:41, 2.02it/s]epoch: 1 loss: 0.0764860 f1: 0.7262888: 54%|█████▍ | 807/1497 
[07:14<05:41, 2.02it/s]epoch: 1 loss: 0.0764860 f1: 0.7262888: 54%|█████▍ | 808/1497 [07:14<05:40, 2.02it/s]epoch: 1 loss: 0.0445550 f1: 0.7262888: 54%|█████▍ | 808/1497 [07:15<05:40, 2.02it/s]epoch: 1 loss: 0.0445550 f1: 0.7262888: 54%|█████▍ | 809/1497 [07:15<05:41, 2.02it/s]epoch: 1 loss: 0.2980866 f1: 0.7262888: 54%|█████▍ | 809/1497 [07:15<05:41, 2.02it/s]epoch: 1 loss: 0.2980866 f1: 0.7262888: 54%|█████▍ | 810/1497 [07:15<05:40, 2.02it/s]epoch: 1 loss: 0.0208246 f1: 0.7262888: 54%|█████▍ | 810/1497 [07:16<05:40, 2.02it/s]epoch: 1 loss: 0.0208246 f1: 0.7262888: 54%|█████▍ | 811/1497 [07:16<05:40, 2.01it/s]epoch: 1 loss: 0.0584116 f1: 0.7262888: 54%|█████▍ | 811/1497 [07:16<05:40, 2.01it/s]epoch: 1 loss: 0.0584116 f1: 0.7262888: 54%|█████▍ | 812/1497 [07:16<05:40, 2.01it/s]epoch: 1 loss: 0.1543265 f1: 0.7262888: 54%|█████▍ | 812/1497 [07:17<05:40, 2.01it/s]epoch: 1 loss: 0.1543265 f1: 0.7262888: 54%|█████▍ | 813/1497 [07:17<05:41, 2.00it/s]epoch: 1 loss: 0.0139254 f1: 0.7262888: 54%|█████▍ | 813/1497 [07:17<05:41, 2.00it/s]epoch: 1 loss: 0.0139254 f1: 0.7262888: 54%|█████▍ | 814/1497 [07:17<05:40, 2.00it/s]epoch: 1 loss: 0.0315871 f1: 0.7262888: 54%|█████▍ | 814/1497 [07:18<05:40, 2.00it/s]epoch: 1 loss: 0.0315871 f1: 0.7262888: 54%|█████▍ | 815/1497 [07:18<05:39, 2.01it/s]epoch: 1 loss: 0.0092476 f1: 0.7262888: 54%|█████▍ | 815/1497 [07:18<05:39, 2.01it/s]epoch: 1 loss: 0.0092476 f1: 0.7262888: 55%|█████▍ | 816/1497 [07:18<05:39, 2.01it/s]epoch: 1 loss: 0.0854097 f1: 0.7262888: 55%|█████▍ | 816/1497 [07:19<05:39, 2.01it/s]epoch: 1 loss: 0.0854097 f1: 0.7262888: 55%|█████▍ | 817/1497 [07:19<05:42, 1.98it/s]epoch: 1 loss: 0.0367645 f1: 0.7262888: 55%|█████▍ | 817/1497 [07:19<05:42, 1.98it/s]epoch: 1 loss: 0.0367645 f1: 0.7262888: 55%|█████▍ | 818/1497 [07:19<05:43, 1.98it/s]epoch: 1 loss: 0.0567621 f1: 0.7262888: 55%|█████▍ | 818/1497 [07:20<05:43, 1.98it/s]epoch: 1 loss: 0.0567621 f1: 0.7262888: 55%|█████▍ | 819/1497 [07:20<05:40, 1.99it/s]epoch: 1 loss: 
0.0195059 f1: 0.7262888: 55%|█████▍ | 819/1497 [07:20<05:40, 1.99it/s]epoch: 1 loss: 0.0195059 f1: 0.7262888: 55%|█████▍ | 820/1497 [07:20<05:38, 2.00it/s]epoch: 1 loss: 0.0371747 f1: 0.7262888: 55%|█████▍ | 820/1497 [07:21<05:38, 2.00it/s]epoch: 1 loss: 0.0371747 f1: 0.7262888: 55%|█████▍ | 821/1497 [07:21<05:36, 2.01it/s]epoch: 1 loss: 0.1054605 f1: 0.7262888: 55%|█████▍ | 821/1497 [07:21<05:36, 2.01it/s]epoch: 1 loss: 0.1054605 f1: 0.7262888: 55%|█████▍ | 822/1497 [07:21<05:35, 2.01it/s]epoch: 1 loss: 0.2750557 f1: 0.7262888: 55%|█████▍ | 822/1497 [07:22<05:35, 2.01it/s]epoch: 1 loss: 0.2750557 f1: 0.7262888: 55%|█████▍ | 823/1497 [07:22<05:36, 2.00it/s]epoch: 1 loss: 0.0209699 f1: 0.7262888: 55%|█████▍ | 823/1497 [07:22<05:36, 2.00it/s]epoch: 1 loss: 0.0209699 f1: 0.7262888: 55%|█████▌ | 824/1497 [07:22<05:38, 1.99it/s]epoch: 1 loss: 0.0085482 f1: 0.7262888: 55%|█████▌ | 824/1497 [07:23<05:38, 1.99it/s]epoch: 1 loss: 0.0085482 f1: 0.7262888: 55%|█████▌ | 825/1497 [07:23<05:39, 1.98it/s]epoch: 1 loss: 0.1014571 f1: 0.7262888: 55%|█████▌ | 825/1497 [07:23<05:39, 1.98it/s]epoch: 1 loss: 0.1014571 f1: 0.7262888: 55%|█████▌ | 826/1497 [07:23<05:38, 1.98it/s]epoch: 1 loss: 0.0101902 f1: 0.7262888: 55%|█████▌ | 826/1497 [07:24<05:38, 1.98it/s]epoch: 1 loss: 0.0101902 f1: 0.7262888: 55%|█████▌ | 827/1497 [07:24<05:33, 2.01it/s]epoch: 1 loss: 0.0445456 f1: 0.7262888: 55%|█████▌ | 827/1497 [07:24<05:33, 2.01it/s]epoch: 1 loss: 0.0445456 f1: 0.7262888: 55%|█████▌ | 828/1497 [07:24<05:31, 2.02it/s]epoch: 1 loss: 0.1312699 f1: 0.7262888: 55%|█████▌ | 828/1497 [07:25<05:31, 2.02it/s]epoch: 1 loss: 0.1312699 f1: 0.7262888: 55%|█████▌ | 829/1497 [07:25<05:28, 2.03it/s]epoch: 1 loss: 0.0198272 f1: 0.7262888: 55%|█████▌ | 829/1497 [07:25<05:28, 2.03it/s]epoch: 1 loss: 0.0198272 f1: 0.7262888: 55%|█████▌ | 830/1497 [07:25<05:28, 2.03it/s]epoch: 1 loss: 0.0157343 f1: 0.7262888: 55%|█████▌ | 830/1497 [07:26<05:28, 2.03it/s]epoch: 1 loss: 0.0157343 f1: 0.7262888: 56%|█████▌ | 
831/1497 [07:26<05:27, 2.03it/s]epoch: 1 loss: 0.0234775 f1: 0.7262888: 56%|█████▌ | 831/1497 [07:26<05:27, 2.03it/s]epoch: 1 loss: 0.0234775 f1: 0.7262888: 56%|█████▌ | 832/1497 [07:26<05:30, 2.02it/s]epoch: 1 loss: 0.1338790 f1: 0.7262888: 56%|█████▌ | 832/1497 [07:27<05:30, 2.02it/s]epoch: 1 loss: 0.1338790 f1: 0.7262888: 56%|█████▌ | 833/1497 [07:27<05:30, 2.01it/s]epoch: 1 loss: 0.0406060 f1: 0.7262888: 56%|█████▌ | 833/1497 [07:27<05:30, 2.01it/s]epoch: 1 loss: 0.0406060 f1: 0.7262888: 56%|█████▌ | 834/1497 [07:27<05:30, 2.01it/s]epoch: 1 loss: 0.0923667 f1: 0.7262888: 56%|█████▌ | 834/1497 [07:28<05:30, 2.01it/s]epoch: 1 loss: 0.0923667 f1: 0.7262888: 56%|█████▌ | 835/1497 [07:28<05:30, 2.01it/s]epoch: 1 loss: 0.0494994 f1: 0.7262888: 56%|█████▌ | 835/1497 [07:28<05:30, 2.01it/s]epoch: 1 loss: 0.0494994 f1: 0.7262888: 56%|█████▌ | 836/1497 [07:28<05:28, 2.01it/s]epoch: 1 loss: 0.0632172 f1: 0.7262888: 56%|█████▌ | 836/1497 [07:29<05:28, 2.01it/s]epoch: 1 loss: 0.0632172 f1: 0.7262888: 56%|█████▌ | 837/1497 [07:29<05:27, 2.02it/s]epoch: 1 loss: 0.1079226 f1: 0.7262888: 56%|█████▌ | 837/1497 [07:29<05:27, 2.02it/s]epoch: 1 loss: 0.1079226 f1: 0.7262888: 56%|█████▌ | 838/1497 [07:29<05:27, 2.01it/s]epoch: 1 loss: 0.0336935 f1: 0.7262888: 56%|█████▌ | 838/1497 [07:30<05:27, 2.01it/s]epoch: 1 loss: 0.0336935 f1: 0.7262888: 56%|█████▌ | 839/1497 [07:30<05:26, 2.01it/s]epoch: 1 loss: 0.0302341 f1: 0.7262888: 56%|█████▌ | 839/1497 [07:30<05:26, 2.01it/s]epoch: 1 loss: 0.0302341 f1: 0.7262888: 56%|█████▌ | 840/1497 [07:30<05:25, 2.02it/s]epoch: 1 loss: 0.0898985 f1: 0.7262888: 56%|█████▌ | 840/1497 [07:31<05:25, 2.02it/s]epoch: 1 loss: 0.0898985 f1: 0.7262888: 56%|█████▌ | 841/1497 [07:31<05:22, 2.03it/s]epoch: 1 loss: 0.0206484 f1: 0.7262888: 56%|█████▌ | 841/1497 [07:31<05:22, 2.03it/s]epoch: 1 loss: 0.0206484 f1: 0.7262888: 56%|█████▌ | 842/1497 [07:31<05:21, 2.03it/s]epoch: 1 loss: 0.0284498 f1: 0.7262888: 56%|█████▌ | 842/1497 [07:32<05:21, 2.03it/s]epoch: 1 
loss: 0.0284498 f1: 0.7262888: 56%|█████▋ | 843/1497 [07:32<05:21, 2.03it/s]epoch: 1 loss: 0.1442847 f1: 0.7262888: 56%|█████▋ | 843/1497 [07:32<05:21, 2.03it/s]epoch: 1 loss: 0.1442847 f1: 0.7262888: 56%|█████▋ | 844/1497 [07:32<05:20, 2.04it/s]epoch: 1 loss: 0.0683107 f1: 0.7262888: 56%|█████▋ | 844/1497 [07:33<05:20, 2.04it/s]epoch: 1 loss: 0.0683107 f1: 0.7262888: 56%|█████▋ | 845/1497 [07:33<05:22, 2.02it/s]epoch: 1 loss: 0.0347165 f1: 0.7262888: 56%|█████▋ | 845/1497 [07:33<05:22, 2.02it/s]epoch: 1 loss: 0.0347165 f1: 0.7262888: 57%|█████▋ | 846/1497 [07:33<05:21, 2.02it/s]epoch: 1 loss: 0.0379655 f1: 0.7262888: 57%|█████▋ | 846/1497 [07:34<05:21, 2.02it/s]epoch: 1 loss: 0.0379655 f1: 0.7262888: 57%|█████▋ | 847/1497 [07:34<05:22, 2.01it/s]epoch: 1 loss: 0.0069366 f1: 0.7262888: 57%|█████▋ | 847/1497 [07:34<05:22, 2.01it/s]epoch: 1 loss: 0.0069366 f1: 0.7262888: 57%|█████▋ | 848/1497 [07:34<05:23, 2.01it/s]epoch: 1 loss: 0.0860340 f1: 0.7262888: 57%|█████▋ | 848/1497 [07:35<05:23, 2.01it/s]epoch: 1 loss: 0.0860340 f1: 0.7262888: 57%|█████▋ | 849/1497 [07:35<05:20, 2.02it/s]epoch: 1 loss: 0.0853398 f1: 0.7262888: 57%|█████▋ | 849/1497 [07:35<05:20, 2.02it/s]epoch: 1 loss: 0.0853398 f1: 0.7262888: 57%|█████▋ | 850/1497 [07:35<05:20, 2.02it/s]epoch: 1 loss: 0.0557884 f1: 0.7262888: 57%|█████▋ | 850/1497 [07:36<05:20, 2.02it/s]epoch: 1 loss: 0.0557884 f1: 0.7262888: 57%|█████▋ | 851/1497 [07:36<05:22, 2.00it/s]epoch: 1 loss: 0.0882893 f1: 0.7262888: 57%|█████▋ | 851/1497 [07:36<05:22, 2.00it/s]epoch: 1 loss: 0.0882893 f1: 0.7262888: 57%|█████▋ | 852/1497 [07:36<05:19, 2.02it/s]epoch: 1 loss: 0.0590009 f1: 0.7262888: 57%|█████▋ | 852/1497 [07:37<05:19, 2.02it/s]epoch: 1 loss: 0.0590009 f1: 0.7262888: 57%|█████▋ | 853/1497 [07:37<05:20, 2.01it/s]epoch: 1 loss: 0.0872741 f1: 0.7262888: 57%|█████▋ | 853/1497 [07:37<05:20, 2.01it/s]epoch: 1 loss: 0.0872741 f1: 0.7262888: 57%|█████▋ | 854/1497 [07:37<05:21, 2.00it/s]epoch: 1 loss: 0.0646243 f1: 0.7262888: 57%|█████▋ | 
854/1497 [07:38<05:21, 2.00it/s]epoch: 1 loss: 0.0646243 f1: 0.7262888: 57%|█████▋ | 855/1497 [07:38<05:22, 1.99it/s]epoch: 1 loss: 0.1066555 f1: 0.7262888: 57%|█████▋ | 855/1497 [07:38<05:22, 1.99it/s]epoch: 1 loss: 0.1066555 f1: 0.7262888: 57%|█████▋ | 856/1497 [07:38<05:22, 1.99it/s]epoch: 1 loss: 0.1383247 f1: 0.7262888: 57%|█████▋ | 856/1497 [07:39<05:22, 1.99it/s]epoch: 1 loss: 0.1383247 f1: 0.7262888: 57%|█████▋ | 857/1497 [07:39<05:23, 1.98it/s]epoch: 1 loss: 0.0181779 f1: 0.7262888: 57%|█████▋ | 857/1497 [07:39<05:23, 1.98it/s]epoch: 1 loss: 0.0181779 f1: 0.7262888: 57%|█████▋ | 858/1497 [07:39<05:22, 1.98it/s]epoch: 1 loss: 0.0465784 f1: 0.7262888: 57%|█████▋ | 858/1497 [07:40<05:22, 1.98it/s]epoch: 1 loss: 0.0465784 f1: 0.7262888: 57%|█████▋ | 859/1497 [07:40<05:29, 1.94it/s]epoch: 1 loss: 0.0309249 f1: 0.7262888: 57%|█████▋ | 859/1497 [07:40<05:29, 1.94it/s]epoch: 1 loss: 0.0309249 f1: 0.7262888: 57%|█████▋ | 860/1497 [07:40<05:24, 1.96it/s]epoch: 1 loss: 0.1467674 f1: 0.7262888: 57%|█████▋ | 860/1497 [07:41<05:24, 1.96it/s]epoch: 1 loss: 0.1467674 f1: 0.7262888: 58%|█████▊ | 861/1497 [07:41<05:20, 1.99it/s]epoch: 1 loss: 0.0244217 f1: 0.7262888: 58%|█████▊ | 861/1497 [07:41<05:20, 1.99it/s]epoch: 1 loss: 0.0244217 f1: 0.7262888: 58%|█████▊ | 862/1497 [07:41<05:17, 2.00it/s]epoch: 1 loss: 0.0092365 f1: 0.7262888: 58%|█████▊ | 862/1497 [07:42<05:17, 2.00it/s]epoch: 1 loss: 0.0092365 f1: 0.7262888: 58%|█████▊ | 863/1497 [07:42<05:14, 2.02it/s]epoch: 1 loss: 0.0891105 f1: 0.7262888: 58%|█████▊ | 863/1497 [07:42<05:14, 2.02it/s]epoch: 1 loss: 0.0891105 f1: 0.7262888: 58%|█████▊ | 864/1497 [07:42<05:12, 2.03it/s]epoch: 1 loss: 0.0450804 f1: 0.7262888: 58%|█████▊ | 864/1497 [07:43<05:12, 2.03it/s]epoch: 1 loss: 0.0450804 f1: 0.7262888: 58%|█████▊ | 865/1497 [07:43<05:12, 2.02it/s]epoch: 1 loss: 0.0644351 f1: 0.7262888: 58%|█████▊ | 865/1497 [07:43<05:12, 2.02it/s]epoch: 1 loss: 0.0644351 f1: 0.7262888: 58%|█████▊ | 866/1497 [07:43<05:13, 2.02it/s]epoch: 1 
loss: 0.0376861 f1: 0.7262888: 58%|█████▊ | 866/1497 [07:44<05:13, 2.02it/s]epoch: 1 loss: 0.0376861 f1: 0.7262888: 58%|█████▊ | 867/1497 [07:44<05:14, 2.00it/s]epoch: 1 loss: 0.0486758 f1: 0.7262888: 58%|█████▊ | 867/1497 [07:44<05:14, 2.00it/s]epoch: 1 loss: 0.0486758 f1: 0.7262888: 58%|█████▊ | 868/1497 [07:44<05:12, 2.02it/s]epoch: 1 loss: 0.0527195 f1: 0.7262888: 58%|█████▊ | 868/1497 [07:45<05:12, 2.02it/s]epoch: 1 loss: 0.0527195 f1: 0.7262888: 58%|█████▊ | 869/1497 [07:45<05:10, 2.02it/s]epoch: 1 loss: 0.1186740 f1: 0.7262888: 58%|█████▊ | 869/1497 [07:45<05:10, 2.02it/s]epoch: 1 loss: 0.1186740 f1: 0.7262888: 58%|█████▊ | 870/1497 [07:45<05:10, 2.02it/s]epoch: 1 loss: 0.0102590 f1: 0.7262888: 58%|█████▊ | 870/1497 [07:46<05:10, 2.02it/s]epoch: 1 loss: 0.0102590 f1: 0.7262888: 58%|█████▊ | 871/1497 [07:46<05:13, 2.00it/s]epoch: 1 loss: 0.1469511 f1: 0.7262888: 58%|█████▊ | 871/1497 [07:46<05:13, 2.00it/s]epoch: 1 loss: 0.1469511 f1: 0.7262888: 58%|█████▊ | 872/1497 [07:46<05:13, 1.99it/s]epoch: 1 loss: 0.1716805 f1: 0.7262888: 58%|█████▊ | 872/1497 [07:47<05:13, 1.99it/s]epoch: 1 loss: 0.1716805 f1: 0.7262888: 58%|█████▊ | 873/1497 [07:47<05:15, 1.98it/s]epoch: 1 loss: 0.0392489 f1: 0.7262888: 58%|█████▊ | 873/1497 [07:47<05:15, 1.98it/s]epoch: 1 loss: 0.0392489 f1: 0.7262888: 58%|█████▊ | 874/1497 [07:47<05:18, 1.96it/s]epoch: 1 loss: 0.0301579 f1: 0.7262888: 58%|█████▊ | 874/1497 [07:48<05:18, 1.96it/s]epoch: 1 loss: 0.0301579 f1: 0.7262888: 58%|█████▊ | 875/1497 [07:48<05:19, 1.95it/s]epoch: 1 loss: 0.0158123 f1: 0.7262888: 58%|█████▊ | 875/1497 [07:48<05:19, 1.95it/s]epoch: 1 loss: 0.0158123 f1: 0.7262888: 59%|█████▊ | 876/1497 [07:48<05:16, 1.96it/s]epoch: 1 loss: 0.0885139 f1: 0.7262888: 59%|█████▊ | 876/1497 [07:49<05:16, 1.96it/s]epoch: 1 loss: 0.0885139 f1: 0.7262888: 59%|█████▊ | 877/1497 [07:49<05:14, 1.97it/s]epoch: 1 loss: 0.0351266 f1: 0.7262888: 59%|█████▊ | 877/1497 [07:49<05:14, 1.97it/s]epoch: 1 loss: 0.0351266 f1: 0.7262888: 59%|█████▊ | 
878/1497 [07:49<05:13, 1.98it/s]epoch: 1 loss: 0.0891661 f1: 0.7262888: 59%|█████▊ | 878/1497 [07:50<05:13, 1.98it/s]epoch: 1 loss: 0.0891661 f1: 0.7262888: 59%|█████▊ | 879/1497 [07:50<05:13, 1.97it/s]epoch: 1 loss: 0.0386531 f1: 0.7262888: 59%|█████▊ | 879/1497 [07:50<05:13, 1.97it/s]epoch: 1 loss: 0.0386531 f1: 0.7262888: 59%|█████▉ | 880/1497 [07:50<05:12, 1.98it/s]epoch: 1 loss: 0.1303358 f1: 0.7262888: 59%|█████▉ | 880/1497 [07:51<05:12, 1.98it/s]epoch: 1 loss: 0.1303358 f1: 0.7262888: 59%|█████▉ | 881/1497 [07:51<05:11, 1.98it/s]epoch: 1 loss: 0.2138734 f1: 0.7262888: 59%|█████▉ | 881/1497 [07:51<05:11, 1.98it/s]epoch: 1 loss: 0.2138734 f1: 0.7262888: 59%|█████▉ | 882/1497 [07:51<05:09, 1.99it/s]epoch: 1 loss: 0.0835994 f1: 0.7262888: 59%|█████▉ | 882/1497 [07:52<05:09, 1.99it/s]epoch: 1 loss: 0.0835994 f1: 0.7262888: 59%|█████▉ | 883/1497 [07:52<05:09, 1.98it/s]epoch: 1 loss: 0.0399215 f1: 0.7262888: 59%|█████▉ | 883/1497 [07:52<05:09, 1.98it/s]epoch: 1 loss: 0.0399215 f1: 0.7262888: 59%|█████▉ | 884/1497 [07:52<05:07, 1.99it/s]epoch: 1 loss: 0.1102345 f1: 0.7262888: 59%|█████▉ | 884/1497 [07:53<05:07, 1.99it/s]epoch: 1 loss: 0.1102345 f1: 0.7262888: 59%|█████▉ | 885/1497 [07:53<05:07, 1.99it/s]epoch: 1 loss: 0.0367152 f1: 0.7262888: 59%|█████▉ | 885/1497 [07:53<05:07, 1.99it/s]epoch: 1 loss: 0.0367152 f1: 0.7262888: 59%|█████▉ | 886/1497 [07:53<05:06, 1.99it/s]epoch: 1 loss: 0.0194695 f1: 0.7262888: 59%|█████▉ | 886/1497 [07:54<05:06, 1.99it/s]epoch: 1 loss: 0.0194695 f1: 0.7262888: 59%|█████▉ | 887/1497 [07:54<05:05, 2.00it/s]epoch: 1 loss: 0.0267288 f1: 0.7262888: 59%|█████▉ | 887/1497 [07:54<05:05, 2.00it/s]epoch: 1 loss: 0.0267288 f1: 0.7262888: 59%|█████▉ | 888/1497 [07:54<05:05, 1.99it/s]epoch: 1 loss: 0.1624919 f1: 0.7262888: 59%|█████▉ | 888/1497 [07:55<05:05, 1.99it/s]epoch: 1 loss: 0.1624919 f1: 0.7262888: 59%|█████▉ | 889/1497 [07:55<05:05, 1.99it/s]epoch: 1 loss: 0.0417879 f1: 0.7262888: 59%|█████▉ | 889/1497 [07:55<05:05, 1.99it/s]epoch: 1 
loss: 0.0417879 f1: 0.7262888: 59%|█████▉ | 890/1497 [07:55<05:05, 1.99it/s]epoch: 1 loss: 0.0041652 f1: 0.7262888: 59%|█████▉ | 890/1497 [07:56<05:05, 1.99it/s]epoch: 1 loss: 0.0041652 f1: 0.7262888: 60%|█████▉ | 891/1497 [07:56<05:05, 1.98it/s]epoch: 1 loss: 0.0626647 f1: 0.7262888: 60%|█████▉ | 891/1497 [07:56<05:05, 1.98it/s]epoch: 1 loss: 0.0626647 f1: 0.7262888: 60%|█████▉ | 892/1497 [07:56<05:06, 1.97it/s]epoch: 1 loss: 0.0412949 f1: 0.7262888: 60%|█████▉ | 892/1497 [07:57<05:06, 1.97it/s]epoch: 1 loss: 0.0412949 f1: 0.7262888: 60%|█████▉ | 893/1497 [07:57<05:06, 1.97it/s]epoch: 1 loss: 0.0852861 f1: 0.7262888: 60%|█████▉ | 893/1497 [07:57<05:06, 1.97it/s]epoch: 1 loss: 0.0852861 f1: 0.7262888: 60%|█████▉ | 894/1497 [07:57<05:04, 1.98it/s]epoch: 1 loss: 0.2588138 f1: 0.7262888: 60%|█████▉ | 894/1497 [07:58<05:04, 1.98it/s]epoch: 1 loss: 0.2588138 f1: 0.7262888: 60%|█████▉ | 895/1497 [07:58<05:05, 1.97it/s]epoch: 1 loss: 0.1298221 f1: 0.7262888: 60%|█████▉ | 895/1497 [07:58<05:05, 1.97it/s]epoch: 1 loss: 0.1298221 f1: 0.7262888: 60%|█████▉ | 896/1497 [07:58<05:04, 1.97it/s]epoch: 1 loss: 0.1739900 f1: 0.7262888: 60%|█████▉ | 896/1497 [07:59<05:04, 1.97it/s]epoch: 1 loss: 0.1739900 f1: 0.7262888: 60%|█████▉ | 897/1497 [07:59<05:01, 1.99it/s]epoch: 1 loss: 0.0516678 f1: 0.7262888: 60%|█████▉ | 897/1497 [07:59<05:01, 1.99it/s]epoch: 1 loss: 0.0516678 f1: 0.7262888: 60%|█████▉ | 898/1497 [07:59<04:59, 2.00it/s]epoch: 1 loss: 0.0989709 f1: 0.7262888: 60%|█████▉ | 898/1497 [08:00<04:59, 2.00it/s]epoch: 1 loss: 0.0989709 f1: 0.7262888: 60%|██████ | 899/1497 [08:00<05:00, 1.99it/s]epoch: 1 loss: 0.0370839 f1: 0.7262888: 60%|██████ | 899/1497 [08:00<05:00, 1.99it/s]epoch: 1 loss: 0.0370839 f1: 0.7262888: 60%|██████ | 900/1497 [08:00<05:04, 1.96it/s]epoch: 1 loss: 0.0514164 f1: 0.7262888: 60%|██████ | 900/1497 [08:01<05:04, 1.96it/s]epoch: 1 loss: 0.0514164 f1: 0.7262888: 60%|██████ | 901/1497 [08:01<05:01, 1.98it/s]epoch: 1 loss: 0.0541799 f1: 0.7262888: 60%|██████ | 
901/1497 [08:01<05:01, 1.98it/s]epoch: 1 loss: 0.0541799 f1: 0.7262888: 60%|██████ | 902/1497 [08:01<04:59, 1.99it/s]epoch: 1 loss: 0.0412434 f1: 0.7262888: 60%|██████ | 902/1497 [08:02<04:59, 1.99it/s]epoch: 1 loss: 0.0412434 f1: 0.7262888: 60%|██████ | 903/1497 [08:02<04:58, 1.99it/s]epoch: 1 loss: 0.0889424 f1: 0.7262888: 60%|██████ | 903/1497 [08:02<04:58, 1.99it/s]epoch: 1 loss: 0.0889424 f1: 0.7262888: 60%|██████ | 904/1497 [08:02<04:57, 2.00it/s]epoch: 1 loss: 0.0283440 f1: 0.7262888: 60%|██████ | 904/1497 [08:03<04:57, 2.00it/s]epoch: 1 loss: 0.0283440 f1: 0.7262888: 60%|██████ | 905/1497 [08:03<04:55, 2.00it/s]epoch: 1 loss: 0.0076390 f1: 0.7262888: 60%|██████ | 905/1497 [08:03<04:55, 2.00it/s]epoch: 1 loss: 0.0076390 f1: 0.7262888: 61%|██████ | 906/1497 [08:03<04:55, 2.00it/s]epoch: 1 loss: 0.0537650 f1: 0.7262888: 61%|██████ | 906/1497 [08:04<04:55, 2.00it/s]epoch: 1 loss: 0.0537650 f1: 0.7262888: 61%|██████ | 907/1497 [08:04<04:55, 2.00it/s]epoch: 1 loss: 0.1378454 f1: 0.7262888: 61%|██████ | 907/1497 [08:04<04:55, 2.00it/s]epoch: 1 loss: 0.1378454 f1: 0.7262888: 61%|██████ | 908/1497 [08:04<04:53, 2.00it/s]epoch: 1 loss: 0.0293451 f1: 0.7262888: 61%|██████ | 908/1497 [08:05<04:53, 2.00it/s]epoch: 1 loss: 0.0293451 f1: 0.7262888: 61%|██████ | 909/1497 [08:05<04:51, 2.01it/s]epoch: 1 loss: 0.0382812 f1: 0.7262888: 61%|██████ | 909/1497 [08:05<04:51, 2.01it/s]epoch: 1 loss: 0.0382812 f1: 0.7262888: 61%|██████ | 910/1497 [08:05<04:51, 2.01it/s]epoch: 1 loss: 0.0858214 f1: 0.7262888: 61%|██████ | 910/1497 [08:06<04:51, 2.01it/s]epoch: 1 loss: 0.0858214 f1: 0.7262888: 61%|██████ | 911/1497 [08:06<04:50, 2.02it/s]epoch: 1 loss: 0.0263095 f1: 0.7262888: 61%|██████ | 911/1497 [08:06<04:50, 2.02it/s]epoch: 1 loss: 0.0263095 f1: 0.7262888: 61%|██████ | 912/1497 [08:06<04:49, 2.02it/s]epoch: 1 loss: 0.1015395 f1: 0.7262888: 61%|██████ | 912/1497 [08:07<04:49, 2.02it/s]epoch: 1 loss: 0.1015395 f1: 0.7262888: 61%|██████ | 913/1497 [08:07<04:48, 2.02it/s]epoch: 1 
loss: 0.0917412 f1: 0.7262888: 61%|██████ | 913/1497 [08:07<04:48, 2.02it/s]epoch: 1 loss: 0.0917412 f1: 0.7262888: 61%|██████ | 914/1497 [08:07<04:48, 2.02it/s]epoch: 1 loss: 0.1444232 f1: 0.7262888: 61%|██████ | 914/1497 [08:08<04:48, 2.02it/s]epoch: 1 loss: 0.1444232 f1: 0.7262888: 61%|██████ | 915/1497 [08:08<04:50, 2.00it/s]epoch: 1 loss: 0.0175575 f1: 0.7262888: 61%|██████ | 915/1497 [08:08<04:50, 2.00it/s]epoch: 1 loss: 0.0175575 f1: 0.7262888: 61%|██████ | 916/1497 [08:08<04:50, 2.00it/s]epoch: 1 loss: 0.0722245 f1: 0.7262888: 61%|██████ | 916/1497 [08:09<04:50, 2.00it/s]epoch: 1 loss: 0.0722245 f1: 0.7262888: 61%|██████▏ | 917/1497 [08:09<04:51, 1.99it/s]epoch: 1 loss: 0.0349533 f1: 0.7262888: 61%|██████▏ | 917/1497 [08:09<04:51, 1.99it/s]epoch: 1 loss: 0.0349533 f1: 0.7262888: 61%|██████▏ | 918/1497 [08:09<04:50, 1.99it/s]epoch: 1 loss: 0.0048308 f1: 0.7262888: 61%|██████▏ | 918/1497 [08:10<04:50, 1.99it/s]epoch: 1 loss: 0.0048308 f1: 0.7262888: 61%|██████▏ | 919/1497 [08:10<04:49, 2.00it/s]epoch: 1 loss: 0.1237217 f1: 0.7262888: 61%|██████▏ | 919/1497 [08:10<04:49, 2.00it/s]epoch: 1 loss: 0.1237217 f1: 0.7262888: 61%|██████▏ | 920/1497 [08:10<04:47, 2.00it/s]epoch: 1 loss: 0.0671547 f1: 0.7262888: 61%|██████▏ | 920/1497 [08:11<04:47, 2.00it/s]epoch: 1 loss: 0.0671547 f1: 0.7262888: 62%|██████▏ | 921/1497 [08:11<04:47, 2.00it/s]epoch: 1 loss: 0.0053303 f1: 0.7262888: 62%|██████▏ | 921/1497 [08:11<04:47, 2.00it/s]epoch: 1 loss: 0.0053303 f1: 0.7262888: 62%|██████▏ | 922/1497 [08:11<04:45, 2.01it/s]epoch: 1 loss: 0.0231710 f1: 0.7262888: 62%|██████▏ | 922/1497 [08:12<04:45, 2.01it/s]epoch: 1 loss: 0.0231710 f1: 0.7262888: 62%|██████▏ | 923/1497 [08:12<04:45, 2.01it/s]epoch: 1 loss: 0.0680029 f1: 0.7262888: 62%|██████▏ | 923/1497 [08:12<04:45, 2.01it/s]epoch: 1 loss: 0.0680029 f1: 0.7262888: 62%|██████▏ | 924/1497 [08:12<04:43, 2.02it/s]epoch: 1 loss: 0.0615517 f1: 0.7262888: 62%|██████▏ | 924/1497 [08:13<04:43, 2.02it/s]epoch: 1 loss: 0.0615517 f1: 
0.7262888: 62%|██████▏ | 925/1497 [08:13<04:43, 2.02it/s]epoch: 1 loss: 0.0319181 f1: 0.7262888: 62%|██████▏ | 925/1497 [08:13<04:43, 2.02it/s]epoch: 1 loss: 0.0319181 f1: 0.7262888: 62%|██████▏ | 926/1497 [08:13<04:45, 2.00it/s]epoch: 1 loss: 0.0302241 f1: 0.7262888: 62%|██████▏ | 926/1497 [08:14<04:45, 2.00it/s]epoch: 1 loss: 0.0302241 f1: 0.7262888: 62%|██████▏ | 927/1497 [08:14<04:43, 2.01it/s]epoch: 1 loss: 0.1100502 f1: 0.7262888: 62%|██████▏ | 927/1497 [08:14<04:43, 2.01it/s]epoch: 1 loss: 0.1100502 f1: 0.7262888: 62%|██████▏ | 928/1497 [08:14<04:42, 2.01it/s]epoch: 1 loss: 0.0811311 f1: 0.7262888: 62%|██████▏ | 928/1497 [08:15<04:42, 2.01it/s]epoch: 1 loss: 0.0811311 f1: 0.7262888: 62%|██████▏ | 929/1497 [08:15<04:41, 2.02it/s]epoch: 1 loss: 0.0184818 f1: 0.7262888: 62%|██████▏ | 929/1497 [08:15<04:41, 2.02it/s]epoch: 1 loss: 0.0184818 f1: 0.7262888: 62%|██████▏ | 930/1497 [08:15<04:41, 2.01it/s]epoch: 1 loss: 0.0183923 f1: 0.7262888: 62%|██████▏ | 930/1497 [08:16<04:41, 2.01it/s]epoch: 1 loss: 0.0183923 f1: 0.7262888: 62%|██████▏ | 931/1497 [08:16<04:43, 2.00it/s]epoch: 1 loss: 0.0130534 f1: 0.7262888: 62%|██████▏ | 931/1497 [08:16<04:43, 2.00it/s]epoch: 1 loss: 0.0130534 f1: 0.7262888: 62%|██████▏ | 932/1497 [08:16<04:42, 2.00it/s]epoch: 1 loss: 0.0547145 f1: 0.7262888: 62%|██████▏ | 932/1497 [08:17<04:42, 2.00it/s]epoch: 1 loss: 0.0547145 f1: 0.7262888: 62%|██████▏ | 933/1497 [08:17<04:43, 1.99it/s]epoch: 1 loss: 0.0121533 f1: 0.7262888: 62%|██████▏ | 933/1497 [08:17<04:43, 1.99it/s]epoch: 1 loss: 0.0121533 f1: 0.7262888: 62%|██████▏ | 934/1497 [08:17<04:41, 2.00it/s]epoch: 1 loss: 0.1266699 f1: 0.7262888: 62%|██████▏ | 934/1497 [08:18<04:41, 2.00it/s]epoch: 1 loss: 0.1266699 f1: 0.7262888: 62%|██████▏ | 935/1497 [08:18<04:41, 2.00it/s]epoch: 1 loss: 0.0170545 f1: 0.7262888: 62%|██████▏ | 935/1497 [08:18<04:41, 2.00it/s]epoch: 1 loss: 0.0170545 f1: 0.7262888: 63%|██████▎ | 936/1497 [08:18<04:40, 2.00it/s]epoch: 1 loss: 0.1216670 f1: 0.7262888: 
63%|██████▎ | 936/1497 [08:19<04:40, 2.00it/s]epoch: 1 loss: 0.1216670 f1: 0.7262888: 63%|██████▎ | 937/1497 [08:19<04:38, 2.01it/s]epoch: 1 loss: 0.0428759 f1: 0.7262888: 63%|██████▎ | 937/1497 [08:19<04:38, 2.01it/s]epoch: 1 loss: 0.0428759 f1: 0.7262888: 63%|██████▎ | 938/1497 [08:19<04:40, 1.99it/s]epoch: 1 loss: 0.0079449 f1: 0.7262888: 63%|██████▎ | 938/1497 [08:20<04:40, 1.99it/s]epoch: 1 loss: 0.0079449 f1: 0.7262888: 63%|██████▎ | 939/1497 [08:20<04:38, 2.00it/s]epoch: 1 loss: 0.0350663 f1: 0.7262888: 63%|██████▎ | 939/1497 [08:20<04:38, 2.00it/s]epoch: 1 loss: 0.0350663 f1: 0.7262888: 63%|██████▎ | 940/1497 [08:20<04:38, 2.00it/s]epoch: 1 loss: 0.1004297 f1: 0.7262888: 63%|██████▎ | 940/1497 [08:21<04:38, 2.00it/s]epoch: 1 loss: 0.1004297 f1: 0.7262888: 63%|██████▎ | 941/1497 [08:21<04:41, 1.97it/s]epoch: 1 loss: 0.0223516 f1: 0.7262888: 63%|██████▎ | 941/1497 [08:21<04:41, 1.97it/s]epoch: 1 loss: 0.0223516 f1: 0.7262888: 63%|██████▎ | 942/1497 [08:21<04:40, 1.98it/s]epoch: 1 loss: 0.0594937 f1: 0.7262888: 63%|██████▎ | 942/1497 [08:22<04:40, 1.98it/s]epoch: 1 loss: 0.0594937 f1: 0.7262888: 63%|██████▎ | 943/1497 [08:22<04:40, 1.98it/s]epoch: 1 loss: 0.0033979 f1: 0.7262888: 63%|██████▎ | 943/1497 [08:22<04:40, 1.98it/s]epoch: 1 loss: 0.0033979 f1: 0.7262888: 63%|██████▎ | 944/1497 [08:22<04:39, 1.98it/s]epoch: 1 loss: 0.0051575 f1: 0.7262888: 63%|██████▎ | 944/1497 [08:23<04:39, 1.98it/s]epoch: 1 loss: 0.0051575 f1: 0.7262888: 63%|██████▎ | 945/1497 [08:23<04:36, 2.00it/s]epoch: 1 loss: 0.0540486 f1: 0.7262888: 63%|██████▎ | 945/1497 [08:23<04:36, 2.00it/s]epoch: 1 loss: 0.0540486 f1: 0.7262888: 63%|██████▎ | 946/1497 [08:23<04:34, 2.01it/s]epoch: 1 loss: 0.0934597 f1: 0.7262888: 63%|██████▎ | 946/1497 [08:24<04:34, 2.01it/s]epoch: 1 loss: 0.0934597 f1: 0.7262888: 63%|██████▎ | 947/1497 [08:24<04:32, 2.02it/s]epoch: 1 loss: 0.0813767 f1: 0.7262888: 63%|██████▎ | 947/1497 [08:24<04:32, 2.02it/s]epoch: 1 loss: 0.0813767 f1: 0.7262888: 63%|██████▎ | 
948/1497 [08:24<04:31, 2.02it/s]epoch: 1 loss: 0.0037616 f1: 0.7262888: 63%|██████▎ | 948/1497 [08:25<04:31, 2.02it/s]epoch: 1 loss: 0.0037616 f1: 0.7262888: 63%|██████▎ | 949/1497 [08:25<04:30, 2.03it/s]epoch: 1 loss: 0.0201014 f1: 0.7262888: 63%|██████▎ | 949/1497 [08:25<04:30, 2.03it/s]epoch: 1 loss: 0.0201014 f1: 0.7262888: 63%|██████▎ | 950/1497 [08:25<04:29, 2.03it/s]epoch: 1 loss: 0.1138526 f1: 0.7262888: 63%|██████▎ | 950/1497 [08:26<04:29, 2.03it/s]epoch: 1 loss: 0.1138526 f1: 0.7262888: 64%|██████▎ | 951/1497 [08:26<04:28, 2.04it/s]epoch: 1 loss: 0.0309040 f1: 0.7262888: 64%|██████▎ | 951/1497 [08:26<04:28, 2.04it/s]epoch: 1 loss: 0.0309040 f1: 0.7262888: 64%|██████▎ | 952/1497 [08:26<04:27, 2.04it/s]epoch: 1 loss: 0.2466692 f1: 0.7262888: 64%|██████▎ | 952/1497 [08:27<04:27, 2.04it/s]epoch: 1 loss: 0.2466692 f1: 0.7262888: 64%|██████▎ | 953/1497 [08:27<04:28, 2.02it/s]epoch: 1 loss: 0.0171863 f1: 0.7262888: 64%|██████▎ | 953/1497 [08:27<04:28, 2.02it/s]epoch: 1 loss: 0.0171863 f1: 0.7262888: 64%|██████▎ | 954/1497 [08:27<04:27, 2.03it/s]epoch: 1 loss: 0.2257369 f1: 0.7262888: 64%|██████▎ | 954/1497 [08:28<04:27, 2.03it/s]epoch: 1 loss: 0.2257369 f1: 0.7262888: 64%|██████▍ | 955/1497 [08:28<04:26, 2.03it/s]epoch: 1 loss: 0.0077250 f1: 0.7262888: 64%|██████▍ | 955/1497 [08:28<04:26, 2.03it/s]epoch: 1 loss: 0.0077250 f1: 0.7262888: 64%|██████▍ | 956/1497 [08:28<04:27, 2.02it/s]epoch: 1 loss: 0.0472980 f1: 0.7262888: 64%|██████▍ | 956/1497 [08:29<04:27, 2.02it/s]epoch: 1 loss: 0.0472980 f1: 0.7262888: 64%|██████▍ | 957/1497 [08:29<04:26, 2.03it/s]epoch: 1 loss: 0.0993755 f1: 0.7262888: 64%|██████▍ | 957/1497 [08:29<04:26, 2.03it/s]epoch: 1 loss: 0.0993755 f1: 0.7262888: 64%|██████▍ | 958/1497 [08:29<04:26, 2.02it/s]epoch: 1 loss: 0.0030888 f1: 0.7262888: 64%|██████▍ | 958/1497 [08:30<04:26, 2.02it/s]epoch: 1 loss: 0.0030888 f1: 0.7262888: 64%|██████▍ | 959/1497 [08:30<04:26, 2.02it/s]epoch: 1 loss: 0.0149669 f1: 0.7262888: 64%|██████▍ | 959/1497 
[08:30<04:26, 2.02it/s]epoch: 1 loss: 0.0149669 f1: 0.7262888: 64%|██████▍ | 960/1497 [08:30<04:24, 2.03it/s]epoch: 1 loss: 0.0610588 f1: 0.7262888: 64%|██████▍ | 960/1497 [08:31<04:24, 2.03it/s]epoch: 1 loss: 0.0610588 f1: 0.7262888: 64%|██████▍ | 961/1497 [08:31<04:22, 2.04it/s]epoch: 1 loss: 0.0825948 f1: 0.7262888: 64%|██████▍ | 961/1497 [08:31<04:22, 2.04it/s]epoch: 1 loss: 0.0825948 f1: 0.7262888: 64%|██████▍ | 962/1497 [08:31<04:22, 2.04it/s]epoch: 1 loss: 0.0728263 f1: 0.7262888: 64%|██████▍ | 962/1497 [08:32<04:22, 2.04it/s]epoch: 1 loss: 0.0728263 f1: 0.7262888: 64%|██████▍ | 963/1497 [08:32<04:22, 2.03it/s]epoch: 1 loss: 0.0336787 f1: 0.7262888: 64%|██████▍ | 963/1497 [08:32<04:22, 2.03it/s]epoch: 1 loss: 0.0336787 f1: 0.7262888: 64%|██████▍ | 964/1497 [08:32<04:23, 2.02it/s]epoch: 1 loss: 0.0053402 f1: 0.7262888: 64%|██████▍ | 964/1497 [08:33<04:23, 2.02it/s]epoch: 1 loss: 0.0053402 f1: 0.7262888: 64%|██████▍ | 965/1497 [08:33<04:22, 2.02it/s]epoch: 1 loss: 0.0934295 f1: 0.7262888: 64%|██████▍ | 965/1497 [08:33<04:22, 2.02it/s]epoch: 1 loss: 0.0934295 f1: 0.7262888: 65%|██████▍ | 966/1497 [08:33<04:21, 2.03it/s]epoch: 1 loss: 0.0453267 f1: 0.7262888: 65%|██████▍ | 966/1497 [08:34<04:21, 2.03it/s]epoch: 1 loss: 0.0453267 f1: 0.7262888: 65%|██████▍ | 967/1497 [08:34<04:22, 2.02it/s]epoch: 1 loss: 0.0110312 f1: 0.7262888: 65%|██████▍ | 967/1497 [08:34<04:22, 2.02it/s]epoch: 1 loss: 0.0110312 f1: 0.7262888: 65%|██████▍ | 968/1497 [08:34<04:20, 2.03it/s]epoch: 1 loss: 0.0059587 f1: 0.7262888: 65%|██████▍ | 968/1497 [08:35<04:20, 2.03it/s]epoch: 1 loss: 0.0059587 f1: 0.7262888: 65%|██████▍ | 969/1497 [08:35<04:19, 2.04it/s]epoch: 1 loss: 0.1466050 f1: 0.7262888: 65%|██████▍ | 969/1497 [08:35<04:19, 2.04it/s]epoch: 1 loss: 0.1466050 f1: 0.7262888: 65%|██████▍ | 970/1497 [08:35<04:19, 2.03it/s]epoch: 1 loss: 0.0930673 f1: 0.7262888: 65%|██████▍ | 970/1497 [08:36<04:19, 2.03it/s]epoch: 1 loss: 0.0930673 f1: 0.7262888: 65%|██████▍ | 971/1497 [08:36<04:19, 
2.03it/s]epoch: 1 loss: 0.0043641 f1: 0.7262888: 65%|██████▍ | 971/1497 [08:36<04:19, 2.03it/s]epoch: 1 loss: 0.0043641 f1: 0.7262888: 65%|██████▍ | 972/1497 [08:36<04:19, 2.02it/s]epoch: 1 loss: 0.0983466 f1: 0.7262888: 65%|██████▍ | 972/1497 [08:37<04:19, 2.02it/s]epoch: 1 loss: 0.0983466 f1: 0.7262888: 65%|██████▍ | 973/1497 [08:37<04:20, 2.02it/s]epoch: 1 loss: 0.0219711 f1: 0.7262888: 65%|██████▍ | 973/1497 [08:37<04:20, 2.02it/s]epoch: 1 loss: 0.0219711 f1: 0.7262888: 65%|██████▌ | 974/1497 [08:37<04:21, 2.00it/s]epoch: 1 loss: 0.0064466 f1: 0.7262888: 65%|██████▌ | 974/1497 [08:38<04:21, 2.00it/s]epoch: 1 loss: 0.0064466 f1: 0.7262888: 65%|██████▌ | 975/1497 [08:38<04:23, 1.98it/s]epoch: 1 loss: 0.1170068 f1: 0.7262888: 65%|██████▌ | 975/1497 [08:38<04:23, 1.98it/s]epoch: 1 loss: 0.1170068 f1: 0.7262888: 65%|██████▌ | 976/1497 [08:38<04:25, 1.97it/s]epoch: 1 loss: 0.0052620 f1: 0.7262888: 65%|██████▌ | 976/1497 [08:39<04:25, 1.97it/s]epoch: 1 loss: 0.0052620 f1: 0.7262888: 65%|██████▌ | 977/1497 [08:39<04:24, 1.97it/s]epoch: 1 loss: 0.0120260 f1: 0.7262888: 65%|██████▌ | 977/1497 [08:39<04:24, 1.97it/s]epoch: 1 loss: 0.0120260 f1: 0.7262888: 65%|██████▌ | 978/1497 [08:39<04:22, 1.97it/s]epoch: 1 loss: 0.0388607 f1: 0.7262888: 65%|██████▌ | 978/1497 [08:40<04:22, 1.97it/s]epoch: 1 loss: 0.0388607 f1: 0.7262888: 65%|██████▌ | 979/1497 [08:40<04:23, 1.96it/s]epoch: 1 loss: 0.0165507 f1: 0.7262888: 65%|██████▌ | 979/1497 [08:40<04:23, 1.96it/s]epoch: 1 loss: 0.0165507 f1: 0.7262888: 65%|██████▌ | 980/1497 [08:40<04:22, 1.97it/s]epoch: 1 loss: 0.0230119 f1: 0.7262888: 65%|██████▌ | 980/1497 [08:41<04:22, 1.97it/s]epoch: 1 loss: 0.0230119 f1: 0.7262888: 66%|██████▌ | 981/1497 [08:41<04:19, 1.99it/s]epoch: 1 loss: 0.0341803 f1: 0.7262888: 66%|██████▌ | 981/1497 [08:41<04:19, 1.99it/s]epoch: 1 loss: 0.0341803 f1: 0.7262888: 66%|██████▌ | 982/1497 [08:41<04:22, 1.96it/s]epoch: 1 loss: 0.0138703 f1: 0.7262888: 66%|██████▌ | 982/1497 [08:42<04:22, 1.96it/s]epoch: 1 
loss: 0.0138703 f1: 0.7262888: 66%|██████▌ | 983/1497 [08:42<04:22, 1.96it/s]epoch: 1 loss: 0.1137445 f1: 0.7262888: 66%|██████▌ | 983/1497 [08:42<04:22, 1.96it/s]epoch: 1 loss: 0.1137445 f1: 0.7262888: 66%|██████▌ | 984/1497 [08:42<04:18, 1.99it/s]epoch: 1 loss: 0.0542309 f1: 0.7262888: 66%|██████▌ | 984/1497 [08:43<04:18, 1.99it/s]epoch: 1 loss: 0.0542309 f1: 0.7262888: 66%|██████▌ | 985/1497 [08:43<04:17, 1.99it/s]epoch: 1 loss: 0.0383111 f1: 0.7262888: 66%|██████▌ | 985/1497 [08:43<04:17, 1.99it/s]epoch: 1 loss: 0.0383111 f1: 0.7262888: 66%|██████▌ | 986/1497 [08:43<04:16, 1.99it/s]epoch: 1 loss: 0.0259308 f1: 0.7262888: 66%|██████▌ | 986/1497 [08:44<04:16, 1.99it/s]epoch: 1 loss: 0.0259308 f1: 0.7262888: 66%|██████▌ | 987/1497 [08:44<04:16, 1.99it/s]epoch: 1 loss: 0.0861352 f1: 0.7262888: 66%|██████▌ | 987/1497 [08:44<04:16, 1.99it/s]epoch: 1 loss: 0.0861352 f1: 0.7262888: 66%|██████▌ | 988/1497 [08:44<04:15, 1.99it/s]epoch: 1 loss: 0.0119524 f1: 0.7262888: 66%|██████▌ | 988/1497 [08:45<04:15, 1.99it/s]epoch: 1 loss: 0.0119524 f1: 0.7262888: 66%|██████▌ | 989/1497 [08:45<04:13, 2.00it/s]epoch: 1 loss: 0.0897825 f1: 0.7262888: 66%|██████▌ | 989/1497 [08:45<04:13, 2.00it/s]epoch: 1 loss: 0.0897825 f1: 0.7262888: 66%|██████▌ | 990/1497 [08:45<04:12, 2.01it/s]epoch: 1 loss: 0.0046061 f1: 0.7262888: 66%|██████▌ | 990/1497 [08:46<04:12, 2.01it/s]epoch: 1 loss: 0.0046061 f1: 0.7262888: 66%|██████▌ | 991/1497 [08:46<04:11, 2.01it/s]epoch: 1 loss: 0.0919826 f1: 0.7262888: 66%|██████▌ | 991/1497 [08:46<04:11, 2.01it/s]epoch: 1 loss: 0.0919826 f1: 0.7262888: 66%|██████▋ | 992/1497 [08:46<04:11, 2.01it/s]epoch: 1 loss: 0.0041350 f1: 0.7262888: 66%|██████▋ | 992/1497 [08:47<04:11, 2.01it/s]epoch: 1 loss: 0.0041350 f1: 0.7262888: 66%|██████▋ | 993/1497 [08:47<04:13, 1.98it/s]epoch: 1 loss: 0.0287726 f1: 0.7262888: 66%|██████▋ | 993/1497 [08:47<04:13, 1.98it/s]epoch: 1 loss: 0.0287726 f1: 0.7262888: 66%|██████▋ | 994/1497 [08:47<04:15, 1.97it/s]epoch: 1 loss: 0.0483208 f1: 
0.7262888: 66%|██████▋ | 994/1497 [08:48<04:15, 1.97it/s]epoch: 1 loss: 0.0483208 f1: 0.7262888: 66%|██████▋ | 995/1497 [08:48<04:13, 1.98it/s]epoch: 1 loss: 0.0508065 f1: 0.7262888: 66%|██████▋ | 995/1497 [08:48<04:13, 1.98it/s]epoch: 1 loss: 0.0508065 f1: 0.7262888: 67%|██████▋ | 996/1497 [08:48<04:11, 1.99it/s]epoch: 1 loss: 0.0520286 f1: 0.7262888: 67%|██████▋ | 996/1497 [08:49<04:11, 1.99it/s]epoch: 1 loss: 0.0520286 f1: 0.7262888: 67%|██████▋ | 997/1497 [08:49<04:10, 2.00it/s]
0%| | 0/1998 [00:00<?, ?it/s]
23%|██▎ | 458/1998 [00:00<00:00, 4578.15it/s]
46%|████▌ | 918/1998 [00:00<00:00, 4584.56it/s]
69%|██████▉ | 1381/1998 [00:00<00:00, 4595.45it/s]
93%|█████████▎| 1852/1998 [00:00<00:00, 4628.80it/s]
100%|██████████| 1998/1998 [00:00<00:00, 4612.19it/s]
test: 0%| | 0/63 [00:00<?, ?it/s]
test: 2%|▏ | 1/63 [00:00<00:14, 4.18it/s]
test: 3%|▎ | 2/63 [00:00<00:12, 4.81it/s]
test: 5%|▍ | 3/63 [00:00<00:11, 5.12it/s]
test: 6%|▋ | 4/63 [00:00<00:11, 5.33it/s]
test: 8%|▊ | 5/63 [00:00<00:10, 5.48it/s]
test: 10%|▉ | 6/63 [00:01<00:09, 5.91it/s]
test: 11%|█ | 7/63 [00:01<00:09, 5.99it/s]
test: 13%|█▎ | 8/63 [00:01<00:09, 5.74it/s]
test: 14%|█▍ | 9/63 [00:01<00:09, 5.99it/s]
test: 16%|█▌ | 10/63 [00:01<00:08, 6.33it/s]
test: 17%|█▋ | 11/63 [00:01<00:08, 6.32it/s]
test: 19%|█▉ | 12/63 [00:02<00:08, 5.95it/s]
test: 21%|██ | 13/63 [00:02<00:07, 6.32it/s]
test: 22%|██▏ | 14/63 [00:02<00:07, 6.27it/s]
test: 24%|██▍ | 15/63 [00:02<00:07, 6.01it/s]
test: 25%|██▌ | 16/63 [00:02<00:07, 6.20it/s]
test: 27%|██▋ | 17/63 [00:02<00:07, 6.51it/s]
test: 29%|██▊ | 18/63 [00:02<00:07, 6.30it/s]
test: 30%|███ | 19/63 [00:03<00:07, 5.71it/s]
test: 32%|███▏ | 20/63 [00:03<00:07, 6.09it/s]
test: 33%|███▎ | 21/63 [00:03<00:06, 6.04it/s]
test: 35%|███▍ | 22/63 [00:03<00:07, 5.66it/s]
test: 37%|███▋ | 23/63 [00:03<00:06, 5.83it/s]
test: 38%|███▊ | 24/63 [00:03<00:06, 5.87it/s]
test: 40%|███▉ | 25/63 [00:04<00:06, 6.28it/s]
test: 41%|████▏ | 26/63 [00:04<00:05, 6.17it/s]
test: 43%|████▎ | 27/63 [00:04<00:06, 5.54it/s]
test: 44%|████▍ | 28/63 [00:04<00:05, 6.01it/s]
test: 46%|████▌ | 29/63 [00:04<00:05, 6.21it/s]
test: 48%|████▊ | 30/63 [00:04<00:05, 6.12it/s]
test: 49%|████▉ | 31/63 [00:05<00:05, 6.04it/s]
test: 51%|█████ | 32/63 [00:05<00:04, 6.38it/s]
test: 52%|█████▏ | 33/63 [00:05<00:05, 5.91it/s]
test: 54%|█████▍ | 34/63 [00:05<00:04, 6.05it/s]
test: 56%|█████▌ | 35/63 [00:05<00:04, 6.12it/s]
test: 57%|█████▋ | 36/63 [00:05<00:04, 6.14it/s]
test: 59%|█████▊ | 37/63 [00:06<00:04, 6.23it/s]
test: 60%|██████ | 38/63 [00:06<00:03, 6.38it/s]
test: 62%|██████▏ | 39/63 [00:06<00:03, 6.36it/s]
test: 63%|██████▎ | 40/63 [00:06<00:03, 6.36it/s]
test: 65%|██████▌ | 41/63 [00:06<00:03, 5.94it/s]
test: 67%|██████▋ | 42/63 [00:06<00:03, 6.19it/s]
test: 68%|██████▊ | 43/63 [00:07<00:03, 6.48it/s]
test: 70%|██████▉ | 44/63 [00:07<00:02, 6.47it/s]
test: 71%|███████▏ | 45/63 [00:07<00:02, 6.41it/s]
test: 73%|███████▎ | 46/63 [00:07<00:02, 6.56it/s]
test: 75%|███████▍ | 47/63 [00:07<00:02, 6.31it/s]
test: 76%|███████▌ | 48/63 [00:07<00:02, 6.18it/s]
test: 78%|███████▊ | 49/63 [00:07<00:02, 6.39it/s]
test: 79%|███████▉ | 50/63 [00:08<00:01, 6.51it/s]
test: 81%|████████ | 51/63 [00:08<00:01, 6.30it/s]
test: 83%|████████▎ | 52/63 [00:08<00:01, 6.61it/s]
test: 84%|████████▍ | 53/63 [00:08<00:01, 6.58it/s]
test: 86%|████████▌ | 54/63 [00:08<00:01, 6.77it/s]
test: 87%|████████▋ | 55/63 [00:08<00:01, 6.20it/s]
test: 89%|████████▉ | 56/63 [00:09<00:01, 5.61it/s]
test: 90%|█████████ | 57/63 [00:09<00:01, 5.45it/s]
test: 92%|█████████▏| 58/63 [00:09<00:00, 5.53it/s]
test: 94%|█████████▎| 59/63 [00:09<00:00, 5.85it/s]
test: 95%|█████████▌| 60/63 [00:09<00:00, 5.87it/s]
test: 97%|█████████▋| 61/63 [00:10<00:00, 5.88it/s]
test: 98%|█████████▊| 62/63 [00:10<00:00, 6.17it/s]
test: 100%|██████████| 63/63 [00:10<00:00, 6.85it/s]
[Aepoch: 1 loss: 0.0882248 f1: 0.7534884: 67%|██████▋ | 997/1497 [09:17<04:10, 2.00it/s]epoch: 1 loss: 0.0882248 f1: 0.7534884: 67%|██████▋ | 998/1497 [09:17<1:12:27, 8.71s/it]epoch: 2 loss: 0.0865444 f1: 0.7534884: 67%|██████▋ | 998/1497 [09:17<1:12:27, 8.71s/it]epoch: 2 loss: 0.0865444 f1: 0.7534884: 67%|██████▋ | 999/1497 [09:17<52:17, 6.30s/it] epoch: 2 loss: 0.0242179 f1: 0.7534884: 67%|██████▋ | 999/1497 [09:18<52:17, 6.30s/it]epoch: 2 loss: 0.0242179 f1: 0.7534884: 67%|██████▋ | 1000/1497 [09:18<37:46, 4.56s/it]epoch: 2 loss: 0.0374285 f1: 0.7534884: 67%|██████▋ | 1000/1497 [09:18<37:46, 4.56s/it]epoch: 2 loss: 0.0374285 f1: 0.7534884: 67%|██████▋ | 1001/1497 [09:18<27:37, 3.34s/it]epoch: 2 loss: 0.1381718 f1: 0.7534884: 67%|██████▋ | 1001/1497 [09:19<27:37, 3.34s/it]epoch: 2 loss: 0.1381718 f1: 0.7534884: 67%|██████▋ | 1002/1497 [09:19<20:32, 2.49s/it]epoch: 2 loss: 0.0603929 f1: 0.7534884: 67%|██████▋ | 1002/1497 [09:19<20:32, 2.49s/it]epoch: 2 loss: 0.0603929 f1: 0.7534884: 67%|██████▋ | 1003/1497 [09:19<15:35, 1.89s/it]epoch: 2 loss: 0.0571879 f1: 0.7534884: 67%|██████▋ | 1003/1497 [09:20<15:35, 1.89s/it]epoch: 2 loss: 0.0571879 f1: 0.7534884: 67%|██████▋ | 1004/1497 [09:20<12:07, 1.48s/it]epoch: 2 loss: 0.1482955 f1: 0.7534884: 67%|██████▋ | 1004/1497 [09:20<12:07, 1.48s/it]epoch: 2 loss: 0.1482955 f1: 0.7534884: 67%|██████▋ | 1005/1497 [09:20<09:42, 1.18s/it]epoch: 2 loss: 0.0208667 f1: 0.7534884: 67%|██████▋ | 1005/1497 [09:21<09:42, 1.18s/it]epoch: 2 loss: 0.0208667 f1: 0.7534884: 67%|██████▋ | 1006/1497 [09:21<08:01, 1.02it/s]epoch: 2 loss: 0.0262995 f1: 0.7534884: 67%|██████▋ | 1006/1497 [09:21<08:01, 1.02it/s]epoch: 2 loss: 0.0262995 f1: 0.7534884: 67%|██████▋ | 1007/1497 [09:21<06:50, 1.19it/s]epoch: 2 loss: 0.0048171 f1: 0.7534884: 67%|██████▋ | 1007/1497 [09:22<06:50, 1.19it/s]epoch: 2 loss: 0.0048171 f1: 0.7534884: 67%|██████▋ | 1008/1497 [09:22<06:01, 1.35it/s]epoch: 2 loss: 0.0338949 f1: 0.7534884: 67%|██████▋ | 1008/1497 [09:22<06:01, 
1.35it/s]epoch: 2 loss: 0.0338949 f1: 0.7534884: 67%|██████▋ | 1009/1497 [09:22<05:27, 1.49it/s]epoch: 2 loss: 0.0117199 f1: 0.7534884: 67%|██████▋ | 1009/1497 [09:23<05:27, 1.49it/s]epoch: 2 loss: 0.0117199 f1: 0.7534884: 67%|██████▋ | 1010/1497 [09:23<05:07, 1.58it/s]epoch: 2 loss: 0.1374213 f1: 0.7534884: 67%|██████▋ | 1010/1497 [09:23<05:07, 1.58it/s]epoch: 2 loss: 0.1374213 f1: 0.7534884: 68%|██████▊ | 1011/1497 [09:23<04:48, 1.69it/s]epoch: 2 loss: 0.0619933 f1: 0.7534884: 68%|██████▊ | 1011/1497 [09:24<04:48, 1.69it/s]epoch: 2 loss: 0.0619933 f1: 0.7534884: 68%|██████▊ | 1012/1497 [09:24<04:35, 1.76it/s]epoch: 2 loss: 0.0110504 f1: 0.7534884: 68%|██████▊ | 1012/1497 [09:24<04:35, 1.76it/s]epoch: 2 loss: 0.0110504 f1: 0.7534884: 68%|██████▊ | 1013/1497 [09:24<04:27, 1.81it/s]epoch: 2 loss: 0.1962132 f1: 0.7534884: 68%|██████▊ | 1013/1497 [09:25<04:27, 1.81it/s]epoch: 2 loss: 0.1962132 f1: 0.7534884: 68%|██████▊ | 1014/1497 [09:25<04:19, 1.86it/s]epoch: 2 loss: 0.0411076 f1: 0.7534884: 68%|██████▊ | 1014/1497 [09:25<04:19, 1.86it/s]epoch: 2 loss: 0.0411076 f1: 0.7534884: 68%|██████▊ | 1015/1497 [09:25<04:14, 1.89it/s]epoch: 2 loss: 0.0607899 f1: 0.7534884: 68%|██████▊ | 1015/1497 [09:26<04:14, 1.89it/s]epoch: 2 loss: 0.0607899 f1: 0.7534884: 68%|██████▊ | 1016/1497 [09:26<04:10, 1.92it/s]epoch: 2 loss: 0.0152491 f1: 0.7534884: 68%|██████▊ | 1016/1497 [09:26<04:10, 1.92it/s]epoch: 2 loss: 0.0152491 f1: 0.7534884: 68%|██████▊ | 1017/1497 [09:26<04:08, 1.93it/s]epoch: 2 loss: 0.0113209 f1: 0.7534884: 68%|██████▊ | 1017/1497 [09:27<04:08, 1.93it/s]epoch: 2 loss: 0.0113209 f1: 0.7534884: 68%|██████▊ | 1018/1497 [09:27<04:07, 1.94it/s]epoch: 2 loss: 0.0995148 f1: 0.7534884: 68%|██████▊ | 1018/1497 [09:27<04:07, 1.94it/s]epoch: 2 loss: 0.0995148 f1: 0.7534884: 68%|██████▊ | 1019/1497 [09:27<04:05, 1.95it/s]epoch: 2 loss: 0.0133384 f1: 0.7534884: 68%|██████▊ | 1019/1497 [09:28<04:05, 1.95it/s]epoch: 2 loss: 0.0133384 f1: 0.7534884: 68%|██████▊ | 1020/1497 
[09:28<04:04, 1.95it/s]epoch: 2 loss: 0.0067568 f1: 0.7534884: 68%|██████▊ | 1020/1497 [09:28<04:04, 1.95it/s]epoch: 2 loss: 0.0067568 f1: 0.7534884: 68%|██████▊ | 1021/1497 [09:28<04:03, 1.95it/s]epoch: 2 loss: 0.0464024 f1: 0.7534884: 68%|██████▊ | 1021/1497 [09:29<04:03, 1.95it/s]epoch: 2 loss: 0.0464024 f1: 0.7534884: 68%|██████▊ | 1022/1497 [09:29<04:02, 1.96it/s]epoch: 2 loss: 0.0065092 f1: 0.7534884: 68%|██████▊ | 1022/1497 [09:29<04:02, 1.96it/s]epoch: 2 loss: 0.0065092 f1: 0.7534884: 68%|██████▊ | 1023/1497 [09:29<04:00, 1.97it/s]epoch: 2 loss: 0.0893168 f1: 0.7534884: 68%|██████▊ | 1023/1497 [09:30<04:00, 1.97it/s]epoch: 2 loss: 0.0893168 f1: 0.7534884: 68%|██████▊ | 1024/1497 [09:30<03:59, 1.97it/s]epoch: 2 loss: 0.0217322 f1: 0.7534884: 68%|██████▊ | 1024/1497 [09:30<03:59, 1.97it/s]epoch: 2 loss: 0.0217322 f1: 0.7534884: 68%|██████▊ | 1025/1497 [09:30<03:58, 1.98it/s]epoch: 2 loss: 0.0318056 f1: 0.7534884: 68%|██████▊ | 1025/1497 [09:31<03:58, 1.98it/s]epoch: 2 loss: 0.0318056 f1: 0.7534884: 69%|██████▊ | 1026/1497 [09:31<03:57, 1.98it/s]epoch: 2 loss: 0.0950796 f1: 0.7534884: 69%|██████▊ | 1026/1497 [09:31<03:57, 1.98it/s]epoch: 2 loss: 0.0950796 f1: 0.7534884: 69%|██████▊ | 1027/1497 [09:31<03:57, 1.98it/s]epoch: 2 loss: 0.0818834 f1: 0.7534884: 69%|██████▊ | 1027/1497 [09:32<03:57, 1.98it/s]epoch: 2 loss: 0.0818834 f1: 0.7534884: 69%|██████▊ | 1028/1497 [09:32<03:57, 1.97it/s]epoch: 2 loss: 0.0175281 f1: 0.7534884: 69%|██████▊ | 1028/1497 [09:32<03:57, 1.97it/s]epoch: 2 loss: 0.0175281 f1: 0.7534884: 69%|██████▊ | 1029/1497 [09:32<03:58, 1.97it/s]epoch: 2 loss: 0.0672585 f1: 0.7534884: 69%|██████▊ | 1029/1497 [09:33<03:58, 1.97it/s]epoch: 2 loss: 0.0672585 f1: 0.7534884: 69%|██████▉ | 1030/1497 [09:33<03:57, 1.97it/s]epoch: 2 loss: 0.0307478 f1: 0.7534884: 69%|██████▉ | 1030/1497 [09:33<03:57, 1.97it/s]epoch: 2 loss: 0.0307478 f1: 0.7534884: 69%|██████▉ | 1031/1497 [09:33<03:56, 1.97it/s]epoch: 2 loss: 0.0286233 f1: 0.7534884: 69%|██████▉ | 
1031/1497 [09:34<03:56, 1.97it/s]epoch: 2 loss: 0.0286233 f1: 0.7534884: 69%|██████▉ | 1032/1497 [09:34<03:54, 1.98it/s]epoch: 2 loss: 0.0551939 f1: 0.7534884: 69%|██████▉ | 1032/1497 [09:34<03:54, 1.98it/s]epoch: 2 loss: 0.0551939 f1: 0.7534884: 69%|██████▉ | 1033/1497 [09:34<03:53, 1.99it/s]epoch: 2 loss: 0.0163078 f1: 0.7534884: 69%|██████▉ | 1033/1497 [09:35<03:53, 1.99it/s]epoch: 2 loss: 0.0163078 f1: 0.7534884: 69%|██████▉ | 1034/1497 [09:35<03:50, 2.01it/s]epoch: 2 loss: 0.1159914 f1: 0.7534884: 69%|██████▉ | 1034/1497 [09:35<03:50, 2.01it/s]epoch: 2 loss: 0.1159914 f1: 0.7534884: 69%|██████▉ | 1035/1497 [09:35<03:50, 2.01it/s]epoch: 2 loss: 0.0156514 f1: 0.7534884: 69%|██████▉ | 1035/1497 [09:36<03:50, 2.01it/s]epoch: 2 loss: 0.0156514 f1: 0.7534884: 69%|██████▉ | 1036/1497 [09:36<03:49, 2.01it/s]epoch: 2 loss: 0.0133555 f1: 0.7534884: 69%|██████▉ | 1036/1497 [09:36<03:49, 2.01it/s]epoch: 2 loss: 0.0133555 f1: 0.7534884: 69%|██████▉ | 1037/1497 [09:36<03:46, 2.03it/s]epoch: 2 loss: 0.1398710 f1: 0.7534884: 69%|██████▉ | 1037/1497 [09:37<03:46, 2.03it/s]epoch: 2 loss: 0.1398710 f1: 0.7534884: 69%|██████▉ | 1038/1497 [09:37<03:47, 2.02it/s]epoch: 2 loss: 0.0430072 f1: 0.7534884: 69%|██████▉ | 1038/1497 [09:37<03:47, 2.02it/s]epoch: 2 loss: 0.0430072 f1: 0.7534884: 69%|██████▉ | 1039/1497 [09:37<03:46, 2.02it/s]epoch: 2 loss: 0.0086777 f1: 0.7534884: 69%|██████▉ | 1039/1497 [09:38<03:46, 2.02it/s]epoch: 2 loss: 0.0086777 f1: 0.7534884: 69%|██████▉ | 1040/1497 [09:38<03:46, 2.02it/s]epoch: 2 loss: 0.0389314 f1: 0.7534884: 69%|██████▉ | 1040/1497 [09:38<03:46, 2.02it/s]epoch: 2 loss: 0.0389314 f1: 0.7534884: 70%|██████▉ | 1041/1497 [09:38<03:46, 2.01it/s]epoch: 2 loss: 0.1170920 f1: 0.7534884: 70%|██████▉ | 1041/1497 [09:39<03:46, 2.01it/s]epoch: 2 loss: 0.1170920 f1: 0.7534884: 70%|██████▉ | 1042/1497 [09:39<03:48, 1.99it/s]epoch: 2 loss: 0.0585015 f1: 0.7534884: 70%|██████▉ | 1042/1497 [09:39<03:48, 1.99it/s]epoch: 2 loss: 0.0585015 f1: 0.7534884: 70%|██████▉ 
| 1043/1497 [09:39<03:48, 1.98it/s]epoch: 2 loss: 0.0192431 f1: 0.7534884: 70%|██████▉ | 1043/1497 [09:40<03:48, 1.98it/s]epoch: 2 loss: 0.0192431 f1: 0.7534884: 70%|██████▉ | 1044/1497 [09:40<03:47, 1.99it/s]epoch: 2 loss: 0.1202094 f1: 0.7534884: 70%|██████▉ | 1044/1497 [09:40<03:47, 1.99it/s]epoch: 2 loss: 0.1202094 f1: 0.7534884: 70%|██████▉ | 1045/1497 [09:40<03:44, 2.01it/s]epoch: 2 loss: 0.0581795 f1: 0.7534884: 70%|██████▉ | 1045/1497 [09:41<03:44, 2.01it/s]epoch: 2 loss: 0.0581795 f1: 0.7534884: 70%|██████▉ | 1046/1497 [09:41<03:44, 2.01it/s]epoch: 2 loss: 0.0040741 f1: 0.7534884: 70%|██████▉ | 1046/1497 [09:41<03:44, 2.01it/s]epoch: 2 loss: 0.0040741 f1: 0.7534884: 70%|██████▉ | 1047/1497 [09:41<03:45, 2.00it/s]epoch: 2 loss: 0.0525070 f1: 0.7534884: 70%|██████▉ | 1047/1497 [09:42<03:45, 2.00it/s]epoch: 2 loss: 0.0525070 f1: 0.7534884: 70%|███████ | 1048/1497 [09:42<03:45, 1.99it/s]epoch: 2 loss: 0.0096250 f1: 0.7534884: 70%|███████ | 1048/1497 [09:42<03:45, 1.99it/s]epoch: 2 loss: 0.0096250 f1: 0.7534884: 70%|███████ | 1049/1497 [09:42<03:45, 1.99it/s]epoch: 2 loss: 0.0442083 f1: 0.7534884: 70%|███████ | 1049/1497 [09:43<03:45, 1.99it/s]epoch: 2 loss: 0.0442083 f1: 0.7534884: 70%|███████ | 1050/1497 [09:43<03:49, 1.95it/s]epoch: 2 loss: 0.0133437 f1: 0.7534884: 70%|███████ | 1050/1497 [09:44<03:49, 1.95it/s]epoch: 2 loss: 0.0133437 f1: 0.7534884: 70%|███████ | 1051/1497 [09:44<04:14, 1.76it/s]epoch: 2 loss: 0.0191092 f1: 0.7534884: 70%|███████ | 1051/1497 [09:44<04:14, 1.76it/s]epoch: 2 loss: 0.0191092 f1: 0.7534884: 70%|███████ | 1052/1497 [09:44<04:07, 1.80it/s]epoch: 2 loss: 0.1335737 f1: 0.7534884: 70%|███████ | 1052/1497 [09:45<04:07, 1.80it/s]epoch: 2 loss: 0.1335737 f1: 0.7534884: 70%|███████ | 1053/1497 [09:45<04:00, 1.85it/s]epoch: 2 loss: 0.0980593 f1: 0.7534884: 70%|███████ | 1053/1497 [09:45<04:00, 1.85it/s]epoch: 2 loss: 0.0980593 f1: 0.7534884: 70%|███████ | 1054/1497 [09:45<03:53, 1.90it/s]epoch: 2 loss: 0.0936910 f1: 0.7534884: 
70%|███████ | 1054/1497 [09:46<03:53, 1.90it/s]epoch: 2 loss: 0.0936910 f1: 0.7534884: 70%|███████ | 1055/1497 [09:46<03:51, 1.91it/s]epoch: 2 loss: 0.0325529 f1: 0.7534884: 70%|███████ | 1055/1497 [09:46<03:51, 1.91it/s]epoch: 2 loss: 0.0325529 f1: 0.7534884: 71%|███████ | 1056/1497 [09:46<03:47, 1.94it/s]epoch: 2 loss: 0.0158520 f1: 0.7534884: 71%|███████ | 1056/1497 [09:47<03:47, 1.94it/s]epoch: 2 loss: 0.0158520 f1: 0.7534884: 71%|███████ | 1057/1497 [09:47<03:46, 1.94it/s]epoch: 2 loss: 0.0155487 f1: 0.7534884: 71%|███████ | 1057/1497 [09:47<03:46, 1.94it/s]epoch: 2 loss: 0.0155487 f1: 0.7534884: 71%|███████ | 1058/1497 [09:47<03:46, 1.94it/s]epoch: 2 loss: 0.1986545 f1: 0.7534884: 71%|███████ | 1058/1497 [09:48<03:46, 1.94it/s]epoch: 2 loss: 0.1986545 f1: 0.7534884: 71%|███████ | 1059/1497 [09:48<03:45, 1.94it/s]epoch: 2 loss: 0.1877207 f1: 0.7534884: 71%|███████ | 1059/1497 [09:48<03:45, 1.94it/s]epoch: 2 loss: 0.1877207 f1: 0.7534884: 71%|███████ | 1060/1497 [09:48<03:44, 1.94it/s]epoch: 2 loss: 0.3111150 f1: 0.7534884: 71%|███████ | 1060/1497 [09:49<03:44, 1.94it/s]epoch: 2 loss: 0.3111150 f1: 0.7534884: 71%|███████ | 1061/1497 [09:49<03:42, 1.96it/s]epoch: 2 loss: 0.0258977 f1: 0.7534884: 71%|███████ | 1061/1497 [09:49<03:42, 1.96it/s]epoch: 2 loss: 0.0258977 f1: 0.7534884: 71%|███████ | 1062/1497 [09:49<03:42, 1.96it/s]epoch: 2 loss: 0.0346662 f1: 0.7534884: 71%|███████ | 1062/1497 [09:50<03:42, 1.96it/s]epoch: 2 loss: 0.0346662 f1: 0.7534884: 71%|███████ | 1063/1497 [09:50<03:40, 1.97it/s]epoch: 2 loss: 0.0030645 f1: 0.7534884: 71%|███████ | 1063/1497 [09:50<03:40, 1.97it/s]epoch: 2 loss: 0.0030645 f1: 0.7534884: 71%|███████ | 1064/1497 [09:50<03:39, 1.97it/s]epoch: 2 loss: 0.1673554 f1: 0.7534884: 71%|███████ | 1064/1497 [09:51<03:39, 1.97it/s]epoch: 2 loss: 0.1673554 f1: 0.7534884: 71%|███████ | 1065/1497 [09:51<03:39, 1.97it/s]epoch: 2 loss: 0.0662018 f1: 0.7534884: 71%|███████ | 1065/1497 [09:51<03:39, 1.97it/s]epoch: 2 loss: 0.0662018 f1: 
0.7534884: 71%|███████ | 1066/1497 [09:51<03:39, 1.97it/s]epoch: 2 loss: 0.0306785 f1: 0.7534884: 71%|███████ | 1066/1497 [09:52<03:39, 1.97it/s]epoch: 2 loss: 0.0306785 f1: 0.7534884: 71%|███████▏ | 1067/1497 [09:52<03:38, 1.97it/s]epoch: 2 loss: 0.0128178 f1: 0.7534884: 71%|███████▏ | 1067/1497 [09:52<03:38, 1.97it/s]epoch: 2 loss: 0.0128178 f1: 0.7534884: 71%|███████▏ | 1068/1497 [09:52<03:37, 1.97it/s]epoch: 2 loss: 0.0472850 f1: 0.7534884: 71%|███████▏ | 1068/1497 [09:53<03:37, 1.97it/s]epoch: 2 loss: 0.0472850 f1: 0.7534884: 71%|███████▏ | 1069/1497 [09:53<03:37, 1.96it/s]epoch: 2 loss: 0.1678910 f1: 0.7534884: 71%|███████▏ | 1069/1497 [09:53<03:37, 1.96it/s]epoch: 2 loss: 0.1678910 f1: 0.7534884: 71%|███████▏ | 1070/1497 [09:53<03:36, 1.97it/s]epoch: 2 loss: 0.1043425 f1: 0.7534884: 71%|███████▏ | 1070/1497 [09:54<03:36, 1.97it/s]epoch: 2 loss: 0.1043425 f1: 0.7534884: 72%|███████▏ | 1071/1497 [09:54<03:36, 1.97it/s]epoch: 2 loss: 0.1384628 f1: 0.7534884: 72%|███████▏ | 1071/1497 [09:54<03:36, 1.97it/s]epoch: 2 loss: 0.1384628 f1: 0.7534884: 72%|███████▏ | 1072/1497 [09:54<03:35, 1.97it/s]epoch: 2 loss: 0.1752282 f1: 0.7534884: 72%|███████▏ | 1072/1497 [09:55<03:35, 1.97it/s]epoch: 2 loss: 0.1752282 f1: 0.7534884: 72%|███████▏ | 1073/1497 [09:55<03:35, 1.97it/s]epoch: 2 loss: 0.0226068 f1: 0.7534884: 72%|███████▏ | 1073/1497 [09:55<03:35, 1.97it/s]epoch: 2 loss: 0.0226068 f1: 0.7534884: 72%|███████▏ | 1074/1497 [09:55<03:34, 1.97it/s]epoch: 2 loss: 0.0160697 f1: 0.7534884: 72%|███████▏ | 1074/1497 [09:56<03:34, 1.97it/s]epoch: 2 loss: 0.0160697 f1: 0.7534884: 72%|███████▏ | 1075/1497 [09:56<03:33, 1.97it/s]epoch: 2 loss: 0.0999546 f1: 0.7534884: 72%|███████▏ | 1075/1497 [09:56<03:33, 1.97it/s]epoch: 2 loss: 0.0999546 f1: 0.7534884: 72%|███████▏ | 1076/1497 [09:56<03:34, 1.96it/s]epoch: 2 loss: 0.0279161 f1: 0.7534884: 72%|███████▏ | 1076/1497 [09:57<03:34, 1.96it/s]epoch: 2 loss: 0.0279161 f1: 0.7534884: 72%|███████▏ | 1077/1497 [09:57<03:34, 1.96it/s]epoch: 
2 loss: 0.0310759 f1: 0.7534884: 72%|███████▏ | 1077/1497 [09:57<03:34, 1.96it/s]epoch: 2 loss: 0.0310759 f1: 0.7534884: 72%|███████▏ | 1078/1497 [09:57<03:34, 1.96it/s]epoch: 2 loss: 0.0597574 f1: 0.7534884: 72%|███████▏ | 1078/1497 [09:58<03:34, 1.96it/s]epoch: 2 loss: 0.0597574 f1: 0.7534884: 72%|███████▏ | 1079/1497 [09:58<03:33, 1.95it/s]epoch: 2 loss: 0.0092003 f1: 0.7534884: 72%|███████▏ | 1079/1497 [09:58<03:33, 1.95it/s]epoch: 2 loss: 0.0092003 f1: 0.7534884: 72%|███████▏ | 1080/1497 [09:58<03:33, 1.95it/s]epoch: 2 loss: 0.0189401 f1: 0.7534884: 72%|███████▏ | 1080/1497 [09:59<03:33, 1.95it/s]epoch: 2 loss: 0.0189401 f1: 0.7534884: 72%|███████▏ | 1081/1497 [09:59<03:33, 1.95it/s]epoch: 2 loss: 0.0221404 f1: 0.7534884: 72%|███████▏ | 1081/1497 [09:59<03:33, 1.95it/s]epoch: 2 loss: 0.0221404 f1: 0.7534884: 72%|███████▏ | 1082/1497 [09:59<03:32, 1.96it/s]epoch: 2 loss: 0.1205868 f1: 0.7534884: 72%|███████▏ | 1082/1497 [10:00<03:32, 1.96it/s]epoch: 2 loss: 0.1205868 f1: 0.7534884: 72%|███████▏ | 1083/1497 [10:00<03:31, 1.96it/s]epoch: 2 loss: 0.0620931 f1: 0.7534884: 72%|███████▏ | 1083/1497 [10:00<03:31, 1.96it/s]epoch: 2 loss: 0.0620931 f1: 0.7534884: 72%|███████▏ | 1084/1497 [10:00<03:29, 1.97it/s]epoch: 2 loss: 0.0166054 f1: 0.7534884: 72%|███████▏ | 1084/1497 [10:01<03:29, 1.97it/s]epoch: 2 loss: 0.0166054 f1: 0.7534884: 72%|███████▏ | 1085/1497 [10:01<03:28, 1.98it/s]epoch: 2 loss: 0.1152172 f1: 0.7534884: 72%|███████▏ | 1085/1497 [10:01<03:28, 1.98it/s]epoch: 2 loss: 0.1152172 f1: 0.7534884: 73%|███████▎ | 1086/1497 [10:01<03:27, 1.99it/s]epoch: 2 loss: 0.0135624 f1: 0.7534884: 73%|███████▎ | 1086/1497 [10:02<03:27, 1.99it/s]epoch: 2 loss: 0.0135624 f1: 0.7534884: 73%|███████▎ | 1087/1497 [10:02<03:25, 1.99it/s]epoch: 2 loss: 0.0095066 f1: 0.7534884: 73%|███████▎ | 1087/1497 [10:02<03:25, 1.99it/s]epoch: 2 loss: 0.0095066 f1: 0.7534884: 73%|███████▎ | 1088/1497 [10:02<03:25, 1.99it/s]epoch: 2 loss: 0.0238297 f1: 0.7534884: 73%|███████▎ | 1088/1497 
[10:03<03:25, 1.99it/s]epoch: 2 loss: 0.0238297 f1: 0.7534884: 73%|███████▎ | 1089/1497 [10:03<03:24, 1.99it/s]epoch: 2 loss: 0.0298869 f1: 0.7534884: 73%|███████▎ | 1089/1497 [10:03<03:24, 1.99it/s]epoch: 2 loss: 0.0298869 f1: 0.7534884: 73%|███████▎ | 1090/1497 [10:03<03:25, 1.98it/s]epoch: 2 loss: 0.0379610 f1: 0.7534884: 73%|███████▎ | 1090/1497 [10:04<03:25, 1.98it/s]epoch: 2 loss: 0.0379610 f1: 0.7534884: 73%|███████▎ | 1091/1497 [10:04<03:25, 1.97it/s]epoch: 2 loss: 0.0134412 f1: 0.7534884: 73%|███████▎ | 1091/1497 [10:05<03:25, 1.97it/s]epoch: 2 loss: 0.0134412 f1: 0.7534884: 73%|███████▎ | 1092/1497 [10:05<03:29, 1.93it/s]epoch: 2 loss: 0.0490644 f1: 0.7534884: 73%|███████▎ | 1092/1497 [10:05<03:29, 1.93it/s]epoch: 2 loss: 0.0490644 f1: 0.7534884: 73%|███████▎ | 1093/1497 [10:05<03:27, 1.94it/s]epoch: 2 loss: 0.0103490 f1: 0.7534884: 73%|███████▎ | 1093/1497 [10:06<03:27, 1.94it/s]epoch: 2 loss: 0.0103490 f1: 0.7534884: 73%|███████▎ | 1094/1497 [10:06<03:25, 1.96it/s]epoch: 2 loss: 0.0179126 f1: 0.7534884: 73%|███████▎ | 1094/1497 [10:06<03:25, 1.96it/s]epoch: 2 loss: 0.0179126 f1: 0.7534884: 73%|███████▎ | 1095/1497 [10:06<03:24, 1.97it/s]epoch: 2 loss: 0.0301734 f1: 0.7534884: 73%|███████▎ | 1095/1497 [10:07<03:24, 1.97it/s]epoch: 2 loss: 0.0301734 f1: 0.7534884: 73%|███████▎ | 1096/1497 [10:07<03:23, 1.97it/s]epoch: 2 loss: 0.0305228 f1: 0.7534884: 73%|███████▎ | 1096/1497 [10:07<03:23, 1.97it/s]epoch: 2 loss: 0.0305228 f1: 0.7534884: 73%|███████▎ | 1097/1497 [10:07<03:22, 1.97it/s]epoch: 2 loss: 0.0042801 f1: 0.7534884: 73%|███████▎ | 1097/1497 [10:08<03:22, 1.97it/s]epoch: 2 loss: 0.0042801 f1: 0.7534884: 73%|███████▎ | 1098/1497 [10:08<03:21, 1.98it/s]epoch: 2 loss: 0.0474587 f1: 0.7534884: 73%|███████▎ | 1098/1497 [10:08<03:21, 1.98it/s]epoch: 2 loss: 0.0474587 f1: 0.7534884: 73%|███████▎ | 1099/1497 [10:08<03:21, 1.97it/s]epoch: 2 loss: 0.1419346 f1: 0.7534884: 73%|███████▎ | 1099/1497 [10:09<03:21, 1.97it/s]epoch: 2 loss: 0.1419346 f1: 0.7534884: 
73%|███████▎ | 1100/1497 [10:09<03:20, 1.98it/s]epoch: 2 loss: 0.1025040 f1: 0.7534884: 73%|███████▎ | 1100/1497 [10:09<03:20, 1.98it/s]epoch: 2 loss: 0.1025040 f1: 0.7534884: 74%|███████▎ | 1101/1497 [10:09<03:18, 1.99it/s]epoch: 2 loss: 0.0241082 f1: 0.7534884: 74%|███████▎ | 1101/1497 [10:10<03:18, 1.99it/s]epoch: 2 loss: 0.0241082 f1: 0.7534884: 74%|███████▎ | 1102/1497 [10:10<03:17, 2.00it/s]epoch: 2 loss: 0.0068784 f1: 0.7534884: 74%|███████▎ | 1102/1497 [10:10<03:17, 2.00it/s]epoch: 2 loss: 0.0068784 f1: 0.7534884: 74%|███████▎ | 1103/1497 [10:10<03:15, 2.01it/s]epoch: 2 loss: 0.0414637 f1: 0.7534884: 74%|███████▎ | 1103/1497 [10:11<03:15, 2.01it/s]epoch: 2 loss: 0.0414637 f1: 0.7534884: 74%|███████▎ | 1104/1497 [10:11<03:14, 2.02it/s]epoch: 2 loss: 0.0563039 f1: 0.7534884: 74%|███████▎ | 1104/1497 [10:11<03:14, 2.02it/s]epoch: 2 loss: 0.0563039 f1: 0.7534884: 74%|███████▍ | 1105/1497 [10:11<03:13, 2.03it/s]epoch: 2 loss: 0.1149855 f1: 0.7534884: 74%|███████▍ | 1105/1497 [10:11<03:13, 2.03it/s]epoch: 2 loss: 0.1149855 f1: 0.7534884: 74%|███████▍ | 1106/1497 [10:11<03:11, 2.05it/s]epoch: 2 loss: 0.0308681 f1: 0.7534884: 74%|███████▍ | 1106/1497 [10:12<03:11, 2.05it/s]epoch: 2 loss: 0.0308681 f1: 0.7534884: 74%|███████▍ | 1107/1497 [10:12<03:09, 2.06it/s]epoch: 2 loss: 0.0094040 f1: 0.7534884: 74%|███████▍ | 1107/1497 [10:12<03:09, 2.06it/s]epoch: 2 loss: 0.0094040 f1: 0.7534884: 74%|███████▍ | 1108/1497 [10:12<03:08, 2.06it/s]epoch: 2 loss: 0.0110445 f1: 0.7534884: 74%|███████▍ | 1108/1497 [10:13<03:08, 2.06it/s]epoch: 2 loss: 0.0110445 f1: 0.7534884: 74%|███████▍ | 1109/1497 [10:13<03:08, 2.06it/s]epoch: 2 loss: 0.0379794 f1: 0.7534884: 74%|███████▍ | 1109/1497 [10:13<03:08, 2.06it/s]epoch: 2 loss: 0.0379794 f1: 0.7534884: 74%|███████▍ | 1110/1497 [10:13<03:07, 2.06it/s]epoch: 2 loss: 0.0109009 f1: 0.7534884: 74%|███████▍ | 1110/1497 [10:14<03:07, 2.06it/s]epoch: 2 loss: 0.0109009 f1: 0.7534884: 74%|███████▍ | 1111/1497 [10:14<03:08, 2.05it/s]epoch: 2 loss: 
0.0920164 f1: 0.7534884: 74%|███████▍ | 1111/1497 [10:14<03:08, 2.05it/s]epoch: 2 loss: 0.0920164 f1: 0.7534884: 74%|███████▍ | 1112/1497 [10:14<03:08, 2.04it/s]epoch: 2 loss: 0.0590580 f1: 0.7534884: 74%|███████▍ | 1112/1497 [10:15<03:08, 2.04it/s]epoch: 2 loss: 0.0590580 f1: 0.7534884: 74%|███████▍ | 1113/1497 [10:15<03:07, 2.04it/s]epoch: 2 loss: 0.0226469 f1: 0.7534884: 74%|███████▍ | 1113/1497 [10:15<03:07, 2.04it/s]epoch: 2 loss: 0.0226469 f1: 0.7534884: 74%|███████▍ | 1114/1497 [10:15<03:08, 2.03it/s]epoch: 2 loss: 0.0280457 f1: 0.7534884: 74%|███████▍ | 1114/1497 [10:16<03:08, 2.03it/s]epoch: 2 loss: 0.0280457 f1: 0.7534884: 74%|███████▍ | 1115/1497 [10:16<03:07, 2.04it/s]epoch: 2 loss: 0.0607655 f1: 0.7534884: 74%|███████▍ | 1115/1497 [10:16<03:07, 2.04it/s]epoch: 2 loss: 0.0607655 f1: 0.7534884: 75%|███████▍ | 1116/1497 [10:16<03:07, 2.04it/s]epoch: 2 loss: 0.1421725 f1: 0.7534884: 75%|███████▍ | 1116/1497 [10:17<03:07, 2.04it/s]epoch: 2 loss: 0.1421725 f1: 0.7534884: 75%|███████▍ | 1117/1497 [10:17<03:07, 2.03it/s]epoch: 2 loss: 0.1789155 f1: 0.7534884: 75%|███████▍ | 1117/1497 [10:17<03:07, 2.03it/s]epoch: 2 loss: 0.1789155 f1: 0.7534884: 75%|███████▍ | 1118/1497 [10:17<03:08, 2.02it/s]epoch: 2 loss: 0.0167171 f1: 0.7534884: 75%|███████▍ | 1118/1497 [10:18<03:08, 2.02it/s]epoch: 2 loss: 0.0167171 f1: 0.7534884: 75%|███████▍ | 1119/1497 [10:18<03:08, 2.01it/s]epoch: 2 loss: 0.0028308 f1: 0.7534884: 75%|███████▍ | 1119/1497 [10:18<03:08, 2.01it/s]epoch: 2 loss: 0.0028308 f1: 0.7534884: 75%|███████▍ | 1120/1497 [10:18<03:09, 1.99it/s]epoch: 2 loss: 0.0632180 f1: 0.7534884: 75%|███████▍ | 1120/1497 [10:19<03:09, 1.99it/s]epoch: 2 loss: 0.0632180 f1: 0.7534884: 75%|███████▍ | 1121/1497 [10:19<03:09, 1.98it/s]epoch: 2 loss: 0.0468635 f1: 0.7534884: 75%|███████▍ | 1121/1497 [10:19<03:09, 1.98it/s]epoch: 2 loss: 0.0468635 f1: 0.7534884: 75%|███████▍ | 1122/1497 [10:19<03:10, 1.97it/s]epoch: 2 loss: 0.0555965 f1: 0.7534884: 75%|███████▍ | 1122/1497 [10:20<03:10, 
1.97it/s]epoch: 2 loss: 0.0555965 f1: 0.7534884: 75%|███████▌ | 1123/1497 [10:20<03:09, 1.98it/s]epoch: 2 loss: 0.0272552 f1: 0.7534884: 75%|███████▌ | 1123/1497 [10:20<03:09, 1.98it/s]epoch: 2 loss: 0.0272552 f1: 0.7534884: 75%|███████▌ | 1124/1497 [10:20<03:09, 1.97it/s]epoch: 2 loss: 0.0065576 f1: 0.7534884: 75%|███████▌ | 1124/1497 [10:21<03:09, 1.97it/s]epoch: 2 loss: 0.0065576 f1: 0.7534884: 75%|███████▌ | 1125/1497 [10:21<03:08, 1.97it/s]epoch: 2 loss: 0.0593253 f1: 0.7534884: 75%|███████▌ | 1125/1497 [10:21<03:08, 1.97it/s]epoch: 2 loss: 0.0593253 f1: 0.7534884: 75%|███████▌ | 1126/1497 [10:21<03:09, 1.96it/s]epoch: 2 loss: 0.0044550 f1: 0.7534884: 75%|███████▌ | 1126/1497 [10:22<03:09, 1.96it/s]epoch: 2 loss: 0.0044550 f1: 0.7534884: 75%|███████▌ | 1127/1497 [10:22<03:08, 1.96it/s]epoch: 2 loss: 0.0150429 f1: 0.7534884: 75%|███████▌ | 1127/1497 [10:22<03:08, 1.96it/s]epoch: 2 loss: 0.0150429 f1: 0.7534884: 75%|███████▌ | 1128/1497 [10:22<03:08, 1.96it/s]epoch: 2 loss: 0.0611229 f1: 0.7534884: 75%|███████▌ | 1128/1497 [10:23<03:08, 1.96it/s]epoch: 2 loss: 0.0611229 f1: 0.7534884: 75%|███████▌ | 1129/1497 [10:23<03:07, 1.97it/s]epoch: 2 loss: 0.0111162 f1: 0.7534884: 75%|███████▌ | 1129/1497 [10:24<03:07, 1.97it/s]epoch: 2 loss: 0.0111162 f1: 0.7534884: 75%|███████▌ | 1130/1497 [10:24<03:06, 1.96it/s]epoch: 2 loss: 0.0378722 f1: 0.7534884: 75%|███████▌ | 1130/1497 [10:24<03:06, 1.96it/s]epoch: 2 loss: 0.0378722 f1: 0.7534884: 76%|███████▌ | 1131/1497 [10:24<03:06, 1.96it/s]epoch: 2 loss: 0.0683741 f1: 0.7534884: 76%|███████▌ | 1131/1497 [10:25<03:06, 1.96it/s]epoch: 2 loss: 0.0683741 f1: 0.7534884: 76%|███████▌ | 1132/1497 [10:25<03:07, 1.95it/s]epoch: 2 loss: 0.0574706 f1: 0.7534884: 76%|███████▌ | 1132/1497 [10:25<03:07, 1.95it/s]epoch: 2 loss: 0.0574706 f1: 0.7534884: 76%|███████▌ | 1133/1497 [10:25<03:10, 1.91it/s]epoch: 2 loss: 0.0036797 f1: 0.7534884: 76%|███████▌ | 1133/1497 [10:26<03:10, 1.91it/s]epoch: 2 loss: 0.0036797 f1: 0.7534884: 76%|███████▌ | 
1134/1497 [10:26<03:08, 1.92it/s]epoch: 2 loss: 0.0575976 f1: 0.7534884: 76%|███████▌ | 1134/1497 [10:26<03:08, 1.92it/s]epoch: 2 loss: 0.0575976 f1: 0.7534884: 76%|███████▌ | 1135/1497 [10:26<03:06, 1.94it/s]epoch: 2 loss: 0.0191036 f1: 0.7534884: 76%|███████▌ | 1135/1497 [10:27<03:06, 1.94it/s]epoch: 2 loss: 0.0191036 f1: 0.7534884: 76%|███████▌ | 1136/1497 [10:27<03:05, 1.94it/s]epoch: 2 loss: 0.0266958 f1: 0.7534884: 76%|███████▌ | 1136/1497 [10:27<03:05, 1.94it/s]epoch: 2 loss: 0.0266958 f1: 0.7534884: 76%|███████▌ | 1137/1497 [10:27<03:04, 1.95it/s]epoch: 2 loss: 0.0135337 f1: 0.7534884: 76%|███████▌ | 1137/1497 [10:28<03:04, 1.95it/s]epoch: 2 loss: 0.0135337 f1: 0.7534884: 76%|███████▌ | 1138/1497 [10:28<03:04, 1.95it/s]epoch: 2 loss: 0.0946704 f1: 0.7534884: 76%|███████▌ | 1138/1497 [10:28<03:04, 1.95it/s]epoch: 2 loss: 0.0946704 f1: 0.7534884: 76%|███████▌ | 1139/1497 [10:28<03:04, 1.94it/s]epoch: 2 loss: 0.0303324 f1: 0.7534884: 76%|███████▌ | 1139/1497 [10:29<03:04, 1.94it/s]epoch: 2 loss: 0.0303324 f1: 0.7534884: 76%|███████▌ | 1140/1497 [10:29<03:03, 1.95it/s]epoch: 2 loss: 0.0205169 f1: 0.7534884: 76%|███████▌ | 1140/1497 [10:29<03:03, 1.95it/s]epoch: 2 loss: 0.0205169 f1: 0.7534884: 76%|███████▌ | 1141/1497 [10:29<03:02, 1.95it/s]epoch: 2 loss: 0.0181149 f1: 0.7534884: 76%|███████▌ | 1141/1497 [10:30<03:02, 1.95it/s]epoch: 2 loss: 0.0181149 f1: 0.7534884: 76%|███████▋ | 1142/1497 [10:30<03:02, 1.95it/s]epoch: 2 loss: 0.0063272 f1: 0.7534884: 76%|███████▋ | 1142/1497 [10:30<03:02, 1.95it/s]epoch: 2 loss: 0.0063272 f1: 0.7534884: 76%|███████▋ | 1143/1497 [10:30<03:01, 1.95it/s]epoch: 2 loss: 0.0149962 f1: 0.7534884: 76%|███████▋ | 1143/1497 [10:31<03:01, 1.95it/s]epoch: 2 loss: 0.0149962 f1: 0.7534884: 76%|███████▋ | 1144/1497 [10:31<02:59, 1.96it/s]epoch: 2 loss: 0.0418679 f1: 0.7534884: 76%|███████▋ | 1144/1497 [10:31<02:59, 1.96it/s]epoch: 2 loss: 0.0418679 f1: 0.7534884: 76%|███████▋ | 1145/1497 [10:31<02:58, 1.97it/s]epoch: 2 loss: 0.0138218 f1: 
0.7534884: 76%|███████▋ | 1145/1497 [10:32<02:58, 1.97it/s]epoch: 2 loss: 0.0138218 f1: 0.7534884: 77%|███████▋ | 1146/1497 [10:32<02:58, 1.97it/s]epoch: 2 loss: 0.0029506 f1: 0.7534884: 77%|███████▋ | 1146/1497 [10:32<02:58, 1.97it/s]epoch: 2 loss: 0.0029506 f1: 0.7534884: 77%|███████▋ | 1147/1497 [10:32<02:58, 1.96it/s]epoch: 2 loss: 0.0456489 f1: 0.7534884: 77%|███████▋ | 1147/1497 [10:33<02:58, 1.96it/s]epoch: 2 loss: 0.0456489 f1: 0.7534884: 77%|███████▋ | 1148/1497 [10:33<02:56, 1.98it/s]epoch: 2 loss: 0.0234213 f1: 0.7534884: 77%|███████▋ | 1148/1497 [10:33<02:56, 1.98it/s]epoch: 2 loss: 0.0234213 f1: 0.7534884: 77%|███████▋ | 1149/1497 [10:33<02:55, 1.98it/s]epoch: 2 loss: 0.0141909 f1: 0.7534884: 77%|███████▋ | 1149/1497 [10:34<02:55, 1.98it/s]epoch: 2 loss: 0.0141909 f1: 0.7534884: 77%|███████▋ | 1150/1497 [10:34<02:55, 1.97it/s]epoch: 2 loss: 0.0303999 f1: 0.7534884: 77%|███████▋ | 1150/1497 [10:34<02:55, 1.97it/s]epoch: 2 loss: 0.0303999 f1: 0.7534884: 77%|███████▋ | 1151/1497 [10:34<02:55, 1.97it/s]epoch: 2 loss: 0.0164176 f1: 0.7534884: 77%|███████▋ | 1151/1497 [10:35<02:55, 1.97it/s]epoch: 2 loss: 0.0164176 f1: 0.7534884: 77%|███████▋ | 1152/1497 [10:35<02:55, 1.97it/s]epoch: 2 loss: 0.1389565 f1: 0.7534884: 77%|███████▋ | 1152/1497 [10:35<02:55, 1.97it/s]epoch: 2 loss: 0.1389565 f1: 0.7534884: 77%|███████▋ | 1153/1497 [10:35<02:54, 1.98it/s]epoch: 2 loss: 0.0505366 f1: 0.7534884: 77%|███████▋ | 1153/1497 [10:36<02:54, 1.98it/s]epoch: 2 loss: 0.0505366 f1: 0.7534884: 77%|███████▋ | 1154/1497 [10:36<02:53, 1.98it/s]epoch: 2 loss: 0.1248598 f1: 0.7534884: 77%|███████▋ | 1154/1497 [10:36<02:53, 1.98it/s]epoch: 2 loss: 0.1248598 f1: 0.7534884: 77%|███████▋ | 1155/1497 [10:36<02:52, 1.98it/s]epoch: 2 loss: 0.0724297 f1: 0.7534884: 77%|███████▋ | 1155/1497 [10:37<02:52, 1.98it/s]epoch: 2 loss: 0.0724297 f1: 0.7534884: 77%|███████▋ | 1156/1497 [10:37<02:52, 1.98it/s]epoch: 2 loss: 0.0359647 f1: 0.7534884: 77%|███████▋ | 1156/1497 [10:37<02:52, 
1.98it/s]epoch: 2 loss: 0.0359647 f1: 0.7534884: 77%|███████▋ | 1157/1497 [10:37<02:51, 1.98it/s]epoch: 2 loss: 0.0281102 f1: 0.7534884: 77%|███████▋ | 1157/1497 [10:38<02:51, 1.98it/s]epoch: 2 loss: 0.0281102 f1: 0.7534884: 77%|███████▋ | 1158/1497 [10:38<02:50, 1.99it/s]epoch: 2 loss: 0.0170195 f1: 0.7534884: 77%|███████▋ | 1158/1497 [10:38<02:50, 1.99it/s]epoch: 2 loss: 0.0170195 f1: 0.7534884: 77%|███████▋ | 1159/1497 [10:38<02:50, 1.98it/s]epoch: 2 loss: 0.0303610 f1: 0.7534884: 77%|███████▋ | 1159/1497 [10:39<02:50, 1.98it/s]epoch: 2 loss: 0.0303610 f1: 0.7534884: 77%|███████▋ | 1160/1497 [10:39<02:50, 1.98it/s]epoch: 2 loss: 0.0612472 f1: 0.7534884: 77%|███████▋ | 1160/1497 [10:39<02:50, 1.98it/s]epoch: 2 loss: 0.0612472 f1: 0.7534884: 78%|███████▊ | 1161/1497 [10:39<02:48, 2.00it/s]epoch: 2 loss: 0.0030926 f1: 0.7534884: 78%|███████▊ | 1161/1497 [10:40<02:48, 2.00it/s]epoch: 2 loss: 0.0030926 f1: 0.7534884: 78%|███████▊ | 1162/1497 [10:40<02:48, 1.98it/s]epoch: 2 loss: 0.0104398 f1: 0.7534884: 78%|███████▊ | 1162/1497 [10:40<02:48, 1.98it/s]epoch: 2 loss: 0.0104398 f1: 0.7534884: 78%|███████▊ | 1163/1497 [10:40<02:47, 2.00it/s]epoch: 2 loss: 0.0245045 f1: 0.7534884: 78%|███████▊ | 1163/1497 [10:41<02:47, 2.00it/s]epoch: 2 loss: 0.0245045 f1: 0.7534884: 78%|███████▊ | 1164/1497 [10:41<02:47, 1.99it/s]epoch: 2 loss: 0.0230896 f1: 0.7534884: 78%|███████▊ | 1164/1497 [10:41<02:47, 1.99it/s]epoch: 2 loss: 0.0230896 f1: 0.7534884: 78%|███████▊ | 1165/1497 [10:41<02:47, 1.99it/s]epoch: 2 loss: 0.0137035 f1: 0.7534884: 78%|███████▊ | 1165/1497 [10:42<02:47, 1.99it/s]epoch: 2 loss: 0.0137035 f1: 0.7534884: 78%|███████▊ | 1166/1497 [10:42<02:46, 1.98it/s]epoch: 2 loss: 0.0409989 f1: 0.7534884: 78%|███████▊ | 1166/1497 [10:42<02:46, 1.98it/s]epoch: 2 loss: 0.0409989 f1: 0.7534884: 78%|███████▊ | 1167/1497 [10:42<02:45, 1.99it/s]epoch: 2 loss: 0.0184860 f1: 0.7534884: 78%|███████▊ | 1167/1497 [10:43<02:45, 1.99it/s]epoch: 2 loss: 0.0184860 f1: 0.7534884: 78%|███████▊ | 
1168/1497 [10:43<02:45, 1.98it/s]epoch: 2 loss: 0.1164132 f1: 0.7534884: 78%|███████▊ | 1168/1497 [10:43<02:45, 1.98it/s]epoch: 2 loss: 0.1164132 f1: 0.7534884: 78%|███████▊ | 1169/1497 [10:43<02:45, 1.98it/s]epoch: 2 loss: 0.0767827 f1: 0.7534884: 78%|███████▊ | 1169/1497 [10:44<02:45, 1.98it/s]epoch: 2 loss: 0.0767827 f1: 0.7534884: 78%|███████▊ | 1170/1497 [10:44<02:44, 1.99it/s]epoch: 2 loss: 0.0286695 f1: 0.7534884: 78%|███████▊ | 1170/1497 [10:44<02:44, 1.99it/s]epoch: 2 loss: 0.0286695 f1: 0.7534884: 78%|███████▊ | 1171/1497 [10:44<02:44, 1.98it/s]epoch: 2 loss: 0.0784217 f1: 0.7534884: 78%|███████▊ | 1171/1497 [10:45<02:44, 1.98it/s]epoch: 2 loss: 0.0784217 f1: 0.7534884: 78%|███████▊ | 1172/1497 [10:45<02:44, 1.98it/s]epoch: 2 loss: 0.0253692 f1: 0.7534884: 78%|███████▊ | 1172/1497 [10:45<02:44, 1.98it/s]epoch: 2 loss: 0.0253692 f1: 0.7534884: 78%|███████▊ | 1173/1497 [10:45<02:45, 1.96it/s]epoch: 2 loss: 0.2500013 f1: 0.7534884: 78%|███████▊ | 1173/1497 [10:46<02:45, 1.96it/s]epoch: 2 loss: 0.2500013 f1: 0.7534884: 78%|███████▊ | 1174/1497 [10:46<02:45, 1.95it/s]epoch: 2 loss: 0.0360982 f1: 0.7534884: 78%|███████▊ | 1174/1497 [10:46<02:45, 1.95it/s]epoch: 2 loss: 0.0360982 f1: 0.7534884: 78%|███████▊ | 1175/1497 [10:46<02:45, 1.95it/s]epoch: 2 loss: 0.1131364 f1: 0.7534884: 78%|███████▊ | 1175/1497 [10:47<02:45, 1.95it/s]epoch: 2 loss: 0.1131364 f1: 0.7534884: 79%|███████▊ | 1176/1497 [10:47<02:43, 1.96it/s]epoch: 2 loss: 0.0098800 f1: 0.7534884: 79%|███████▊ | 1176/1497 [10:47<02:43, 1.96it/s]epoch: 2 loss: 0.0098800 f1: 0.7534884: 79%|███████▊ | 1177/1497 [10:47<02:43, 1.96it/s]epoch: 2 loss: 0.0913888 f1: 0.7534884: 79%|███████▊ | 1177/1497 [10:48<02:43, 1.96it/s]epoch: 2 loss: 0.0913888 f1: 0.7534884: 79%|███████▊ | 1178/1497 [10:48<02:42, 1.97it/s]epoch: 2 loss: 0.0387895 f1: 0.7534884: 79%|███████▊ | 1178/1497 [10:48<02:42, 1.97it/s]epoch: 2 loss: 0.0387895 f1: 0.7534884: 79%|███████▉ | 1179/1497 [10:48<02:41, 1.97it/s]epoch: 2 loss: 0.0243455 f1: 
0.7534884: 79%|███████▉ | 1179/1497 [10:49<02:41, 1.97it/s]epoch: 2 loss: 0.0243455 f1: 0.7534884: 79%|███████▉ | 1180/1497 [10:49<02:41, 1.97it/s]epoch: 2 loss: 0.0146143 f1: 0.7534884: 79%|███████▉ | 1180/1497 [10:49<02:41, 1.97it/s]epoch: 2 loss: 0.0146143 f1: 0.7534884: 79%|███████▉ | 1181/1497 [10:49<02:40, 1.97it/s]epoch: 2 loss: 0.0320289 f1: 0.7534884: 79%|███████▉ | 1181/1497 [10:50<02:40, 1.97it/s]epoch: 2 loss: 0.0320289 f1: 0.7534884: 79%|███████▉ | 1182/1497 [10:50<02:39, 1.97it/s]epoch: 2 loss: 0.0221362 f1: 0.7534884: 79%|███████▉ | 1182/1497 [10:50<02:39, 1.97it/s]epoch: 2 loss: 0.0221362 f1: 0.7534884: 79%|███████▉ | 1183/1497 [10:50<02:38, 1.98it/s]epoch: 2 loss: 0.1160370 f1: 0.7534884: 79%|███████▉ | 1183/1497 [10:51<02:38, 1.98it/s]epoch: 2 loss: 0.1160370 f1: 0.7534884: 79%|███████▉ | 1184/1497 [10:51<02:35, 2.01it/s]epoch: 2 loss: 0.0272077 f1: 0.7534884: 79%|███████▉ | 1184/1497 [10:51<02:35, 2.01it/s]epoch: 2 loss: 0.0272077 f1: 0.7534884: 79%|███████▉ | 1185/1497 [10:51<02:34, 2.02it/s]epoch: 2 loss: 0.1144423 f1: 0.7534884: 79%|███████▉ | 1185/1497 [10:52<02:34, 2.02it/s]epoch: 2 loss: 0.1144423 f1: 0.7534884: 79%|███████▉ | 1186/1497 [10:52<02:34, 2.02it/s]epoch: 2 loss: 0.0035138 f1: 0.7534884: 79%|███████▉ | 1186/1497 [10:52<02:34, 2.02it/s]epoch: 2 loss: 0.0035138 f1: 0.7534884: 79%|███████▉ | 1187/1497 [10:52<02:34, 2.01it/s]epoch: 2 loss: 0.0100733 f1: 0.7534884: 79%|███████▉ | 1187/1497 [10:53<02:34, 2.01it/s]epoch: 2 loss: 0.0100733 f1: 0.7534884: 79%|███████▉ | 1188/1497 [10:53<02:34, 2.00it/s]epoch: 2 loss: 0.0260010 f1: 0.7534884: 79%|███████▉ | 1188/1497 [10:53<02:34, 2.00it/s]epoch: 2 loss: 0.0260010 f1: 0.7534884: 79%|███████▉ | 1189/1497 [10:53<02:34, 1.99it/s]epoch: 2 loss: 0.0653102 f1: 0.7534884: 79%|███████▉ | 1189/1497 [10:54<02:34, 1.99it/s]epoch: 2 loss: 0.0653102 f1: 0.7534884: 79%|███████▉ | 1190/1497 [10:54<02:34, 1.99it/s]epoch: 2 loss: 0.1123746 f1: 0.7534884: 79%|███████▉ | 1190/1497 [10:54<02:34, 
1.99it/s]epoch: 2 loss: 0.1123746 f1: 0.7534884: 80%|███████▉ | 1191/1497 [10:54<02:34, 1.98it/s]epoch: 2 loss: 0.0638909 f1: 0.7534884: 80%|███████▉ | 1191/1497 [10:55<02:34, 1.98it/s]epoch: 2 loss: 0.0638909 f1: 0.7534884: 80%|███████▉ | 1192/1497 [10:55<02:36, 1.95it/s]epoch: 2 loss: 0.0493792 f1: 0.7534884: 80%|███████▉ | 1192/1497 [10:55<02:36, 1.95it/s]epoch: 2 loss: 0.0493792 f1: 0.7534884: 80%|███████▉ | 1193/1497 [10:55<02:36, 1.94it/s]epoch: 2 loss: 0.3368614 f1: 0.7534884: 80%|███████▉ | 1193/1497 [10:56<02:36, 1.94it/s]epoch: 2 loss: 0.3368614 f1: 0.7534884: 80%|███████▉ | 1194/1497 [10:56<02:36, 1.94it/s]epoch: 2 loss: 0.0036864 f1: 0.7534884: 80%|███████▉ | 1194/1497 [10:57<02:36, 1.94it/s]epoch: 2 loss: 0.0036864 f1: 0.7534884: 80%|███████▉ | 1195/1497 [10:57<02:35, 1.95it/s]epoch: 2 loss: 0.0160526 f1: 0.7534884: 80%|███████▉ | 1195/1497 [10:57<02:35, 1.95it/s]epoch: 2 loss: 0.0160526 f1: 0.7534884: 80%|███████▉ | 1196/1497 [10:57<02:34, 1.95it/s]epoch: 2 loss: 0.0108830 f1: 0.7534884: 80%|███████▉ | 1196/1497 [10:58<02:34, 1.95it/s]epoch: 2 loss: 0.0108830 f1: 0.7534884: 80%|███████▉ | 1197/1497 [10:58<02:33, 1.95it/s]epoch: 2 loss: 0.0184482 f1: 0.7534884: 80%|███████▉ | 1197/1497 [10:58<02:33, 1.95it/s]epoch: 2 loss: 0.0184482 f1: 0.7534884: 80%|████████ | 1198/1497 [10:58<02:33, 1.95it/s]epoch: 2 loss: 0.0186377 f1: 0.7534884: 80%|████████ | 1198/1497 [10:59<02:33, 1.95it/s]epoch: 2 loss: 0.0186377 f1: 0.7534884: 80%|████████ | 1199/1497 [10:59<02:32, 1.95it/s]epoch: 2 loss: 0.0033986 f1: 0.7534884: 80%|████████ | 1199/1497 [10:59<02:32, 1.95it/s]epoch: 2 loss: 0.0033986 f1: 0.7534884: 80%|████████ | 1200/1497 [10:59<02:32, 1.94it/s]epoch: 2 loss: 0.0255281 f1: 0.7534884: 80%|████████ | 1200/1497 [11:00<02:32, 1.94it/s]epoch: 2 loss: 0.0255281 f1: 0.7534884: 80%|████████ | 1201/1497 [11:00<02:32, 1.94it/s]epoch: 2 loss: 0.0324815 f1: 0.7534884: 80%|████████ | 1201/1497 [11:00<02:32, 1.94it/s]epoch: 2 loss: 0.0324815 f1: 0.7534884: 80%|████████ | 
1202/1497 [11:00<02:32, 1.94it/s]epoch: 2 loss: 0.0136888 f1: 0.7534884: 80%|████████ | 1202/1497 [11:01<02:32, 1.94it/s]epoch: 2 loss: 0.0136888 f1: 0.7534884: 80%|████████ | 1203/1497 [11:01<02:31, 1.95it/s]epoch: 2 loss: 0.0237447 f1: 0.7534884: 80%|████████ | 1203/1497 [11:01<02:31, 1.95it/s]epoch: 2 loss: 0.0237447 f1: 0.7534884: 80%|████████ | 1204/1497 [11:01<02:30, 1.95it/s]epoch: 2 loss: 0.0358624 f1: 0.7534884: 80%|████████ | 1204/1497 [11:02<02:30, 1.95it/s]epoch: 2 loss: 0.0358624 f1: 0.7534884: 80%|████████ | 1205/1497 [11:02<02:30, 1.94it/s]epoch: 2 loss: 0.0098115 f1: 0.7534884: 80%|████████ | 1205/1497 [11:02<02:30, 1.94it/s]epoch: 2 loss: 0.0098115 f1: 0.7534884: 81%|████████ | 1206/1497 [11:02<02:29, 1.95it/s]epoch: 2 loss: 0.0655041 f1: 0.7534884: 81%|████████ | 1206/1497 [11:03<02:29, 1.95it/s]epoch: 2 loss: 0.0655041 f1: 0.7534884: 81%|████████ | 1207/1497 [11:03<02:28, 1.95it/s]epoch: 2 loss: 0.0658078 f1: 0.7534884: 81%|████████ | 1207/1497 [11:03<02:28, 1.95it/s]epoch: 2 loss: 0.0658078 f1: 0.7534884: 81%|████████ | 1208/1497 [11:03<02:26, 1.97it/s]epoch: 2 loss: 0.0174894 f1: 0.7534884: 81%|████████ | 1208/1497 [11:04<02:26, 1.97it/s]epoch: 2 loss: 0.0174894 f1: 0.7534884: 81%|████████ | 1209/1497 [11:04<02:24, 1.99it/s]epoch: 2 loss: 0.0834070 f1: 0.7534884: 81%|████████ | 1209/1497 [11:04<02:24, 1.99it/s]epoch: 2 loss: 0.0834070 f1: 0.7534884: 81%|████████ | 1210/1497 [11:04<02:23, 1.99it/s]epoch: 2 loss: 0.0162443 f1: 0.7534884: 81%|████████ | 1210/1497 [11:05<02:23, 1.99it/s]epoch: 2 loss: 0.0162443 f1: 0.7534884: 81%|████████ | 1211/1497 [11:05<02:22, 2.00it/s]epoch: 2 loss: 0.0660471 f1: 0.7534884: 81%|████████ | 1211/1497 [11:05<02:22, 2.00it/s]epoch: 2 loss: 0.0660471 f1: 0.7534884: 81%|████████ | 1212/1497 [11:05<02:22, 2.00it/s]epoch: 2 loss: 0.0084409 f1: 0.7534884: 81%|████████ | 1212/1497 [11:06<02:22, 2.00it/s]epoch: 2 loss: 0.0084409 f1: 0.7534884: 81%|████████ | 1213/1497 [11:06<02:22, 2.00it/s]epoch: 2 loss: 0.1396146 f1: 
0.7534884: 81%|████████ | 1213/1497 [11:06<02:22, 2.00it/s]epoch: 2 loss: 0.1396146 f1: 0.7534884: 81%|████████ | 1214/1497 [11:06<02:24, 1.95it/s]epoch: 2 loss: 0.0320664 f1: 0.7534884: 81%|████████ | 1214/1497 [11:07<02:24, 1.95it/s]epoch: 2 loss: 0.0320664 f1: 0.7534884: 81%|████████ | 1215/1497 [11:07<02:23, 1.96it/s]epoch: 2 loss: 0.1107412 f1: 0.7534884: 81%|████████ | 1215/1497 [11:07<02:23, 1.96it/s]epoch: 2 loss: 0.1107412 f1: 0.7534884: 81%|████████ | 1216/1497 [11:07<02:22, 1.97it/s]epoch: 2 loss: 0.0155558 f1: 0.7534884: 81%|████████ | 1216/1497 [11:08<02:22, 1.97it/s]epoch: 2 loss: 0.0155558 f1: 0.7534884: 81%|████████▏ | 1217/1497 [11:08<02:21, 1.98it/s]epoch: 2 loss: 0.0453105 f1: 0.7534884: 81%|████████▏ | 1217/1497 [11:08<02:21, 1.98it/s]epoch: 2 loss: 0.0453105 f1: 0.7534884: 81%|████████▏ | 1218/1497 [11:08<02:20, 1.98it/s]epoch: 2 loss: 0.0150995 f1: 0.7534884: 81%|████████▏ | 1218/1497 [11:09<02:20, 1.98it/s]epoch: 2 loss: 0.0150995 f1: 0.7534884: 81%|████████▏ | 1219/1497 [11:09<02:20, 1.98it/s]epoch: 2 loss: 0.0314730 f1: 0.7534884: 81%|████████▏ | 1219/1497 [11:09<02:20, 1.98it/s]epoch: 2 loss: 0.0314730 f1: 0.7534884: 81%|████████▏ | 1220/1497 [11:09<02:18, 1.99it/s]epoch: 2 loss: 0.0304422 f1: 0.7534884: 81%|████████▏ | 1220/1497 [11:10<02:18, 1.99it/s]epoch: 2 loss: 0.0304422 f1: 0.7534884: 82%|████████▏ | 1221/1497 [11:10<02:17, 2.01it/s]epoch: 2 loss: 0.0257934 f1: 0.7534884: 82%|████████▏ | 1221/1497 [11:10<02:17, 2.01it/s]epoch: 2 loss: 0.0257934 f1: 0.7534884: 82%|████████▏ | 1222/1497 [11:10<02:17, 2.00it/s]epoch: 2 loss: 0.0101562 f1: 0.7534884: 82%|████████▏ | 1222/1497 [11:11<02:17, 2.00it/s]epoch: 2 loss: 0.0101562 f1: 0.7534884: 82%|████████▏ | 1223/1497 [11:11<02:16, 2.01it/s]epoch: 2 loss: 0.0080202 f1: 0.7534884: 82%|████████▏ | 1223/1497 [11:11<02:16, 2.01it/s]epoch: 2 loss: 0.0080202 f1: 0.7534884: 82%|████████▏ | 1224/1497 [11:11<02:15, 2.02it/s]epoch: 2 loss: 0.0434022 f1: 0.7534884: 82%|████████▏ | 1224/1497 
[11:12<02:15, 2.02it/s]epoch: 2 loss: 0.0434022 f1: 0.7534884: 82%|████████▏ | 1225/1497 [11:12<02:15, 2.01it/s]epoch: 2 loss: 0.0099115 f1: 0.7534884: 82%|████████▏ | 1225/1497 [11:12<02:15, 2.01it/s]epoch: 2 loss: 0.0099115 f1: 0.7534884: 82%|████████▏ | 1226/1497 [11:12<02:14, 2.01it/s]epoch: 2 loss: 0.0248981 f1: 0.7534884: 82%|████████▏ | 1226/1497 [11:13<02:14, 2.01it/s]epoch: 2 loss: 0.0248981 f1: 0.7534884: 82%|████████▏ | 1227/1497 [11:13<02:15, 2.00it/s]epoch: 2 loss: 0.0577389 f1: 0.7534884: 82%|████████▏ | 1227/1497 [11:13<02:15, 2.00it/s]epoch: 2 loss: 0.0577389 f1: 0.7534884: 82%|████████▏ | 1228/1497 [11:13<02:15, 1.98it/s]epoch: 2 loss: 0.0815874 f1: 0.7534884: 82%|████████▏ | 1228/1497 [11:14<02:15, 1.98it/s]epoch: 2 loss: 0.0815874 f1: 0.7534884: 82%|████████▏ | 1229/1497 [11:14<02:15, 1.98it/s]epoch: 2 loss: 0.0720370 f1: 0.7534884: 82%|████████▏ | 1229/1497 [11:14<02:15, 1.98it/s]epoch: 2 loss: 0.0720370 f1: 0.7534884: 82%|████████▏ | 1230/1497 [11:14<02:14, 1.99it/s]epoch: 2 loss: 0.1591252 f1: 0.7534884: 82%|████████▏ | 1230/1497 [11:15<02:14, 1.99it/s]epoch: 2 loss: 0.1591252 f1: 0.7534884: 82%|████████▏ | 1231/1497 [11:15<02:13, 1.99it/s]epoch: 2 loss: 0.1168299 f1: 0.7534884: 82%|████████▏ | 1231/1497 [11:15<02:13, 1.99it/s]epoch: 2 loss: 0.1168299 f1: 0.7534884: 82%|████████▏ | 1232/1497 [11:15<02:13, 1.98it/s]epoch: 2 loss: 0.0161805 f1: 0.7534884: 82%|████████▏ | 1232/1497 [11:16<02:13, 1.98it/s]epoch: 2 loss: 0.0161805 f1: 0.7534884: 82%|████████▏ | 1233/1497 [11:16<02:14, 1.97it/s]epoch: 2 loss: 0.0794459 f1: 0.7534884: 82%|████████▏ | 1233/1497 [11:16<02:14, 1.97it/s]epoch: 2 loss: 0.0794459 f1: 0.7534884: 82%|████████▏ | 1234/1497 [11:16<02:14, 1.96it/s]epoch: 2 loss: 0.0311683 f1: 0.7534884: 82%|████████▏ | 1234/1497 [11:17<02:14, 1.96it/s]epoch: 2 loss: 0.0311683 f1: 0.7534884: 82%|████████▏ | 1235/1497 [11:17<02:13, 1.96it/s]epoch: 2 loss: 0.0223401 f1: 0.7534884: 82%|████████▏ | 1235/1497 [11:17<02:13, 1.96it/s]epoch: 2 loss: 
0.0223401 f1: 0.7534884: 83%|████████▎ | 1236/1497 [11:17<02:13, 1.96it/s]epoch: 2 loss: 0.0321761 f1: 0.7534884: 83%|████████▎ | 1236/1497 [11:18<02:13, 1.96it/s]epoch: 2 loss: 0.0321761 f1: 0.7534884: 83%|████████▎ | 1237/1497 [11:18<02:12, 1.96it/s]epoch: 2 loss: 0.1108131 f1: 0.7534884: 83%|████████▎ | 1237/1497 [11:18<02:12, 1.96it/s]epoch: 2 loss: 0.1108131 f1: 0.7534884: 83%|████████▎ | 1238/1497 [11:18<02:11, 1.97it/s]epoch: 2 loss: 0.0892392 f1: 0.7534884: 83%|████████▎ | 1238/1497 [11:19<02:11, 1.97it/s]epoch: 2 loss: 0.0892392 f1: 0.7534884: 83%|████████▎ | 1239/1497 [11:19<02:11, 1.97it/s]epoch: 2 loss: 0.0667490 f1: 0.7534884: 83%|████████▎ | 1239/1497 [11:19<02:11, 1.97it/s]epoch: 2 loss: 0.0667490 f1: 0.7534884: 83%|████████▎ | 1240/1497 [11:19<02:10, 1.97it/s]epoch: 2 loss: 0.0426746 f1: 0.7534884: 83%|████████▎ | 1240/1497 [11:20<02:10, 1.97it/s]epoch: 2 loss: 0.0426746 f1: 0.7534884: 83%|████████▎ | 1241/1497 [11:20<02:09, 1.97it/s]epoch: 2 loss: 0.2227785 f1: 0.7534884: 83%|████████▎ | 1241/1497 [11:20<02:09, 1.97it/s]epoch: 2 loss: 0.2227785 f1: 0.7534884: 83%|████████▎ | 1242/1497 [11:20<02:10, 1.96it/s]epoch: 2 loss: 0.0572099 f1: 0.7534884: 83%|████████▎ | 1242/1497 [11:21<02:10, 1.96it/s]epoch: 2 loss: 0.0572099 f1: 0.7534884: 83%|████████▎ | 1243/1497 [11:21<02:09, 1.97it/s]epoch: 2 loss: 0.0158231 f1: 0.7534884: 83%|████████▎ | 1243/1497 [11:21<02:09, 1.97it/s]epoch: 2 loss: 0.0158231 f1: 0.7534884: 83%|████████▎ | 1244/1497 [11:21<02:08, 1.97it/s]epoch: 2 loss: 0.0276767 f1: 0.7534884: 83%|████████▎ | 1244/1497 [11:22<02:08, 1.97it/s]epoch: 2 loss: 0.0276767 f1: 0.7534884: 83%|████████▎ | 1245/1497 [11:22<02:08, 1.96it/s]epoch: 2 loss: 0.0077493 f1: 0.7534884: 83%|████████▎ | 1245/1497 [11:22<02:08, 1.96it/s]epoch: 2 loss: 0.0077493 f1: 0.7534884: 83%|████████▎ | 1246/1497 [11:22<02:08, 1.95it/s]epoch: 2 loss: 0.0659272 f1: 0.7534884: 83%|████████▎ | 1246/1497 [11:23<02:08, 1.95it/s]epoch: 2 loss: 0.0659272 f1: 0.7534884: 83%|████████▎ | 
1247/1497 [11:23<02:07, 1.96it/s]epoch: 2 loss: 0.0302929 f1: 0.7534884: 83%|████████▎ | 1247/1497 [11:23<02:07, 1.96it/s]epoch: 2 loss: 0.0302929 f1: 0.7534884: 83%|████████▎ | 1248/1497 [11:23<02:07, 1.96it/s]epoch: 2 loss: 0.0699513 f1: 0.7534884: 83%|████████▎ | 1248/1497 [11:24<02:07, 1.96it/s]epoch: 2 loss: 0.0699513 f1: 0.7534884: 83%|████████▎ | 1249/1497 [11:24<02:05, 1.97it/s]epoch: 2 loss: 0.0583774 f1: 0.7534884: 83%|████████▎ | 1249/1497 [11:24<02:05, 1.97it/s]epoch: 2 loss: 0.0583774 f1: 0.7534884: 84%|████████▎ | 1250/1497 [11:24<02:05, 1.97it/s]epoch: 2 loss: 0.1351787 f1: 0.7534884: 84%|████████▎ | 1250/1497 [11:25<02:05, 1.97it/s]epoch: 2 loss: 0.1351787 f1: 0.7534884: 84%|████████▎ | 1251/1497 [11:25<02:05, 1.97it/s]epoch: 2 loss: 0.1027923 f1: 0.7534884: 84%|████████▎ | 1251/1497 [11:25<02:05, 1.97it/s]epoch: 2 loss: 0.1027923 f1: 0.7534884: 84%|████████▎ | 1252/1497 [11:25<02:04, 1.97it/s]epoch: 2 loss: 0.1001146 f1: 0.7534884: 84%|████████▎ | 1252/1497 [11:26<02:04, 1.97it/s]epoch: 2 loss: 0.1001146 f1: 0.7534884: 84%|████████▎ | 1253/1497 [11:26<02:04, 1.97it/s]epoch: 2 loss: 0.0569644 f1: 0.7534884: 84%|████████▎ | 1253/1497 [11:26<02:04, 1.97it/s]epoch: 2 loss: 0.0569644 f1: 0.7534884: 84%|████████▍ | 1254/1497 [11:26<02:03, 1.97it/s]epoch: 2 loss: 0.0152554 f1: 0.7534884: 84%|████████▍ | 1254/1497 [11:27<02:03, 1.97it/s]epoch: 2 loss: 0.0152554 f1: 0.7534884: 84%|████████▍ | 1255/1497 [11:27<02:05, 1.92it/s]epoch: 2 loss: 0.1590813 f1: 0.7534884: 84%|████████▍ | 1255/1497 [11:27<02:05, 1.92it/s]epoch: 2 loss: 0.1590813 f1: 0.7534884: 84%|████████▍ | 1256/1497 [11:27<02:04, 1.94it/s]epoch: 2 loss: 0.1052238 f1: 0.7534884: 84%|████████▍ | 1256/1497 [11:28<02:04, 1.94it/s]epoch: 2 loss: 0.1052238 f1: 0.7534884: 84%|████████▍ | 1257/1497 [11:28<02:03, 1.95it/s]epoch: 2 loss: 0.0295312 f1: 0.7534884: 84%|████████▍ | 1257/1497 [11:28<02:03, 1.95it/s]epoch: 2 loss: 0.0295312 f1: 0.7534884: 84%|████████▍ | 1258/1497 [11:28<02:02, 1.95it/s]epoch: 2 
loss: 0.0037559 f1: 0.7534884: 84%|████████▍ | 1258/1497 [11:29<02:02, 1.95it/s]epoch: 2 loss: 0.0037559 f1: 0.7534884: 84%|████████▍ | 1259/1497 [11:29<02:01, 1.96it/s]epoch: 2 loss: 0.0645773 f1: 0.7534884: 84%|████████▍ | 1259/1497 [11:30<02:01, 1.96it/s]epoch: 2 loss: 0.0645773 f1: 0.7534884: 84%|████████▍ | 1260/1497 [11:30<02:00, 1.96it/s]epoch: 2 loss: 0.2123787 f1: 0.7534884: 84%|████████▍ | 1260/1497 [11:30<02:00, 1.96it/s]epoch: 2 loss: 0.2123787 f1: 0.7534884: 84%|████████▍ | 1261/1497 [11:30<02:00, 1.97it/s]epoch: 2 loss: 0.0250967 f1: 0.7534884: 84%|████████▍ | 1261/1497 [11:31<02:00, 1.97it/s]epoch: 2 loss: 0.0250967 f1: 0.7534884: 84%|████████▍ | 1262/1497 [11:31<01:59, 1.96it/s]epoch: 2 loss: 0.1383885 f1: 0.7534884: 84%|████████▍ | 1262/1497 [11:31<01:59, 1.96it/s]epoch: 2 loss: 0.1383885 f1: 0.7534884: 84%|████████▍ | 1263/1497 [11:31<01:59, 1.96it/s]epoch: 2 loss: 0.0204152 f1: 0.7534884: 84%|████████▍ | 1263/1497 [11:32<01:59, 1.96it/s]epoch: 2 loss: 0.0204152 f1: 0.7534884: 84%|████████▍ | 1264/1497 [11:32<01:59, 1.96it/s]epoch: 2 loss: 0.0080053 f1: 0.7534884: 84%|████████▍ | 1264/1497 [11:32<01:59, 1.96it/s]epoch: 2 loss: 0.0080053 f1: 0.7534884: 85%|████████▍ | 1265/1497 [11:32<01:58, 1.96it/s]epoch: 2 loss: 0.0141333 f1: 0.7534884: 85%|████████▍ | 1265/1497 [11:33<01:58, 1.96it/s]epoch: 2 loss: 0.0141333 f1: 0.7534884: 85%|████████▍ | 1266/1497 [11:33<01:58, 1.96it/s]epoch: 2 loss: 0.0641509 f1: 0.7534884: 85%|████████▍ | 1266/1497 [11:33<01:58, 1.96it/s]epoch: 2 loss: 0.0641509 f1: 0.7534884: 85%|████████▍ | 1267/1497 [11:33<01:57, 1.96it/s]epoch: 2 loss: 0.0214351 f1: 0.7534884: 85%|████████▍ | 1267/1497 [11:34<01:57, 1.96it/s]epoch: 2 loss: 0.0214351 f1: 0.7534884: 85%|████████▍ | 1268/1497 [11:34<01:56, 1.96it/s]epoch: 2 loss: 0.0083630 f1: 0.7534884: 85%|████████▍ | 1268/1497 [11:34<01:56, 1.96it/s]epoch: 2 loss: 0.0083630 f1: 0.7534884: 85%|████████▍ | 1269/1497 [11:34<01:56, 1.97it/s]epoch: 2 loss: 0.0040333 f1: 0.7534884: 
85%|████████▍ | 1269/1497 [11:35<01:56, 1.97it/s]epoch: 2 loss: 0.0040333 f1: 0.7534884: 85%|████████▍ | 1270/1497 [11:35<01:55, 1.97it/s]epoch: 2 loss: 0.0138681 f1: 0.7534884: 85%|████████▍ | 1270/1497 [11:35<01:55, 1.97it/s]epoch: 2 loss: 0.0138681 f1: 0.7534884: 85%|████████▍ | 1271/1497 [11:35<01:54, 1.98it/s]epoch: 2 loss: 0.0028504 f1: 0.7534884: 85%|████████▍ | 1271/1497 [11:36<01:54, 1.98it/s]epoch: 2 loss: 0.0028504 f1: 0.7534884: 85%|████████▍ | 1272/1497 [11:36<01:53, 1.98it/s]epoch: 2 loss: 0.0066893 f1: 0.7534884: 85%|████████▍ | 1272/1497 [11:36<01:53, 1.98it/s]epoch: 2 loss: 0.0066893 f1: 0.7534884: 85%|████████▌ | 1273/1497 [11:36<01:53, 1.97it/s]epoch: 2 loss: 0.0148753 f1: 0.7534884: 85%|████████▌ | 1273/1497 [11:37<01:53, 1.97it/s]epoch: 2 loss: 0.0148753 f1: 0.7534884: 85%|████████▌ | 1274/1497 [11:37<01:53, 1.97it/s]epoch: 2 loss: 0.0071555 f1: 0.7534884: 85%|████████▌ | 1274/1497 [11:37<01:53, 1.97it/s]epoch: 2 loss: 0.0071555 f1: 0.7534884: 85%|████████▌ | 1275/1497 [11:37<01:53, 1.96it/s]epoch: 2 loss: 0.0475193 f1: 0.7534884: 85%|████████▌ | 1275/1497 [11:38<01:53, 1.96it/s]epoch: 2 loss: 0.0475193 f1: 0.7534884: 85%|████████▌ | 1276/1497 [11:38<01:52, 1.97it/s]epoch: 2 loss: 0.0638888 f1: 0.7534884: 85%|████████▌ | 1276/1497 [11:38<01:52, 1.97it/s]epoch: 2 loss: 0.0638888 f1: 0.7534884: 85%|████████▌ | 1277/1497 [11:38<01:51, 1.98it/s]epoch: 2 loss: 0.0404732 f1: 0.7534884: 85%|████████▌ | 1277/1497 [11:39<01:51, 1.98it/s]epoch: 2 loss: 0.0404732 f1: 0.7534884: 85%|████████▌ | 1278/1497 [11:39<01:50, 1.99it/s]epoch: 2 loss: 0.0736104 f1: 0.7534884: 85%|████████▌ | 1278/1497 [11:39<01:50, 1.99it/s]epoch: 2 loss: 0.0736104 f1: 0.7534884: 85%|████████▌ | 1279/1497 [11:39<01:48, 2.00it/s]epoch: 2 loss: 0.0490354 f1: 0.7534884: 85%|████████▌ | 1279/1497 [11:40<01:48, 2.00it/s]epoch: 2 loss: 0.0490354 f1: 0.7534884: 86%|████████▌ | 1280/1497 [11:40<01:48, 2.01it/s]epoch: 2 loss: 0.0693171 f1: 0.7534884: 86%|████████▌ | 1280/1497 [11:40<01:48, 
2.01it/s]epoch: 2 loss: 0.0693171 f1: 0.7534884: 86%|████████▌ | 1281/1497 [11:40<01:46, 2.02it/s]epoch: 2 loss: 0.0103109 f1: 0.7534884: 86%|████████▌ | 1281/1497 [11:41<01:46, 2.02it/s]epoch: 2 loss: 0.0103109 f1: 0.7534884: 86%|████████▌ | 1282/1497 [11:41<01:46, 2.01it/s]epoch: 2 loss: 0.0636699 f1: 0.7534884: 86%|████████▌ | 1282/1497 [11:41<01:46, 2.01it/s]epoch: 2 loss: 0.0636699 f1: 0.7534884: 86%|████████▌ | 1283/1497 [11:41<01:45, 2.02it/s]epoch: 2 loss: 0.0050702 f1: 0.7534884: 86%|████████▌ | 1283/1497 [11:42<01:45, 2.02it/s]epoch: 2 loss: 0.0050702 f1: 0.7534884: 86%|████████▌ | 1284/1497 [11:42<01:45, 2.02it/s]epoch: 2 loss: 0.0743762 f1: 0.7534884: 86%|████████▌ | 1284/1497 [11:42<01:45, 2.02it/s]epoch: 2 loss: 0.0743762 f1: 0.7534884: 86%|████████▌ | 1285/1497 [11:42<01:44, 2.02it/s]epoch: 2 loss: 0.1629817 f1: 0.7534884: 86%|████████▌ | 1285/1497 [11:43<01:44, 2.02it/s]epoch: 2 loss: 0.1629817 f1: 0.7534884: 86%|████████▌ | 1286/1497 [11:43<01:45, 2.01it/s]epoch: 2 loss: 0.0230423 f1: 0.7534884: 86%|████████▌ | 1286/1497 [11:43<01:45, 2.01it/s]epoch: 2 loss: 0.0230423 f1: 0.7534884: 86%|████████▌ | 1287/1497 [11:43<01:45, 1.99it/s]epoch: 2 loss: 0.0079680 f1: 0.7534884: 86%|████████▌ | 1287/1497 [11:44<01:45, 1.99it/s]epoch: 2 loss: 0.0079680 f1: 0.7534884: 86%|████████▌ | 1288/1497 [11:44<01:45, 1.98it/s]epoch: 2 loss: 0.0784445 f1: 0.7534884: 86%|████████▌ | 1288/1497 [11:44<01:45, 1.98it/s]epoch: 2 loss: 0.0784445 f1: 0.7534884: 86%|████████▌ | 1289/1497 [11:44<01:44, 1.99it/s]epoch: 2 loss: 0.0130782 f1: 0.7534884: 86%|████████▌ | 1289/1497 [11:45<01:44, 1.99it/s]epoch: 2 loss: 0.0130782 f1: 0.7534884: 86%|████████▌ | 1290/1497 [11:45<01:44, 1.98it/s]epoch: 2 loss: 0.0071584 f1: 0.7534884: 86%|████████▌ | 1290/1497 [11:45<01:44, 1.98it/s]epoch: 2 loss: 0.0071584 f1: 0.7534884: 86%|████████▌ | 1291/1497 [11:45<01:43, 1.99it/s]epoch: 2 loss: 0.1398070 f1: 0.7534884: 86%|████████▌ | 1291/1497 [11:46<01:43, 1.99it/s]epoch: 2 loss: 0.1398070 f1: 
0.7534884: 86%|████████▋ | 1292/1497 [11:46<01:43, 1.97it/s]epoch: 2 loss: 0.1263616 f1: 0.7534884: 86%|████████▋ | 1292/1497 [11:46<01:43, 1.97it/s]epoch: 2 loss: 0.1263616 f1: 0.7534884: 86%|████████▋ | 1293/1497 [11:46<01:43, 1.96it/s]epoch: 2 loss: 0.1257958 f1: 0.7534884: 86%|████████▋ | 1293/1497 [11:47<01:43, 1.96it/s]epoch: 2 loss: 0.1257958 f1: 0.7534884: 86%|████████▋ | 1294/1497 [11:47<01:44, 1.95it/s]epoch: 2 loss: 0.0214074 f1: 0.7534884: 86%|████████▋ | 1294/1497 [11:47<01:44, 1.95it/s]epoch: 2 loss: 0.0214074 f1: 0.7534884: 87%|████████▋ | 1295/1497 [11:47<01:45, 1.92it/s]epoch: 2 loss: 0.0114854 f1: 0.7534884: 87%|████████▋ | 1295/1497 [11:48<01:45, 1.92it/s]epoch: 2 loss: 0.0114854 f1: 0.7534884: 87%|████████▋ | 1296/1497 [11:48<01:45, 1.90it/s]epoch: 2 loss: 0.0255157 f1: 0.7534884: 87%|████████▋ | 1296/1497 [11:48<01:45, 1.90it/s]epoch: 2 loss: 0.0255157 f1: 0.7534884: 87%|████████▋ | 1297/1497 [11:48<01:44, 1.92it/s]epoch: 2 loss: 0.0781200 f1: 0.7534884: 87%|████████▋ | 1297/1497 [11:49<01:44, 1.92it/s]epoch: 2 loss: 0.0781200 f1: 0.7534884: 87%|████████▋ | 1298/1497 [11:49<01:43, 1.93it/s]epoch: 2 loss: 0.0200658 f1: 0.7534884: 87%|████████▋ | 1298/1497 [11:49<01:43, 1.93it/s]epoch: 2 loss: 0.0200658 f1: 0.7534884: 87%|████████▋ | 1299/1497 [11:49<01:41, 1.94it/s]epoch: 2 loss: 0.0146047 f1: 0.7534884: 87%|████████▋ | 1299/1497 [11:50<01:41, 1.94it/s]epoch: 2 loss: 0.0146047 f1: 0.7534884: 87%|████████▋ | 1300/1497 [11:50<01:41, 1.95it/s]epoch: 2 loss: 0.1340806 f1: 0.7534884: 87%|████████▋ | 1300/1497 [11:50<01:41, 1.95it/s]epoch: 2 loss: 0.1340806 f1: 0.7534884: 87%|████████▋ | 1301/1497 [11:50<01:40, 1.96it/s]epoch: 2 loss: 0.0605787 f1: 0.7534884: 87%|████████▋ | 1301/1497 [11:51<01:40, 1.96it/s]epoch: 2 loss: 0.0605787 f1: 0.7534884: 87%|████████▋ | 1302/1497 [11:51<01:39, 1.95it/s]epoch: 2 loss: 0.0063815 f1: 0.7534884: 87%|████████▋ | 1302/1497 [11:51<01:39, 1.95it/s]epoch: 2 loss: 0.0063815 f1: 0.7534884: 87%|████████▋ | 1303/1497 
[11:51<01:39, 1.95it/s]epoch: 2 loss: 0.0636539 f1: 0.7534884: 87%|████████▋ | 1303/1497 [11:52<01:39, 1.95it/s]epoch: 2 loss: 0.0636539 f1: 0.7534884: 87%|████████▋ | 1304/1497 [11:52<01:39, 1.94it/s]epoch: 2 loss: 0.0418624 f1: 0.7534884: 87%|████████▋ | 1304/1497 [11:52<01:39, 1.94it/s]epoch: 2 loss: 0.0418624 f1: 0.7534884: 87%|████████▋ | 1305/1497 [11:52<01:39, 1.93it/s]epoch: 2 loss: 0.1857270 f1: 0.7534884: 87%|████████▋ | 1305/1497 [11:53<01:39, 1.93it/s]epoch: 2 loss: 0.1857270 f1: 0.7534884: 87%|████████▋ | 1306/1497 [11:53<01:38, 1.93it/s]epoch: 2 loss: 0.1049277 f1: 0.7534884: 87%|████████▋ | 1306/1497 [11:53<01:38, 1.93it/s]epoch: 2 loss: 0.1049277 f1: 0.7534884: 87%|████████▋ | 1307/1497 [11:53<01:38, 1.93it/s]epoch: 2 loss: 0.0529954 f1: 0.7534884: 87%|████████▋ | 1307/1497 [11:54<01:38, 1.93it/s]epoch: 2 loss: 0.0529954 f1: 0.7534884: 87%|████████▋ | 1308/1497 [11:54<01:38, 1.93it/s]epoch: 2 loss: 0.0230357 f1: 0.7534884: 87%|████████▋ | 1308/1497 [11:54<01:38, 1.93it/s]epoch: 2 loss: 0.0230357 f1: 0.7534884: 87%|████████▋ | 1309/1497 [11:54<01:37, 1.93it/s]epoch: 2 loss: 0.0174185 f1: 0.7534884: 87%|████████▋ | 1309/1497 [11:55<01:37, 1.93it/s]epoch: 2 loss: 0.0174185 f1: 0.7534884: 88%|████████▊ | 1310/1497 [11:55<01:36, 1.94it/s]epoch: 2 loss: 0.1181877 f1: 0.7534884: 88%|████████▊ | 1310/1497 [11:55<01:36, 1.94it/s]epoch: 2 loss: 0.1181877 f1: 0.7534884: 88%|████████▊ | 1311/1497 [11:55<01:35, 1.95it/s]epoch: 2 loss: 0.0598124 f1: 0.7534884: 88%|████████▊ | 1311/1497 [11:56<01:35, 1.95it/s]epoch: 2 loss: 0.0598124 f1: 0.7534884: 88%|████████▊ | 1312/1497 [11:56<01:34, 1.96it/s]epoch: 2 loss: 0.0677690 f1: 0.7534884: 88%|████████▊ | 1312/1497 [11:56<01:34, 1.96it/s]epoch: 2 loss: 0.0677690 f1: 0.7534884: 88%|████████▊ | 1313/1497 [11:56<01:34, 1.95it/s]epoch: 2 loss: 0.0804777 f1: 0.7534884: 88%|████████▊ | 1313/1497 [11:57<01:34, 1.95it/s]epoch: 2 loss: 0.0804777 f1: 0.7534884: 88%|████████▊ | 1314/1497 [11:57<01:34, 1.93it/s]epoch: 2 loss: 
0.0780268 f1: 0.7534884: 88%|████████▊ | 1314/1497 [11:58<01:34, 1.93it/s]epoch: 2 loss: 0.0780268 f1: 0.7534884: 88%|████████▊ | 1315/1497 [11:58<01:34, 1.93it/s]epoch: 2 loss: 0.0040932 f1: 0.7534884: 88%|████████▊ | 1315/1497 [11:58<01:34, 1.93it/s]epoch: 2 loss: 0.0040932 f1: 0.7534884: 88%|████████▊ | 1316/1497 [11:58<01:35, 1.90it/s]epoch: 2 loss: 0.0111809 f1: 0.7534884: 88%|████████▊ | 1316/1497 [11:59<01:35, 1.90it/s]epoch: 2 loss: 0.0111809 f1: 0.7534884: 88%|████████▊ | 1317/1497 [11:59<01:34, 1.90it/s]epoch: 2 loss: 0.0250803 f1: 0.7534884: 88%|████████▊ | 1317/1497 [11:59<01:34, 1.90it/s]epoch: 2 loss: 0.0250803 f1: 0.7534884: 88%|████████▊ | 1318/1497 [11:59<01:33, 1.90it/s]epoch: 2 loss: 0.0418717 f1: 0.7534884: 88%|████████▊ | 1318/1497 [12:00<01:33, 1.90it/s]epoch: 2 loss: 0.0418717 f1: 0.7534884: 88%|████████▊ | 1319/1497 [12:00<01:32, 1.92it/s]epoch: 2 loss: 0.0229323 f1: 0.7534884: 88%|████████▊ | 1319/1497 [12:00<01:32, 1.92it/s]epoch: 2 loss: 0.0229323 f1: 0.7534884: 88%|████████▊ | 1320/1497 [12:00<01:30, 1.95it/s]epoch: 2 loss: 0.1763228 f1: 0.7534884: 88%|████████▊ | 1320/1497 [12:01<01:30, 1.95it/s]epoch: 2 loss: 0.1763228 f1: 0.7534884: 88%|████████▊ | 1321/1497 [12:01<01:28, 1.98it/s]epoch: 2 loss: 0.0710974 f1: 0.7534884: 88%|████████▊ | 1321/1497 [12:01<01:28, 1.98it/s]epoch: 2 loss: 0.0710974 f1: 0.7534884: 88%|████████▊ | 1322/1497 [12:01<01:27, 2.00it/s]epoch: 2 loss: 0.0082498 f1: 0.7534884: 88%|████████▊ | 1322/1497 [12:02<01:27, 2.00it/s]epoch: 2 loss: 0.0082498 f1: 0.7534884: 88%|████████▊ | 1323/1497 [12:02<01:27, 2.00it/s]epoch: 2 loss: 0.0083826 f1: 0.7534884: 88%|████████▊ | 1323/1497 [12:02<01:27, 2.00it/s]epoch: 2 loss: 0.0083826 f1: 0.7534884: 88%|████████▊ | 1324/1497 [12:02<01:27, 1.99it/s]epoch: 2 loss: 0.0100527 f1: 0.7534884: 88%|████████▊ | 1324/1497 [12:03<01:27, 1.99it/s]epoch: 2 loss: 0.0100527 f1: 0.7534884: 89%|████████▊ | 1325/1497 [12:03<01:26, 1.99it/s]epoch: 2 loss: 0.0399602 f1: 0.7534884: 89%|████████▊ | 
1325/1497 [12:03<01:26, 1.99it/s]epoch: 2 loss: 0.0399602 f1: 0.7534884: 89%|████████▊ | 1326/1497 [12:03<01:26, 1.98it/s]epoch: 2 loss: 0.0186436 f1: 0.7534884: 89%|████████▊ | 1326/1497 [12:04<01:26, 1.98it/s]epoch: 2 loss: 0.0186436 f1: 0.7534884: 89%|████████▊ | 1327/1497 [12:04<01:26, 1.97it/s]epoch: 2 loss: 0.1707716 f1: 0.7534884: 89%|████████▊ | 1327/1497 [12:04<01:26, 1.97it/s]epoch: 2 loss: 0.1707716 f1: 0.7534884: 89%|████████▊ | 1328/1497 [12:04<01:25, 1.97it/s]epoch: 2 loss: 0.0315336 f1: 0.7534884: 89%|████████▊ | 1328/1497 [12:05<01:25, 1.97it/s]epoch: 2 loss: 0.0315336 f1: 0.7534884: 89%|████████▉ | 1329/1497 [12:05<01:25, 1.97it/s]epoch: 2 loss: 0.0625530 f1: 0.7534884: 89%|████████▉ | 1329/1497 [12:05<01:25, 1.97it/s]epoch: 2 loss: 0.0625530 f1: 0.7534884: 89%|████████▉ | 1330/1497 [12:05<01:24, 1.97it/s]epoch: 2 loss: 0.0163815 f1: 0.7534884: 89%|████████▉ | 1330/1497 [12:06<01:24, 1.97it/s]epoch: 2 loss: 0.0163815 f1: 0.7534884: 89%|████████▉ | 1331/1497 [12:06<01:24, 1.98it/s]epoch: 2 loss: 0.0097832 f1: 0.7534884: 89%|████████▉ | 1331/1497 [12:06<01:24, 1.98it/s]epoch: 2 loss: 0.0097832 f1: 0.7534884: 89%|████████▉ | 1332/1497 [12:06<01:23, 1.98it/s]epoch: 2 loss: 0.0152885 f1: 0.7534884: 89%|████████▉ | 1332/1497 [12:07<01:23, 1.98it/s]epoch: 2 loss: 0.0152885 f1: 0.7534884: 89%|████████▉ | 1333/1497 [12:07<01:23, 1.97it/s]epoch: 2 loss: 0.0200703 f1: 0.7534884: 89%|████████▉ | 1333/1497 [12:07<01:23, 1.97it/s]epoch: 2 loss: 0.0200703 f1: 0.7534884: 89%|████████▉ | 1334/1497 [12:07<01:22, 1.97it/s]epoch: 2 loss: 0.0659150 f1: 0.7534884: 89%|████████▉ | 1334/1497 [12:08<01:22, 1.97it/s]epoch: 2 loss: 0.0659150 f1: 0.7534884: 89%|████████▉ | 1335/1497 [12:08<01:21, 1.98it/s]epoch: 2 loss: 0.0413359 f1: 0.7534884: 89%|████████▉ | 1335/1497 [12:08<01:21, 1.98it/s]epoch: 2 loss: 0.0413359 f1: 0.7534884: 89%|████████▉ | 1336/1497 [12:08<01:22, 1.95it/s]epoch: 2 loss: 0.1432622 f1: 0.7534884: 89%|████████▉ | 1336/1497 [12:09<01:22, 1.95it/s]epoch: 2 
loss: 0.1432622 f1: 0.7534884: 89%|████████▉ | 1337/1497 [12:09<01:21, 1.96it/s]epoch: 2 loss: 0.0287093 f1: 0.7534884: 89%|████████▉ | 1337/1497 [12:09<01:21, 1.96it/s]epoch: 2 loss: 0.0287093 f1: 0.7534884: 89%|████████▉ | 1338/1497 [12:09<01:20, 1.97it/s]epoch: 2 loss: 0.0230655 f1: 0.7534884: 89%|████████▉ | 1338/1497 [12:10<01:20, 1.97it/s]epoch: 2 loss: 0.0230655 f1: 0.7534884: 89%|████████▉ | 1339/1497 [12:10<01:19, 1.99it/s]epoch: 2 loss: 0.0651039 f1: 0.7534884: 89%|████████▉ | 1339/1497 [12:10<01:19, 1.99it/s]epoch: 2 loss: 0.0651039 f1: 0.7534884: 90%|████████▉ | 1340/1497 [12:10<01:18, 2.01it/s]epoch: 2 loss: 0.1149397 f1: 0.7534884: 90%|████████▉ | 1340/1497 [12:11<01:18, 2.01it/s]epoch: 2 loss: 0.1149397 f1: 0.7534884: 90%|████████▉ | 1341/1497 [12:11<01:17, 2.02it/s]epoch: 2 loss: 0.0025129 f1: 0.7534884: 90%|████████▉ | 1341/1497 [12:11<01:17, 2.02it/s]epoch: 2 loss: 0.0025129 f1: 0.7534884: 90%|████████▉ | 1342/1497 [12:11<01:16, 2.02it/s]epoch: 2 loss: 0.1042080 f1: 0.7534884: 90%|████████▉ | 1342/1497 [12:12<01:16, 2.02it/s]epoch: 2 loss: 0.1042080 f1: 0.7534884: 90%|████████▉ | 1343/1497 [12:12<01:15, 2.03it/s]epoch: 2 loss: 0.0734522 f1: 0.7534884: 90%|████████▉ | 1343/1497 [12:12<01:15, 2.03it/s]epoch: 2 loss: 0.0734522 f1: 0.7534884: 90%|████████▉ | 1344/1497 [12:12<01:15, 2.04it/s]epoch: 2 loss: 0.0309584 f1: 0.7534884: 90%|████████▉ | 1344/1497 [12:13<01:15, 2.04it/s]epoch: 2 loss: 0.0309584 f1: 0.7534884: 90%|████████▉ | 1345/1497 [12:13<01:14, 2.05it/s]epoch: 2 loss: 0.0549662 f1: 0.7534884: 90%|████████▉ | 1345/1497 [12:13<01:14, 2.05it/s]epoch: 2 loss: 0.0549662 f1: 0.7534884: 90%|████████▉ | 1346/1497 [12:13<01:13, 2.05it/s]epoch: 2 loss: 0.0392405 f1: 0.7534884: 90%|████████▉ | 1346/1497 [12:14<01:13, 2.05it/s]epoch: 2 loss: 0.0392405 f1: 0.7534884: 90%|████████▉ | 1347/1497 [12:14<01:13, 2.05it/s]epoch: 2 loss: 0.0270934 f1: 0.7534884: 90%|████████▉ | 1347/1497 [12:14<01:13, 2.05it/s]epoch: 2 loss: 0.0270934 f1: 0.7534884: 
90%|█████████ | 1348/1497 [12:14<01:12, 2.06it/s]epoch: 2 loss: 0.0189925 f1: 0.7534884: 90%|█████████ | 1348/1497 [12:15<01:12, 2.06it/s]epoch: 2 loss: 0.0189925 f1: 0.7534884: 90%|█████████ | 1349/1497 [12:15<01:11, 2.06it/s]epoch: 2 loss: 0.0625135 f1: 0.7534884: 90%|█████████ | 1349/1497 [12:15<01:11, 2.06it/s]epoch: 2 loss: 0.0625135 f1: 0.7534884: 90%|█████████ | 1350/1497 [12:15<01:11, 2.06it/s]epoch: 2 loss: 0.0071147 f1: 0.7534884: 90%|█████████ | 1350/1497 [12:16<01:11, 2.06it/s]epoch: 2 loss: 0.0071147 f1: 0.7534884: 90%|█████████ | 1351/1497 [12:16<01:11, 2.06it/s]epoch: 2 loss: 0.0149843 f1: 0.7534884: 90%|█████████ | 1351/1497 [12:16<01:11, 2.06it/s]epoch: 2 loss: 0.0149843 f1: 0.7534884: 90%|█████████ | 1352/1497 [12:16<01:11, 2.04it/s]epoch: 2 loss: 0.0871585 f1: 0.7534884: 90%|█████████ | 1352/1497 [12:17<01:11, 2.04it/s]epoch: 2 loss: 0.0871585 f1: 0.7534884: 90%|█████████ | 1353/1497 [12:17<01:11, 2.02it/s]epoch: 2 loss: 0.0396727 f1: 0.7534884: 90%|█████████ | 1353/1497 [12:17<01:11, 2.02it/s]epoch: 2 loss: 0.0396727 f1: 0.7534884: 90%|█████████ | 1354/1497 [12:17<01:11, 2.00it/s]epoch: 2 loss: 0.0565145 f1: 0.7534884: 90%|█████████ | 1354/1497 [12:18<01:11, 2.00it/s]epoch: 2 loss: 0.0565145 f1: 0.7534884: 91%|█████████ | 1355/1497 [12:18<01:10, 2.00it/s]epoch: 2 loss: 0.0547221 f1: 0.7534884: 91%|█████████ | 1355/1497 [12:18<01:10, 2.00it/s]epoch: 2 loss: 0.0547221 f1: 0.7534884: 91%|█████████ | 1356/1497 [12:18<01:09, 2.02it/s]epoch: 2 loss: 0.0027927 f1: 0.7534884: 91%|█████████ | 1356/1497 [12:19<01:09, 2.02it/s]epoch: 2 loss: 0.0027927 f1: 0.7534884: 91%|█████████ | 1357/1497 [12:19<01:08, 2.03it/s]epoch: 2 loss: 0.0123363 f1: 0.7534884: 91%|█████████ | 1357/1497 [12:19<01:08, 2.03it/s]epoch: 2 loss: 0.0123363 f1: 0.7534884: 91%|█████████ | 1358/1497 [12:19<01:08, 2.02it/s]epoch: 2 loss: 0.0398795 f1: 0.7534884: 91%|█████████ | 1358/1497 [12:20<01:08, 2.02it/s]epoch: 2 loss: 0.0398795 f1: 0.7534884: 91%|█████████ | 1359/1497 [12:20<01:08, 
2.02it/s]epoch: 2 loss: 0.0143812 f1: 0.7534884: 91%|█████████ | 1359/1497 [12:20<01:08, 2.02it/s]epoch: 2 loss: 0.0143812 f1: 0.7534884: 91%|█████████ | 1360/1497 [12:20<01:07, 2.03it/s]epoch: 2 loss: 0.0408454 f1: 0.7534884: 91%|█████████ | 1360/1497 [12:21<01:07, 2.03it/s]epoch: 2 loss: 0.0408454 f1: 0.7534884: 91%|█████████ | 1361/1497 [12:21<01:06, 2.04it/s]epoch: 2 loss: 0.0188622 f1: 0.7534884: 91%|█████████ | 1361/1497 [12:21<01:06, 2.04it/s]epoch: 2 loss: 0.0188622 f1: 0.7534884: 91%|█████████ | 1362/1497 [12:21<01:05, 2.05it/s]epoch: 2 loss: 0.0169552 f1: 0.7534884: 91%|█████████ | 1362/1497 [12:21<01:05, 2.05it/s]epoch: 2 loss: 0.0169552 f1: 0.7534884: 91%|█████████ | 1363/1497 [12:21<01:05, 2.06it/s]epoch: 2 loss: 0.0284439 f1: 0.7534884: 91%|█████████ | 1363/1497 [12:22<01:05, 2.06it/s]epoch: 2 loss: 0.0284439 f1: 0.7534884: 91%|█████████ | 1364/1497 [12:22<01:04, 2.06it/s]epoch: 2 loss: 0.0442689 f1: 0.7534884: 91%|█████████ | 1364/1497 [12:22<01:04, 2.06it/s]epoch: 2 loss: 0.0442689 f1: 0.7534884: 91%|█████████ | 1365/1497 [12:22<01:03, 2.06it/s]epoch: 2 loss: 0.0437630 f1: 0.7534884: 91%|█████████ | 1365/1497 [12:23<01:03, 2.06it/s]epoch: 2 loss: 0.0437630 f1: 0.7534884: 91%|█████████ | 1366/1497 [12:23<01:03, 2.06it/s]epoch: 2 loss: 0.0458391 f1: 0.7534884: 91%|█████████ | 1366/1497 [12:23<01:03, 2.06it/s]epoch: 2 loss: 0.0458391 f1: 0.7534884: 91%|█████████▏| 1367/1497 [12:23<01:03, 2.06it/s]epoch: 2 loss: 0.1513015 f1: 0.7534884: 91%|█████████▏| 1367/1497 [12:24<01:03, 2.06it/s]epoch: 2 loss: 0.1513015 f1: 0.7534884: 91%|█████████▏| 1368/1497 [12:24<01:02, 2.07it/s]epoch: 2 loss: 0.0484499 f1: 0.7534884: 91%|█████████▏| 1368/1497 [12:24<01:02, 2.07it/s]epoch: 2 loss: 0.0484499 f1: 0.7534884: 91%|█████████▏| 1369/1497 [12:24<01:02, 2.05it/s]epoch: 2 loss: 0.1042715 f1: 0.7534884: 91%|█████████▏| 1369/1497 [12:25<01:02, 2.05it/s]epoch: 2 loss: 0.1042715 f1: 0.7534884: 92%|█████████▏| 1370/1497 [12:25<01:01, 2.05it/s]epoch: 2 loss: 0.0306718 f1: 
0.7534884: 92%|█████████▏| 1370/1497 [12:25<01:01, 2.05it/s]epoch: 2 loss: 0.0306718 f1: 0.7534884: 92%|█████████▏| 1371/1497 [12:25<01:01, 2.06it/s]epoch: 2 loss: 0.1326022 f1: 0.7534884: 92%|█████████▏| 1371/1497 [12:26<01:01, 2.06it/s]epoch: 2 loss: 0.1326022 f1: 0.7534884: 92%|█████████▏| 1372/1497 [12:26<01:00, 2.06it/s]epoch: 2 loss: 0.0254550 f1: 0.7534884: 92%|█████████▏| 1372/1497 [12:26<01:00, 2.06it/s]epoch: 2 loss: 0.0254550 f1: 0.7534884: 92%|█████████▏| 1373/1497 [12:26<01:00, 2.06it/s]epoch: 2 loss: 0.1400220 f1: 0.7534884: 92%|█████████▏| 1373/1497 [12:27<01:00, 2.06it/s]epoch: 2 loss: 0.1400220 f1: 0.7534884: 92%|█████████▏| 1374/1497 [12:27<01:00, 2.04it/s]epoch: 2 loss: 0.0037926 f1: 0.7534884: 92%|█████████▏| 1374/1497 [12:27<01:00, 2.04it/s]epoch: 2 loss: 0.0037926 f1: 0.7534884: 92%|█████████▏| 1375/1497 [12:27<01:00, 2.02it/s]epoch: 2 loss: 0.0816729 f1: 0.7534884: 92%|█████████▏| 1375/1497 [12:28<01:00, 2.02it/s]epoch: 2 loss: 0.0816729 f1: 0.7534884: 92%|█████████▏| 1376/1497 [12:28<00:59, 2.03it/s]epoch: 2 loss: 0.0211183 f1: 0.7534884: 92%|█████████▏| 1376/1497 [12:28<00:59, 2.03it/s]epoch: 2 loss: 0.0211183 f1: 0.7534884: 92%|█████████▏| 1377/1497 [12:28<00:59, 2.03it/s]epoch: 2 loss: 0.0176284 f1: 0.7534884: 92%|█████████▏| 1377/1497 [12:29<00:59, 2.03it/s]epoch: 2 loss: 0.0176284 f1: 0.7534884: 92%|█████████▏| 1378/1497 [12:29<00:59, 2.01it/s]epoch: 2 loss: 0.0879003 f1: 0.7534884: 92%|█████████▏| 1378/1497 [12:29<00:59, 2.01it/s]epoch: 2 loss: 0.0879003 f1: 0.7534884: 92%|█████████▏| 1379/1497 [12:29<00:58, 2.01it/s]epoch: 2 loss: 0.0436956 f1: 0.7534884: 92%|█████████▏| 1379/1497 [12:30<00:58, 2.01it/s]epoch: 2 loss: 0.0436956 f1: 0.7534884: 92%|█████████▏| 1380/1497 [12:30<00:57, 2.02it/s]epoch: 2 loss: 0.0189853 f1: 0.7534884: 92%|█████████▏| 1380/1497 [12:30<00:57, 2.02it/s]epoch: 2 loss: 0.0189853 f1: 0.7534884: 92%|█████████▏| 1381/1497 [12:30<00:56, 2.04it/s]epoch: 2 loss: 0.1043618 f1: 0.7534884: 92%|█████████▏| 1381/1497 
[12:31<00:56, 2.04it/s]epoch: 2 loss: 0.1043618 f1: 0.7534884: 92%|█████████▏| 1382/1497 [12:31<00:55, 2.06it/s]epoch: 2 loss: 0.2587200 f1: 0.7534884: 92%|█████████▏| 1382/1497 [12:31<00:55, 2.06it/s]epoch: 2 loss: 0.2587200 f1: 0.7534884: 92%|█████████▏| 1383/1497 [12:31<00:55, 2.05it/s]epoch: 2 loss: 0.0696512 f1: 0.7534884: 92%|█████████▏| 1383/1497 [12:32<00:55, 2.05it/s]epoch: 2 loss: 0.0696512 f1: 0.7534884: 92%|█████████▏| 1384/1497 [12:32<00:54, 2.06it/s]epoch: 2 loss: 0.0534706 f1: 0.7534884: 92%|█████████▏| 1384/1497 [12:32<00:54, 2.06it/s]epoch: 2 loss: 0.0534706 f1: 0.7534884: 93%|█████████▎| 1385/1497 [12:32<00:54, 2.06it/s]epoch: 2 loss: 0.2038280 f1: 0.7534884: 93%|█████████▎| 1385/1497 [12:33<00:54, 2.06it/s]epoch: 2 loss: 0.2038280 f1: 0.7534884: 93%|█████████▎| 1386/1497 [12:33<00:53, 2.06it/s]epoch: 2 loss: 0.0030039 f1: 0.7534884: 93%|█████████▎| 1386/1497 [12:33<00:53, 2.06it/s]epoch: 2 loss: 0.0030039 f1: 0.7534884: 93%|█████████▎| 1387/1497 [12:33<00:53, 2.07it/s]epoch: 2 loss: 0.0239200 f1: 0.7534884: 93%|█████████▎| 1387/1497 [12:34<00:53, 2.07it/s]epoch: 2 loss: 0.0239200 f1: 0.7534884: 93%|█████████▎| 1388/1497 [12:34<00:52, 2.06it/s]epoch: 2 loss: 0.0044061 f1: 0.7534884: 93%|█████████▎| 1388/1497 [12:34<00:52, 2.06it/s]epoch: 2 loss: 0.0044061 f1: 0.7534884: 93%|█████████▎| 1389/1497 [12:34<00:52, 2.07it/s]epoch: 2 loss: 0.0270178 f1: 0.7534884: 93%|█████████▎| 1389/1497 [12:35<00:52, 2.07it/s]epoch: 2 loss: 0.0270178 f1: 0.7534884: 93%|█████████▎| 1390/1497 [12:35<00:51, 2.07it/s]epoch: 2 loss: 0.0458736 f1: 0.7534884: 93%|█████████▎| 1390/1497 [12:35<00:51, 2.07it/s]epoch: 2 loss: 0.0458736 f1: 0.7534884: 93%|█████████▎| 1391/1497 [12:35<00:51, 2.07it/s]epoch: 2 loss: 0.0149704 f1: 0.7534884: 93%|█████████▎| 1391/1497 [12:36<00:51, 2.07it/s]epoch: 2 loss: 0.0149704 f1: 0.7534884: 93%|█████████▎| 1392/1497 [12:36<00:50, 2.07it/s]epoch: 2 loss: 0.0237245 f1: 0.7534884: 93%|█████████▎| 1392/1497 [12:36<00:50, 2.07it/s]epoch: 2 loss: 
0.0237245 f1: 0.7534884: 93%|█████████▎| 1393/1497 [12:36<00:50, 2.06it/s]epoch: 2 loss: 0.0804670 f1: 0.7534884: 93%|█████████▎| 1393/1497 [12:37<00:50, 2.06it/s]epoch: 2 loss: 0.0804670 f1: 0.7534884: 93%|█████████▎| 1394/1497 [12:37<00:50, 2.05it/s]epoch: 2 loss: 0.0262227 f1: 0.7534884: 93%|█████████▎| 1394/1497 [12:37<00:50, 2.05it/s]epoch: 2 loss: 0.0262227 f1: 0.7534884: 93%|█████████▎| 1395/1497 [12:37<00:49, 2.04it/s]epoch: 2 loss: 0.0139776 f1: 0.7534884: 93%|█████████▎| 1395/1497 [12:38<00:49, 2.04it/s]epoch: 2 loss: 0.0139776 f1: 0.7534884: 93%|█████████▎| 1396/1497 [12:38<00:49, 2.03it/s]epoch: 2 loss: 0.0666381 f1: 0.7534884: 93%|█████████▎| 1396/1497 [12:38<00:49, 2.03it/s]epoch: 2 loss: 0.0666381 f1: 0.7534884: 93%|█████████▎| 1397/1497 [12:38<00:49, 2.04it/s]epoch: 2 loss: 0.0368136 f1: 0.7534884: 93%|█████████▎| 1397/1497 [12:39<00:49, 2.04it/s]epoch: 2 loss: 0.0368136 f1: 0.7534884: 93%|█████████▎| 1398/1497 [12:39<00:48, 2.03it/s]epoch: 2 loss: 0.0963077 f1: 0.7534884: 93%|█████████▎| 1398/1497 [12:39<00:48, 2.03it/s]epoch: 2 loss: 0.0963077 f1: 0.7534884: 93%|█████████▎| 1399/1497 [12:39<00:48, 2.02it/s]epoch: 2 loss: 0.0553483 f1: 0.7534884: 93%|█████████▎| 1399/1497 [12:40<00:48, 2.02it/s]epoch: 2 loss: 0.0553483 f1: 0.7534884: 94%|█████████▎| 1400/1497 [12:40<00:48, 2.00it/s]epoch: 2 loss: 0.0456652 f1: 0.7534884: 94%|█████████▎| 1400/1497 [12:40<00:48, 2.00it/s]epoch: 2 loss: 0.0456652 f1: 0.7534884: 94%|█████████▎| 1401/1497 [12:40<00:47, 2.01it/s]epoch: 2 loss: 0.0242458 f1: 0.7534884: 94%|█████████▎| 1401/1497 [12:41<00:47, 2.01it/s]epoch: 2 loss: 0.0242458 f1: 0.7534884: 94%|█████████▎| 1402/1497 [12:41<00:47, 2.01it/s]epoch: 2 loss: 0.0073331 f1: 0.7534884: 94%|█████████▎| 1402/1497 [12:41<00:47, 2.01it/s]epoch: 2 loss: 0.0073331 f1: 0.7534884: 94%|█████████▎| 1403/1497 [12:41<00:46, 2.02it/s]epoch: 2 loss: 0.0396405 f1: 0.7534884: 94%|█████████▎| 1403/1497 [12:42<00:46, 2.02it/s]epoch: 2 loss: 0.0396405 f1: 0.7534884: 94%|█████████▍| 
1404/1497 [12:42<00:46, 2.01it/s]epoch: 2 loss: 0.0032324 f1: 0.7534884: 94%|█████████▍| 1404/1497 [12:42<00:46, 2.01it/s]epoch: 2 loss: 0.0032324 f1: 0.7534884: 94%|█████████▍| 1405/1497 [12:42<00:45, 2.00it/s]epoch: 2 loss: 0.0094969 f1: 0.7534884: 94%|█████████▍| 1405/1497 [12:43<00:45, 2.00it/s]epoch: 2 loss: 0.0094969 f1: 0.7534884: 94%|█████████▍| 1406/1497 [12:43<00:45, 2.01it/s]epoch: 2 loss: 0.0109303 f1: 0.7534884: 94%|█████████▍| 1406/1497 [12:43<00:45, 2.01it/s]epoch: 2 loss: 0.0109303 f1: 0.7534884: 94%|█████████▍| 1407/1497 [12:43<00:44, 2.02it/s]epoch: 2 loss: 0.0949404 f1: 0.7534884: 94%|█████████▍| 1407/1497 [12:44<00:44, 2.02it/s]epoch: 2 loss: 0.0949404 f1: 0.7534884: 94%|█████████▍| 1408/1497 [12:44<00:43, 2.03it/s]epoch: 2 loss: 0.0284669 f1: 0.7534884: 94%|█████████▍| 1408/1497 [12:44<00:43, 2.03it/s]epoch: 2 loss: 0.0284669 f1: 0.7534884: 94%|█████████▍| 1409/1497 [12:44<00:43, 2.04it/s]epoch: 2 loss: 0.0098845 f1: 0.7534884: 94%|█████████▍| 1409/1497 [12:45<00:43, 2.04it/s]epoch: 2 loss: 0.0098845 f1: 0.7534884: 94%|█████████▍| 1410/1497 [12:45<00:42, 2.05it/s]epoch: 2 loss: 0.0403983 f1: 0.7534884: 94%|█████████▍| 1410/1497 [12:45<00:42, 2.05it/s]epoch: 2 loss: 0.0403983 f1: 0.7534884: 94%|█████████▍| 1411/1497 [12:45<00:42, 2.04it/s]epoch: 2 loss: 0.0231897 f1: 0.7534884: 94%|█████████▍| 1411/1497 [12:46<00:42, 2.04it/s]epoch: 2 loss: 0.0231897 f1: 0.7534884: 94%|█████████▍| 1412/1497 [12:46<00:42, 2.02it/s]epoch: 2 loss: 0.0245235 f1: 0.7534884: 94%|█████████▍| 1412/1497 [12:46<00:42, 2.02it/s]epoch: 2 loss: 0.0245235 f1: 0.7534884: 94%|█████████▍| 1413/1497 [12:46<00:41, 2.04it/s]epoch: 2 loss: 0.0587143 f1: 0.7534884: 94%|█████████▍| 1413/1497 [12:46<00:41, 2.04it/s]epoch: 2 loss: 0.0587143 f1: 0.7534884: 94%|█████████▍| 1414/1497 [12:46<00:40, 2.03it/s]epoch: 2 loss: 0.0848201 f1: 0.7534884: 94%|█████████▍| 1414/1497 [12:47<00:40, 2.03it/s]epoch: 2 loss: 0.0848201 f1: 0.7534884: 95%|█████████▍| 1415/1497 [12:47<00:40, 2.03it/s]epoch: 2 
loss: 0.0033282 f1: 0.7534884: 95%|█████████▍| 1415/1497 [12:48<00:40, 2.03it/s]epoch: 2 loss: 0.0033282 f1: 0.7534884: 95%|█████████▍| 1416/1497 [12:48<00:40, 1.98it/s]epoch: 2 loss: 0.1066514 f1: 0.7534884: 95%|█████████▍| 1416/1497 [12:48<00:40, 1.98it/s]epoch: 2 loss: 0.1066514 f1: 0.7534884: 95%|█████████▍| 1417/1497 [12:48<00:40, 1.97it/s]epoch: 2 loss: 0.0958104 f1: 0.7534884: 95%|█████████▍| 1417/1497 [12:49<00:40, 1.97it/s]epoch: 2 loss: 0.0958104 f1: 0.7534884: 95%|█████████▍| 1418/1497 [12:49<00:40, 1.95it/s]epoch: 2 loss: 0.0170944 f1: 0.7534884: 95%|█████████▍| 1418/1497 [12:49<00:40, 1.95it/s]epoch: 2 loss: 0.0170944 f1: 0.7534884: 95%|█████████▍| 1419/1497 [12:49<00:40, 1.94it/s]epoch: 2 loss: 0.1104763 f1: 0.7534884: 95%|█████████▍| 1419/1497 [12:50<00:40, 1.94it/s]epoch: 2 loss: 0.1104763 f1: 0.7534884: 95%|█████████▍| 1420/1497 [12:50<00:40, 1.92it/s]epoch: 2 loss: 0.0354565 f1: 0.7534884: 95%|█████████▍| 1420/1497 [12:50<00:40, 1.92it/s]epoch: 2 loss: 0.0354565 f1: 0.7534884: 95%|█████████▍| 1421/1497 [12:50<00:39, 1.92it/s]epoch: 2 loss: 0.0763545 f1: 0.7534884: 95%|█████████▍| 1421/1497 [12:51<00:39, 1.92it/s]epoch: 2 loss: 0.0763545 f1: 0.7534884: 95%|█████████▍| 1422/1497 [12:51<00:38, 1.93it/s]epoch: 2 loss: 0.0732757 f1: 0.7534884: 95%|█████████▍| 1422/1497 [12:51<00:38, 1.93it/s]epoch: 2 loss: 0.0732757 f1: 0.7534884: 95%|█████████▌| 1423/1497 [12:51<00:38, 1.94it/s]epoch: 2 loss: 0.0479094 f1: 0.7534884: 95%|█████████▌| 1423/1497 [12:52<00:38, 1.94it/s]epoch: 2 loss: 0.0479094 f1: 0.7534884: 95%|█████████▌| 1424/1497 [12:52<00:37, 1.94it/s]epoch: 2 loss: 0.0321108 f1: 0.7534884: 95%|█████████▌| 1424/1497 [12:52<00:37, 1.94it/s]epoch: 2 loss: 0.0321108 f1: 0.7534884: 95%|█████████▌| 1425/1497 [12:52<00:36, 1.96it/s]epoch: 2 loss: 0.0700717 f1: 0.7534884: 95%|█████████▌| 1425/1497 [12:53<00:36, 1.96it/s]epoch: 2 loss: 0.0700717 f1: 0.7534884: 95%|█████████▌| 1426/1497 [12:53<00:36, 1.96it/s]epoch: 2 loss: 0.1020330 f1: 0.7534884: 
95%|█████████▌| 1426/1497 [12:53<00:36, 1.96it/s]epoch: 2 loss: 0.1020330 f1: 0.7534884: 95%|█████████▌| 1427/1497 [12:53<00:35, 1.97it/s]epoch: 2 loss: 0.0162533 f1: 0.7534884: 95%|█████████▌| 1427/1497 [12:54<00:35, 1.97it/s]epoch: 2 loss: 0.0162533 f1: 0.7534884: 95%|█████████▌| 1428/1497 [12:54<00:34, 1.98it/s]epoch: 2 loss: 0.0125528 f1: 0.7534884: 95%|█████████▌| 1428/1497 [12:54<00:34, 1.98it/s]epoch: 2 loss: 0.0125528 f1: 0.7534884: 95%|█████████▌| 1429/1497 [12:54<00:33, 2.01it/s]epoch: 2 loss: 0.1208976 f1: 0.7534884: 95%|█████████▌| 1429/1497 [12:55<00:33, 2.01it/s]epoch: 2 loss: 0.1208976 f1: 0.7534884: 96%|█████████▌| 1430/1497 [12:55<00:33, 2.02it/s]epoch: 2 loss: 0.0822922 f1: 0.7534884: 96%|█████████▌| 1430/1497 [12:55<00:33, 2.02it/s]epoch: 2 loss: 0.0822922 f1: 0.7534884: 96%|█████████▌| 1431/1497 [12:55<00:32, 2.04it/s]epoch: 2 loss: 0.0281185 f1: 0.7534884: 96%|█████████▌| 1431/1497 [12:56<00:32, 2.04it/s]epoch: 2 loss: 0.0281185 f1: 0.7534884: 96%|█████████▌| 1432/1497 [12:56<00:31, 2.04it/s]epoch: 2 loss: 0.0082374 f1: 0.7534884: 96%|█████████▌| 1432/1497 [12:56<00:31, 2.04it/s]epoch: 2 loss: 0.0082374 f1: 0.7534884: 96%|█████████▌| 1433/1497 [12:56<00:31, 2.02it/s]epoch: 2 loss: 0.0233201 f1: 0.7534884: 96%|█████████▌| 1433/1497 [12:57<00:31, 2.02it/s]epoch: 2 loss: 0.0233201 f1: 0.7534884: 96%|█████████▌| 1434/1497 [12:57<00:31, 2.01it/s]epoch: 2 loss: 0.0068763 f1: 0.7534884: 96%|█████████▌| 1434/1497 [12:57<00:31, 2.01it/s]epoch: 2 loss: 0.0068763 f1: 0.7534884: 96%|█████████▌| 1435/1497 [12:57<00:30, 2.01it/s]epoch: 2 loss: 0.0110418 f1: 0.7534884: 96%|█████████▌| 1435/1497 [12:58<00:30, 2.01it/s]epoch: 2 loss: 0.0110418 f1: 0.7534884: 96%|█████████▌| 1436/1497 [12:58<00:30, 2.01it/s]epoch: 2 loss: 0.0238045 f1: 0.7534884: 96%|█████████▌| 1436/1497 [12:58<00:30, 2.01it/s]epoch: 2 loss: 0.0238045 f1: 0.7534884: 96%|█████████▌| 1437/1497 [12:58<00:29, 2.01it/s]epoch: 2 loss: 0.0138156 f1: 0.7534884: 96%|█████████▌| 1437/1497 [12:59<00:29, 
2.01it/s]epoch: 2 loss: 0.0138156 f1: 0.7534884: 96%|█████████▌| 1438/1497 [12:59<00:29, 2.01it/s]epoch: 2 loss: 0.0074654 f1: 0.7534884: 96%|█████████▌| 1438/1497 [12:59<00:29, 2.01it/s]epoch: 2 loss: 0.0074654 f1: 0.7534884: 96%|█████████▌| 1439/1497 [12:59<00:28, 2.01it/s]epoch: 2 loss: 0.0676955 f1: 0.7534884: 96%|█████████▌| 1439/1497 [13:00<00:28, 2.01it/s]epoch: 2 loss: 0.0676955 f1: 0.7534884: 96%|█████████▌| 1440/1497 [13:00<00:28, 1.99it/s]epoch: 2 loss: 0.0096973 f1: 0.7534884: 96%|█████████▌| 1440/1497 [13:00<00:28, 1.99it/s]epoch: 2 loss: 0.0096973 f1: 0.7534884: 96%|█████████▋| 1441/1497 [13:00<00:28, 1.97it/s]epoch: 2 loss: 0.0763880 f1: 0.7534884: 96%|█████████▋| 1441/1497 [13:01<00:28, 1.97it/s]epoch: 2 loss: 0.0763880 f1: 0.7534884: 96%|█████████▋| 1442/1497 [13:01<00:27, 1.97it/s]epoch: 2 loss: 0.0731691 f1: 0.7534884: 96%|█████████▋| 1442/1497 [13:01<00:27, 1.97it/s]epoch: 2 loss: 0.0731691 f1: 0.7534884: 96%|█████████▋| 1443/1497 [13:01<00:27, 2.00it/s]epoch: 2 loss: 0.2495193 f1: 0.7534884: 96%|█████████▋| 1443/1497 [13:02<00:27, 2.00it/s]epoch: 2 loss: 0.2495193 f1: 0.7534884: 96%|█████████▋| 1444/1497 [13:02<00:26, 2.00it/s]epoch: 2 loss: 0.0086037 f1: 0.7534884: 96%|█████████▋| 1444/1497 [13:02<00:26, 2.00it/s]epoch: 2 loss: 0.0086037 f1: 0.7534884: 97%|█████████▋| 1445/1497 [13:02<00:25, 2.02it/s]epoch: 2 loss: 0.0424856 f1: 0.7534884: 97%|█████████▋| 1445/1497 [13:03<00:25, 2.02it/s]epoch: 2 loss: 0.0424856 f1: 0.7534884: 97%|█████████▋| 1446/1497 [13:03<00:25, 2.03it/s]epoch: 2 loss: 0.1354143 f1: 0.7534884: 97%|█████████▋| 1446/1497 [13:03<00:25, 2.03it/s]epoch: 2 loss: 0.1354143 f1: 0.7534884: 97%|█████████▋| 1447/1497 [13:03<00:24, 2.04it/s]epoch: 2 loss: 0.0135276 f1: 0.7534884: 97%|█████████▋| 1447/1497 [13:04<00:24, 2.04it/s]epoch: 2 loss: 0.0135276 f1: 0.7534884: 97%|█████████▋| 1448/1497 [13:04<00:23, 2.04it/s]epoch: 2 loss: 0.0109496 f1: 0.7534884: 97%|█████████▋| 1448/1497 [13:04<00:23, 2.04it/s]epoch: 2 loss: 0.0109496 f1: 
0.7534884: 97%|█████████▋| 1449/1497 [13:04<00:23, 2.02it/s]epoch: 2 loss: 0.1032324 f1: 0.7534884: 97%|█████████▋| 1449/1497 [13:05<00:23, 2.02it/s]epoch: 2 loss: 0.1032324 f1: 0.7534884: 97%|█████████▋| 1450/1497 [13:05<00:23, 2.00it/s]epoch: 2 loss: 0.0479416 f1: 0.7534884: 97%|█████████▋| 1450/1497 [13:05<00:23, 2.00it/s]epoch: 2 loss: 0.0479416 f1: 0.7534884: 97%|█████████▋| 1451/1497 [13:05<00:23, 1.99it/s]epoch: 2 loss: 0.0335531 f1: 0.7534884: 97%|█████████▋| 1451/1497 [13:06<00:23, 1.99it/s]epoch: 2 loss: 0.0335531 f1: 0.7534884: 97%|█████████▋| 1452/1497 [13:06<00:22, 1.98it/s]epoch: 2 loss: 0.0649863 f1: 0.7534884: 97%|█████████▋| 1452/1497 [13:06<00:22, 1.98it/s]epoch: 2 loss: 0.0649863 f1: 0.7534884: 97%|█████████▋| 1453/1497 [13:06<00:22, 1.97it/s]epoch: 2 loss: 0.0541434 f1: 0.7534884: 97%|█████████▋| 1453/1497 [13:07<00:22, 1.97it/s]epoch: 2 loss: 0.0541434 f1: 0.7534884: 97%|█████████▋| 1454/1497 [13:07<00:21, 1.97it/s]epoch: 2 loss: 0.0291349 f1: 0.7534884: 97%|█████████▋| 1454/1497 [13:07<00:21, 1.97it/s]epoch: 2 loss: 0.0291349 f1: 0.7534884: 97%|█████████▋| 1455/1497 [13:07<00:21, 1.97it/s]epoch: 2 loss: 0.0133080 f1: 0.7534884: 97%|█████████▋| 1455/1497 [13:08<00:21, 1.97it/s]epoch: 2 loss: 0.0133080 f1: 0.7534884: 97%|█████████▋| 1456/1497 [13:08<00:21, 1.95it/s]epoch: 2 loss: 0.0143245 f1: 0.7534884: 97%|█████████▋| 1456/1497 [13:08<00:21, 1.95it/s]epoch: 2 loss: 0.0143245 f1: 0.7534884: 97%|█████████▋| 1457/1497 [13:08<00:20, 1.95it/s]epoch: 2 loss: 0.0725932 f1: 0.7534884: 97%|█████████▋| 1457/1497 [13:09<00:20, 1.95it/s]epoch: 2 loss: 0.0725932 f1: 0.7534884: 97%|█████████▋| 1458/1497 [13:09<00:19, 1.97it/s]epoch: 2 loss: 0.0282501 f1: 0.7534884: 97%|█████████▋| 1458/1497 [13:09<00:19, 1.97it/s]epoch: 2 loss: 0.0282501 f1: 0.7534884: 97%|█████████▋| 1459/1497 [13:09<00:19, 1.97it/s]epoch: 2 loss: 0.0386611 f1: 0.7534884: 97%|█████████▋| 1459/1497 [13:10<00:19, 1.97it/s]epoch: 2 loss: 0.0386611 f1: 0.7534884: 98%|█████████▊| 1460/1497 
[13:10<00:18, 1.99it/s]epoch: 2 loss: 0.0149623 f1: 0.7534884: 98%|█████████▊| 1460/1497 [13:10<00:18, 1.99it/s]epoch: 2 loss: 0.0149623 f1: 0.7534884: 98%|█████████▊| 1461/1497 [13:10<00:18, 1.95it/s]epoch: 2 loss: 0.0657206 f1: 0.7534884: 98%|█████████▊| 1461/1497 [13:11<00:18, 1.95it/s]epoch: 2 loss: 0.0657206 f1: 0.7534884: 98%|█████████▊| 1462/1497 [13:11<00:17, 1.96it/s]epoch: 2 loss: 0.0286788 f1: 0.7534884: 98%|█████████▊| 1462/1497 [13:11<00:17, 1.96it/s]epoch: 2 loss: 0.0286788 f1: 0.7534884: 98%|█████████▊| 1463/1497 [13:11<00:17, 1.97it/s]epoch: 2 loss: 0.0347097 f1: 0.7534884: 98%|█████████▊| 1463/1497 [13:12<00:17, 1.97it/s]epoch: 2 loss: 0.0347097 f1: 0.7534884: 98%|█████████▊| 1464/1497 [13:12<00:16, 1.98it/s]epoch: 2 loss: 0.0153873 f1: 0.7534884: 98%|█████████▊| 1464/1497 [13:12<00:16, 1.98it/s]epoch: 2 loss: 0.0153873 f1: 0.7534884: 98%|█████████▊| 1465/1497 [13:12<00:16, 1.99it/s]epoch: 2 loss: 0.0411975 f1: 0.7534884: 98%|█████████▊| 1465/1497 [13:13<00:16, 1.99it/s]epoch: 2 loss: 0.0411975 f1: 0.7534884: 98%|█████████▊| 1466/1497 [13:13<00:15, 2.00it/s]epoch: 2 loss: 0.0085960 f1: 0.7534884: 98%|█████████▊| 1466/1497 [13:13<00:15, 2.00it/s]epoch: 2 loss: 0.0085960 f1: 0.7534884: 98%|█████████▊| 1467/1497 [13:13<00:15, 2.00it/s]epoch: 2 loss: 0.0878691 f1: 0.7534884: 98%|█████████▊| 1467/1497 [13:14<00:15, 2.00it/s]epoch: 2 loss: 0.0878691 f1: 0.7534884: 98%|█████████▊| 1468/1497 [13:14<00:14, 1.99it/s]epoch: 2 loss: 0.0251055 f1: 0.7534884: 98%|█████████▊| 1468/1497 [13:14<00:14, 1.99it/s]epoch: 2 loss: 0.0251055 f1: 0.7534884: 98%|█████████▊| 1469/1497 [13:14<00:14, 1.99it/s]epoch: 2 loss: 0.0139089 f1: 0.7534884: 98%|█████████▊| 1469/1497 [13:15<00:14, 1.99it/s]epoch: 2 loss: 0.0139089 f1: 0.7534884: 98%|█████████▊| 1470/1497 [13:15<00:13, 1.99it/s]epoch: 2 loss: 0.1445772 f1: 0.7534884: 98%|█████████▊| 1470/1497 [13:15<00:13, 1.99it/s]epoch: 2 loss: 0.1445772 f1: 0.7534884: 98%|█████████▊| 1471/1497 [13:15<00:12, 2.00it/s]epoch: 2 loss: 
0.0640335 f1: 0.7534884: 98%|█████████▊| 1471/1497 [13:16<00:12, 2.00it/s]epoch: 2 loss: 0.0640335 f1: 0.7534884: 98%|█████████▊| 1472/1497 [13:16<00:12, 2.00it/s]epoch: 2 loss: 0.0222863 f1: 0.7534884: 98%|█████████▊| 1472/1497 [13:16<00:12, 2.00it/s]epoch: 2 loss: 0.0222863 f1: 0.7534884: 98%|█████████▊| 1473/1497 [13:16<00:11, 2.00it/s]epoch: 2 loss: 0.0036518 f1: 0.7534884: 98%|█████████▊| 1473/1497 [13:17<00:11, 2.00it/s]epoch: 2 loss: 0.0036518 f1: 0.7534884: 98%|█████████▊| 1474/1497 [13:17<00:11, 1.98it/s]epoch: 2 loss: 0.0601188 f1: 0.7534884: 98%|█████████▊| 1474/1497 [13:17<00:11, 1.98it/s]epoch: 2 loss: 0.0601188 f1: 0.7534884: 99%|█████████▊| 1475/1497 [13:17<00:11, 1.99it/s]epoch: 2 loss: 0.0414398 f1: 0.7534884: 99%|█████████▊| 1475/1497 [13:18<00:11, 1.99it/s]epoch: 2 loss: 0.0414398 f1: 0.7534884: 99%|█████████▊| 1476/1497 [13:18<00:10, 1.97it/s]epoch: 2 loss: 0.0096280 f1: 0.7534884: 99%|█████████▊| 1476/1497 [13:18<00:10, 1.97it/s]epoch: 2 loss: 0.0096280 f1: 0.7534884: 99%|█████████▊| 1477/1497 [13:18<00:10, 1.98it/s]epoch: 2 loss: 0.0600173 f1: 0.7534884: 99%|█████████▊| 1477/1497 [13:19<00:10, 1.98it/s]epoch: 2 loss: 0.0600173 f1: 0.7534884: 99%|█████████▊| 1478/1497 [13:19<00:09, 1.98it/s]epoch: 2 loss: 0.0154092 f1: 0.7534884: 99%|█████████▊| 1478/1497 [13:19<00:09, 1.98it/s]epoch: 2 loss: 0.0154092 f1: 0.7534884: 99%|█████████▉| 1479/1497 [13:19<00:09, 1.98it/s]epoch: 2 loss: 0.0296047 f1: 0.7534884: 99%|█████████▉| 1479/1497 [13:20<00:09, 1.98it/s]epoch: 2 loss: 0.0296047 f1: 0.7534884: 99%|█████████▉| 1480/1497 [13:20<00:08, 1.99it/s]epoch: 2 loss: 0.2033139 f1: 0.7534884: 99%|█████████▉| 1480/1497 [13:20<00:08, 1.99it/s]epoch: 2 loss: 0.2033139 f1: 0.7534884: 99%|█████████▉| 1481/1497 [13:20<00:08, 1.99it/s]epoch: 2 loss: 0.0601019 f1: 0.7534884: 99%|█████████▉| 1481/1497 [13:21<00:08, 1.99it/s]epoch: 2 loss: 0.0601019 f1: 0.7534884: 99%|█████████▉| 1482/1497 [13:21<00:07, 1.99it/s]epoch: 2 loss: 0.0982848 f1: 0.7534884: 99%|█████████▉| 
1482/1497 [13:21<00:07, 1.99it/s]epoch: 2 loss: 0.0982848 f1: 0.7534884: 99%|█████████▉| 1483/1497 [13:21<00:07, 1.99it/s]epoch: 2 loss: 0.0399020 f1: 0.7534884: 99%|█████████▉| 1483/1497 [13:22<00:07, 1.99it/s]epoch: 2 loss: 0.0399020 f1: 0.7534884: 99%|█████████▉| 1484/1497 [13:22<00:06, 1.99it/s]epoch: 2 loss: 0.0026644 f1: 0.7534884: 99%|█████████▉| 1484/1497 [13:22<00:06, 1.99it/s]epoch: 2 loss: 0.0026644 f1: 0.7534884: 99%|█████████▉| 1485/1497 [13:22<00:06, 1.99it/s]epoch: 2 loss: 0.0482176 f1: 0.7534884: 99%|█████████▉| 1485/1497 [13:23<00:06, 1.99it/s]epoch: 2 loss: 0.0482176 f1: 0.7534884: 99%|█████████▉| 1486/1497 [13:23<00:05, 2.00it/s]epoch: 2 loss: 0.0029265 f1: 0.7534884: 99%|█████████▉| 1486/1497 [13:23<00:05, 2.00it/s]epoch: 2 loss: 0.0029265 f1: 0.7534884: 99%|█████████▉| 1487/1497 [13:23<00:04, 2.02it/s]epoch: 2 loss: 0.1185379 f1: 0.7534884: 99%|█████████▉| 1487/1497 [13:24<00:04, 2.02it/s]epoch: 2 loss: 0.1185379 f1: 0.7534884: 99%|█████████▉| 1488/1497 [13:24<00:04, 2.02it/s]epoch: 2 loss: 0.1164342 f1: 0.7534884: 99%|█████████▉| 1488/1497 [13:24<00:04, 2.02it/s]epoch: 2 loss: 0.1164342 f1: 0.7534884: 99%|█████████▉| 1489/1497 [13:24<00:03, 2.04it/s]epoch: 2 loss: 0.0489044 f1: 0.7534884: 99%|█████████▉| 1489/1497 [13:25<00:03, 2.04it/s]epoch: 2 loss: 0.0489044 f1: 0.7534884: 100%|█████████▉| 1490/1497 [13:25<00:03, 2.03it/s]epoch: 2 loss: 0.1648774 f1: 0.7534884: 100%|█████████▉| 1490/1497 [13:25<00:03, 2.03it/s]epoch: 2 loss: 0.1648774 f1: 0.7534884: 100%|█████████▉| 1491/1497 [13:25<00:02, 2.03it/s]epoch: 2 loss: 0.0418452 f1: 0.7534884: 100%|█████████▉| 1491/1497 [13:26<00:02, 2.03it/s]epoch: 2 loss: 0.0418452 f1: 0.7534884: 100%|█████████▉| 1492/1497 [13:26<00:02, 2.03it/s]epoch: 2 loss: 0.0473516 f1: 0.7534884: 100%|█████████▉| 1492/1497 [13:26<00:02, 2.03it/s]epoch: 2 loss: 0.0473516 f1: 0.7534884: 100%|█████████▉| 1493/1497 [13:26<00:01, 2.01it/s]epoch: 2 loss: 0.0890494 f1: 0.7534884: 100%|█████████▉| 1493/1497 [13:27<00:01, 
2.01it/s]epoch: 2 loss: 0.0890494 f1: 0.7534884: 100%|█████████▉| 1494/1497 [13:27<00:01, 2.00it/s]epoch: 2 loss: 0.1532649 f1: 0.7534884: 100%|█████████▉| 1494/1497 [13:27<00:01, 2.00it/s]epoch: 2 loss: 0.1532649 f1: 0.7534884: 100%|█████████▉| 1495/1497 [13:27<00:01, 1.99it/s]epoch: 2 loss: 0.0101394 f1: 0.7534884: 100%|█████████▉| 1495/1497 [13:28<00:01, 1.99it/s]epoch: 2 loss: 0.0101394 f1: 0.7534884: 100%|█████████▉| 1496/1497 [13:28<00:00, 1.99it/s]
0%| | 0/1998 [00:00<?, ?it/s]
23%|██▎ | 460/1998 [00:00<00:00, 4597.15it/s]
46%|████▌ | 912/1998 [00:00<00:00, 4571.90it/s]
69%|██████▉ | 1381/1998 [00:00<00:00, 4604.61it/s]
92%|█████████▏| 1841/1998 [00:00<00:00, 4602.62it/s]
100%|██████████| 1998/1998 [00:00<00:00, 4586.53it/s]
test: 0%| | 0/63 [00:00<?, ?it/s]
test: 2%|▏ | 1/63 [00:00<00:14, 4.18it/s]
test: 3%|▎ | 2/63 [00:00<00:12, 4.80it/s]
test: 5%|▍ | 3/63 [00:00<00:11, 5.11it/s]
test: 6%|▋ | 4/63 [00:00<00:11, 5.30it/s]
test: 8%|▊ | 5/63 [00:00<00:10, 5.46it/s]
test: 10%|▉ | 6/63 [00:01<00:09, 5.89it/s]
test: 11%|█ | 7/63 [00:01<00:09, 5.98it/s]
test: 13%|█▎ | 8/63 [00:01<00:09, 5.73it/s]
test: 14%|█▍ | 9/63 [00:01<00:09, 5.98it/s]
test: 16%|█▌ | 10/63 [00:01<00:08, 6.33it/s]
test: 17%|█▋ | 11/63 [00:01<00:08, 6.33it/s]
test: 19%|█▉ | 12/63 [00:02<00:08, 5.96it/s]
test: 21%|██ | 13/63 [00:02<00:07, 6.35it/s]
test: 22%|██▏ | 14/63 [00:02<00:07, 6.21it/s]
test: 24%|██▍ | 15/63 [00:02<00:08, 5.91it/s]
test: 25%|██▌ | 16/63 [00:02<00:07, 5.99it/s]
test: 27%|██▋ | 17/63 [00:02<00:07, 6.35it/s]
test: 29%|██▊ | 18/63 [00:02<00:07, 6.19it/s]
test: 30%|███ | 19/63 [00:03<00:07, 5.64it/s]
test: 32%|███▏ | 20/63 [00:03<00:07, 6.05it/s]
test: 33%|███▎ | 21/63 [00:03<00:06, 6.01it/s]
test: 35%|███▍ | 22/63 [00:03<00:07, 5.64it/s]
test: 37%|███▋ | 23/63 [00:03<00:06, 5.82it/s]
test: 38%|███▊ | 24/63 [00:04<00:06, 5.86it/s]
test: 40%|███▉ | 25/63 [00:04<00:06, 6.27it/s]
test: 41%|████▏ | 26/63 [00:04<00:06, 6.17it/s]
test: 43%|████▎ | 27/63 [00:04<00:06, 5.54it/s]
test: 44%|████▍ | 28/63 [00:04<00:05, 6.01it/s]
test: 46%|████▌ | 29/63 [00:04<00:05, 6.21it/s]
test: 48%|████▊ | 30/63 [00:04<00:05, 6.11it/s]
test: 49%|████▉ | 31/63 [00:05<00:05, 6.03it/s]
test: 51%|█████ | 32/63 [00:05<00:04, 6.38it/s]
test: 52%|█████▏ | 33/63 [00:05<00:05, 5.91it/s]
test: 54%|█████▍ | 34/63 [00:05<00:04, 6.06it/s]
test: 56%|█████▌ | 35/63 [00:05<00:04, 6.13it/s]
test: 57%|█████▋ | 36/63 [00:05<00:04, 6.15it/s]
test: 59%|█████▊ | 37/63 [00:06<00:04, 6.23it/s]
test: 60%|██████ | 38/63 [00:06<00:03, 6.38it/s]
test: 62%|██████▏ | 39/63 [00:06<00:03, 6.36it/s]
test: 63%|██████▎ | 40/63 [00:06<00:03, 6.33it/s]
test: 65%|██████▌ | 41/63 [00:06<00:03, 5.91it/s]
test: 67%|██████▋ | 42/63 [00:06<00:03, 6.18it/s]
test: 68%|██████▊ | 43/63 [00:07<00:03, 6.47it/s]
test: 70%|██████▉ | 44/63 [00:07<00:02, 6.48it/s]
test: 71%|███████▏ | 45/63 [00:07<00:02, 6.43it/s]
test: 73%|███████▎ | 46/63 [00:07<00:02, 6.56it/s]
test: 75%|███████▍ | 47/63 [00:07<00:02, 6.30it/s]
test: 76%|███████▌ | 48/63 [00:07<00:02, 6.18it/s]
test: 78%|███████▊ | 49/63 [00:08<00:02, 6.39it/s]
test: 79%|███████▉ | 50/63 [00:08<00:01, 6.51it/s]
test: 81%|████████ | 51/63 [00:08<00:01, 6.31it/s]
test: 83%|████████▎ | 52/63 [00:08<00:01, 6.63it/s]
test: 84%|████████▍ | 53/63 [00:08<00:01, 6.59it/s]
test: 86%|████████▌ | 54/63 [00:08<00:01, 6.79it/s]
test: 87%|████████▋ | 55/63 [00:08<00:01, 6.24it/s]
test: 89%|████████▉ | 56/63 [00:09<00:01, 5.64it/s]
test: 90%|█████████ | 57/63 [00:09<00:01, 5.48it/s]
test: 92%|█████████▏| 58/63 [00:09<00:00, 5.56it/s]
test: 94%|█████████▎| 59/63 [00:09<00:00, 5.87it/s]
test: 95%|█████████▌| 60/63 [00:09<00:00, 5.88it/s]
test: 97%|█████████▋| 61/63 [00:10<00:00, 5.89it/s]
test: 98%|█████████▊| 62/63 [00:10<00:00, 6.18it/s]
test: 100%|██████████| 63/63 [00:10<00:00, 6.85it/s]
epoch: 2 loss: 0.0063789 f1: 0.7750939: 100%|█████████▉| 1496/1497 [13:56<00:00, 1.99it/s]epoch: 2 loss: 0.0063789 f1: 0.7750939: 100%|██████████| 1497/1497 [13:56<00:00, 8.74s/it]
100%|██████████| 1998/1998 [00:01<00:00, 1878.33it/s]
test: 100%|██████████| 63/63 [00:10<00:00, 6.62it/s]
typing.weight parameter was not loaded!!!
typing.bias parameter was not loaded!!!
Results: %s {
"test_f1": 0.7750939345142244,
"test_precision": 0.7925356750823271,
"test_recall": 0.7584033613445378
}