Graph Vae Training - Log
Graph Vae Training - Log
00s - Debugger warning: It seems that frozen modules are being used, which
may
5.3s 2 0.00s - make the debugger miss breakpoints. Please pass -Xfrozen_modules=off
5.3s 3 0.00s - to python to disable frozen modules.
5.3s 4 0.00s - Note: Debugging will proceed. Set PYDEVD_DISABLE_FILE_VALIDATION=1
to disable this validation.
6.0s 5 0.00s - Debugger warning: It seems that frozen modules are being used, which
may
6.0s 6 0.00s - make the debugger miss breakpoints. Please pass -Xfrozen_modules=off
6.0s 7 0.00s - to python to disable frozen modules.
6.0s 8 0.00s - Note: Debugging will proceed. Set PYDEVD_DISABLE_FILE_VALIDATION=1
to disable this validation.
10.0s 9 Collecting torch_geometric
10.1s 10 Downloading torch_geometric-2.6.1-py3-none-any.whl.metadata (63 kB)
10.1s 11 #[?25l #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m0.0/63.1 kB#[0m #[31m?#[0m eta #[36m-:--:--#[0m
#[2K #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m63.1/63.1 kB#[0m
#[31m2.7 MB/s#[0m eta #[36m0:00:00#[0m
10.1s 12 #[?25hRequirement already satisfied: aiohttp in
/usr/local/lib/python3.11/dist-packages (from torch_geometric) (3.11.18)
10.1s 13 Requirement already satisfied: fsspec in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (2025.3.2)
10.1s 14 Requirement already satisfied: jinja2 in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (3.1.6)
10.1s 15 Requirement already satisfied: numpy in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (1.26.4)
10.1s 16 Requirement already satisfied: psutil>=5.8.0 in
/usr/local/lib/python3.11/dist-packages (from torch_geometric) (7.0.0)
10.1s 17 Requirement already satisfied: pyparsing in
/usr/local/lib/python3.11/dist-packages (from torch_geometric) (3.0.9)
10.1s 18 Requirement already satisfied: requests in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (2.32.3)
10.1s 19 Requirement already satisfied: tqdm in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (4.67.1)
10.1s 20 Requirement already satisfied: aiohappyeyeballs>=2.3.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (2.6.1)
10.2s 21 Requirement already satisfied: aiosignal>=1.1.2 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (1.3.2)
10.2s 22 Requirement already satisfied: attrs>=17.3.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (25.3.0)
10.2s 23 Requirement already satisfied: frozenlist>=1.1.1 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (1.6.0)
10.2s 24 Requirement already satisfied: multidict<7.0,>=4.5 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (6.4.3)
10.2s 25 Requirement already satisfied: propcache>=0.2.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (0.3.1)
10.2s 26 Requirement already satisfied: yarl<2.0,>=1.17.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (1.20.0)
10.2s 27 Requirement already satisfied: MarkupSafe>=2.0 in
/usr/local/lib/python3.11/dist-packages (from jinja2->torch_geometric) (3.0.2)
10.2s 28 Requirement already satisfied: mkl_fft in /usr/local/lib/python3.11/dist-
packages (from numpy->torch_geometric) (1.3.8)
10.2s 29 Requirement already satisfied: mkl_random in
/usr/local/lib/python3.11/dist-packages (from numpy->torch_geometric) (1.2.4)
10.2s 30 Requirement already satisfied: mkl_umath in
/usr/local/lib/python3.11/dist-packages (from numpy->torch_geometric) (0.1.1)
10.2s 31 Requirement already satisfied: mkl in /usr/local/lib/python3.11/dist-
packages (from numpy->torch_geometric) (2025.1.0)
10.2s 32 Requirement already satisfied: tbb4py in /usr/local/lib/python3.11/dist-
packages (from numpy->torch_geometric) (2022.1.0)
10.2s 33 Requirement already satisfied: mkl-service in
/usr/local/lib/python3.11/dist-packages (from numpy->torch_geometric) (2.4.1)
10.2s 34 Requirement already satisfied: charset-normalizer<4,>=2 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric) (3.4.2)
10.2s 35 Requirement already satisfied: idna<4,>=2.5 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric) (3.10)
10.2s 36 Requirement already satisfied: urllib3<3,>=1.21.1 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric) (2.4.0)
10.2s 37 Requirement already satisfied: certifi>=2017.4.17 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric)
(2025.4.26)
10.2s 38 Requirement already satisfied: intel-openmp<2026,>=2024 in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->torch_geometric)
(2024.2.0)
10.2s 39 Requirement already satisfied: tbb==2022.* in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->torch_geometric)
(2022.1.0)
10.2s 40 Requirement already satisfied: tcmlib==1.* in
/usr/local/lib/python3.11/dist-packages (from tbb==2022.*->mkl->numpy-
>torch_geometric) (1.3.0)
10.2s 41 Requirement already satisfied: intel-cmplr-lib-rt in
/usr/local/lib/python3.11/dist-packages (from mkl_umath->numpy->torch_geometric)
(2024.2.0)
10.2s 42 Requirement already satisfied: intel-cmplr-lib-ur==2024.2.0 in
/usr/local/lib/python3.11/dist-packages (from intel-openmp<2026,>=2024->mkl->numpy-
>torch_geometric) (2024.2.0)
10.2s 43 Downloading torch_geometric-2.6.1-py3-none-any.whl (1.1 MB)
10.3s 44 #[?25l #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m0.0/1.1
MB#[0m #[31m?#[0m eta #[36m-:--:--#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m1.1/1.1
MB#[0m #[31m34.0 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m1.1/1.1 MB#[0m
#[31m23.0 MB/s#[0m eta #[36m0:00:00#[0m
11.9s 45 #[?25hInstalling collected packages: torch_geometric
12.7s 46 Successfully installed torch_geometric-2.6.1
14.1s 47 Requirement already satisfied: tqdm in /usr/local/lib/python3.11/dist-
packages (4.67.1)
17.1s 48 Collecting rdkit
17.1s 49 Downloading rdkit-2025.3.3-cp311-cp311-manylinux_2_28_x86_64.whl.metadata
(4.0 kB)
17.2s 50 Requirement already satisfied: numpy in /usr/local/lib/python3.11/dist-
packages (from rdkit) (1.26.4)
17.2s 51 Requirement already satisfied: Pillow in /usr/local/lib/python3.11/dist-
packages (from rdkit) (11.1.0)
17.2s 52 Requirement already satisfied: mkl_fft in /usr/local/lib/python3.11/dist-
packages (from numpy->rdkit) (1.3.8)
17.2s 53 Requirement already satisfied: mkl_random in
/usr/local/lib/python3.11/dist-packages (from numpy->rdkit) (1.2.4)
17.2s 54 Requirement already satisfied: mkl_umath in
/usr/local/lib/python3.11/dist-packages (from numpy->rdkit) (0.1.1)
17.2s 55 Requirement already satisfied: mkl in /usr/local/lib/python3.11/dist-
packages (from numpy->rdkit) (2025.1.0)
17.2s 56 Requirement already satisfied: tbb4py in /usr/local/lib/python3.11/dist-
packages (from numpy->rdkit) (2022.1.0)
17.2s 57 Requirement already satisfied: mkl-service in
/usr/local/lib/python3.11/dist-packages (from numpy->rdkit) (2.4.1)
17.2s 58 Requirement already satisfied: intel-openmp<2026,>=2024 in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->rdkit) (2024.2.0)
17.2s 59 Requirement already satisfied: tbb==2022.* in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->rdkit) (2022.1.0)
17.2s 60 Requirement already satisfied: tcmlib==1.* in
/usr/local/lib/python3.11/dist-packages (from tbb==2022.*->mkl->numpy->rdkit)
(1.3.0)
17.2s 61 Requirement already satisfied: intel-cmplr-lib-rt in
/usr/local/lib/python3.11/dist-packages (from mkl_umath->numpy->rdkit) (2024.2.0)
17.2s 62 Requirement already satisfied: intel-cmplr-lib-ur==2024.2.0 in
/usr/local/lib/python3.11/dist-packages (from intel-openmp<2026,>=2024->mkl->numpy-
>rdkit) (2024.2.0)
17.2s 63 Downloading rdkit-2025.3.3-cp311-cp311-manylinux_2_28_x86_64.whl (34.9 MB)
17.6s 64 #[?25l #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m0.0/34.9
MB#[0m #[31m?#[0m eta #[36m-:--:--#[0m
#[2K #[91m╸#[0m#[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m0.5/34.9
MB#[0m #[31m14.2 MB/s#[0m eta #[36m0:00:03#[0m
#[2K #[91m━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m3.8/34.9 MB#[0m #[31m55.7 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m9.7/34.9 MB#[0m #[31m93.5 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m16.1/34.9 MB#[0m #[31m184.5 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━#[0m
#[32m22.1/34.9 MB#[0m #[31m178.2 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m#[90m━━━━━━━#[0m
#[32m28.4/34.9 MB#[0m #[31m181.2 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m34.9/34.9 MB#[0m
#[31m53.3 MB/s#[0m eta #[36m0:00:00#[0m
19.2s 65 #[?25hInstalling collected packages: rdkit
20.4s 66 Successfully installed rdkit-2025.3.3
46.2s 67 총 로드된 그래프 개수: 100000
46.2s 68 Logical CPU cores: 4
46.5s 69 Using 2 GPUs with PyG DataParallel
46.5s 70
/usr/local/lib/python3.11/dist-packages/torch_geometric/nn/data_parallel.py:60:
UserWarning: 'DataParallel' is usually much slower than 'DistributedDataParallel'
even on a single machine. Please consider switching to 'DistributedDataParallel'
for multi-GPU training.
46.5s 71 warnings.warn("'DataParallel' is usually much slower than "
91.1s 72 Train E01: 0%| | 0/25 [00:00<?, ?batch/s]
Train E01: 0%| | 0/25 [00:03<?, ?batch/s, N=16.3778, E=1.3234,
KL=0.0500, wKL=0.0250]
Train E01: 4%|▍ | 1/25 [00:03<01:21, 3.38s/batch, N=16.3778, E=1.3234,
KL=0.0500, wKL=0.0250]
Train E01: 4%|▍ | 1/25 [00:05<01:21, 3.38s/batch, N=14.4663, E=1.2715,
KL=0.0678, wKL=0.0250]
Train E01: 8%|▊ | 2/25 [00:05<00:55, 2.43s/batch, N=14.4663, E=1.2715,
KL=0.0678, wKL=0.0250]
Train E01: 8%|▊ | 2/25 [00:06<00:55, 2.43s/batch, N=12.2940, E=1.1990,
KL=0.1175, wKL=0.0250]
Train E01: 12%|█▏ | 3/25 [00:06<00:45, 2.08s/batch, N=12.2940, E=1.1990,
KL=0.1175, wKL=0.0250]
Train E01: 12%|█▏ | 3/25 [00:08<00:45, 2.08s/batch, N=9.4547, E=1.0925,
KL=0.2018, wKL=0.0250]
Train E01: 16%|█▌ | 4/25 [00:08<00:40, 1.91s/batch, N=9.4547, E=1.0925,
KL=0.2018, wKL=0.0250]
Train E01: 16%|█▌ | 4/25 [00:10<00:40, 1.91s/batch, N=6.4252, E=0.9867,
KL=0.3325, wKL=0.0250]
Train E01: 20%|██ | 5/25 [00:10<00:36, 1.82s/batch, N=6.4252, E=0.9867,
KL=0.3325, wKL=0.0250]
Train E01: 20%|██ | 5/25 [00:11<00:36, 1.82s/batch, N=3.5332, E=0.9469,
KL=0.5261, wKL=0.0250]
Train E01: 24%|██▍ | 6/25 [00:11<00:34, 1.81s/batch, N=3.5332, E=0.9469,
KL=0.5261, wKL=0.0250]
Train E01: 24%|██▍ | 6/25 [00:13<00:34, 1.81s/batch, N=1.8069, E=0.9927,
KL=0.7991, wKL=0.0250]
Train E01: 28%|██▊ | 7/25 [00:13<00:32, 1.78s/batch, N=1.8069, E=0.9927,
KL=0.7991, wKL=0.0250]
Train E01: 28%|██▊ | 7/25 [00:15<00:32, 1.78s/batch, N=2.3330, E=1.0492,
KL=1.1288, wKL=0.0250]
Train E01: 32%|███▏ | 8/25 [00:15<00:30, 1.79s/batch, N=2.3330, E=1.0492,
KL=1.1288, wKL=0.0250]
Train E01: 32%|███▏ | 8/25 [00:17<00:30, 1.79s/batch, N=3.8587, E=1.0467,
KL=1.3643, wKL=0.0250]
Train E01: 36%|███▌ | 9/25 [00:17<00:28, 1.78s/batch, N=3.8587, E=1.0467,
KL=1.3643, wKL=0.0250]
Train E01: 36%|███▌ | 9/25 [00:18<00:28, 1.78s/batch, N=3.8348, E=0.9965,
KL=1.4050, wKL=0.0250]
Train E01: 40%|████ | 10/25 [00:18<00:26, 1.76s/batch, N=3.8348, E=0.9965,
KL=1.4050, wKL=0.0250]
Train E01: 40%|████ | 10/25 [00:20<00:26, 1.76s/batch, N=2.8877, E=0.9400,
KL=1.3302, wKL=0.0250]
Train E01: 44%|████▍ | 11/25 [00:20<00:24, 1.75s/batch, N=2.8877, E=0.9400,
KL=1.3302, wKL=0.0250]
Train E01: 44%|████▍ | 11/25 [00:22<00:24, 1.75s/batch, N=2.0811, E=0.9114,
KL=1.2041, wKL=0.0250]
Train E01: 48%|████▊ | 12/25 [00:22<00:23, 1.80s/batch, N=2.0811, E=0.9114,
KL=1.2041, wKL=0.0250]
Train E01: 48%|████▊ | 12/25 [00:24<00:23, 1.80s/batch, N=1.7251, E=0.9285,
KL=1.0838, wKL=0.0250]
Train E01: 52%|█████▏ | 13/25 [00:24<00:21, 1.78s/batch, N=1.7251, E=0.9285,
KL=1.0838, wKL=0.0250]
Train E01: 52%|█████▏ | 13/25 [00:25<00:21, 1.78s/batch, N=1.7046, E=0.9585,
KL=0.9776, wKL=0.0250]
Train E01: 56%|█████▌ | 14/25 [00:25<00:19, 1.75s/batch, N=1.7046, E=0.9585,
KL=0.9776, wKL=0.0250]
Train E01: 56%|█████▌ | 14/25 [00:27<00:19, 1.75s/batch, N=1.9528, E=0.9682,
KL=0.9068, wKL=0.0250]
Train E01: 60%|██████ | 15/25 [00:27<00:17, 1.72s/batch, N=1.9528, E=0.9682,
KL=0.9068, wKL=0.0250]
Train E01: 60%|██████ | 15/25 [00:29<00:17, 1.72s/batch, N=2.1785, E=0.9598,
KL=0.8592, wKL=0.0250]
Train E01: 64%|██████▍ | 16/25 [00:29<00:15, 1.71s/batch, N=2.1785, E=0.9598,
KL=0.8592, wKL=0.0250]
Train E01: 64%|██████▍ | 16/25 [00:31<00:15, 1.71s/batch, N=2.3986, E=0.9408,
KL=0.8431, wKL=0.0250]
Train E01: 68%|██████▊ | 17/25 [00:31<00:13, 1.71s/batch, N=2.3986, E=0.9408,
KL=0.8431, wKL=0.0250]
Train E01: 68%|██████▊ | 17/25 [00:32<00:13, 1.71s/batch, N=2.4209, E=0.9233,
KL=0.8474, wKL=0.0250]
Train E01: 72%|███████▏ | 18/25 [00:32<00:11, 1.70s/batch, N=2.4209, E=0.9233,
KL=0.8474, wKL=0.0250]
Train E01: 72%|███████▏ | 18/25 [00:34<00:11, 1.70s/batch, N=2.1500, E=0.9119,
KL=0.8644, wKL=0.0250]
Train E01: 76%|███████▌ | 19/25 [00:34<00:10, 1.71s/batch, N=2.1500, E=0.9119,
KL=0.8644, wKL=0.0250]
Train E01: 76%|███████▌ | 19/25 [00:36<00:10, 1.71s/batch, N=1.9749, E=0.9128,
KL=0.9021, wKL=0.0250]
Train E01: 80%|████████ | 20/25 [00:36<00:09, 1.92s/batch, N=1.9749, E=0.9128,
KL=0.9021, wKL=0.0250]
Train E01: 80%|████████ | 20/25 [00:38<00:09, 1.92s/batch, N=1.9611, E=0.9146,
KL=0.9556, wKL=0.0250]
Train E01: 84%|████████▍ | 21/25 [00:38<00:07, 1.86s/batch, N=1.9611, E=0.9146,
KL=0.9556, wKL=0.0250]
Train E01: 84%|████████▍ | 21/25 [00:40<00:07, 1.86s/batch, N=1.6205, E=0.9207,
KL=1.0094, wKL=0.0250]
Train E01: 88%|████████▊ | 22/25 [00:40<00:05, 1.83s/batch, N=1.6205, E=0.9207,
KL=1.0094, wKL=0.0250]
Train E01: 88%|████████▊ | 22/25 [00:42<00:05, 1.83s/batch, N=1.6097, E=0.9339,
KL=1.0726, wKL=0.0250]
Train E01: 92%|█████████▏| 23/25 [00:42<00:03, 1.79s/batch, N=1.6097, E=0.9339,
KL=1.0726, wKL=0.0250]
Train E01: 92%|█████████▏| 23/25 [00:43<00:03, 1.79s/batch, N=1.6439, E=0.9437,
KL=1.1365, wKL=0.0250]
Train E01: 96%|█████████▌| 24/25 [00:43<00:01, 1.78s/batch, N=1.6439, E=0.9437,
KL=1.1365, wKL=0.0250]
Train E01: 96%|█████████▌| 24/25 [00:44<00:01, 1.78s/batch, N=1.7865, E=0.9421,
KL=1.1876, wKL=0.0250]
Train E01: 100%|██████████| 25/25 [00:44<00:00, 1.49s/batch, N=1.7865, E=0.9421,
KL=1.1876, wKL=0.0250]
Train E01: 100%|██████████| 25/25 [00:44<00:00, 1.78s/batch, N=1.7865, E=0.9421,
KL=1.1876, wKL=0.0250]
91.1s 73 [Epoch 001] Total: 5.2451 | N: 4.2366 | E: 0.9979 | KL(0.03×0.5): 0.8388
126.1s 74 Train E02: 0%| | 0/25 [00:00<?, ?batch/s]
Train E02: 0%| | 0/25 [00:01<?, ?batch/s, N=1.7216, E=0.9424, KL=1.2219,
wKL=0.0500]
Train E02: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.7216, E=0.9424,
KL=1.2219, wKL=0.0500]
Train E02: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.6840, E=0.9336,
KL=1.2407, wKL=0.0500]
Train E02: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.6840, E=0.9336,
KL=1.2407, wKL=0.0500]
Train E02: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.6079, E=0.9265,
KL=1.2377, wKL=0.0500]
Train E02: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.6079, E=0.9265,
KL=1.2377, wKL=0.0500]
Train E02: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.7404, E=0.9155,
KL=1.2212, wKL=0.0500]
Train E02: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.7404, E=0.9155,
KL=1.2212, wKL=0.0500]
Train E02: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.6856, E=0.9111,
KL=1.1927, wKL=0.0500]
Train E02: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.6856, E=0.9111,
KL=1.1927, wKL=0.0500]
Train E02: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.6592, E=0.9112,
KL=1.1547, wKL=0.0500]
Train E02: 24%|██▍ | 6/25 [00:08<00:27, 1.46s/batch, N=1.6592, E=0.9112,
KL=1.1547, wKL=0.0500]
Train E02: 24%|██▍ | 6/25 [00:10<00:27, 1.46s/batch, N=1.5871, E=0.9136,
KL=1.1167, wKL=0.0500]
Train E02: 28%|██▊ | 7/25 [00:10<00:26, 1.46s/batch, N=1.5871, E=0.9136,
KL=1.1167, wKL=0.0500]
Train E02: 28%|██▊ | 7/25 [00:11<00:26, 1.46s/batch, N=1.6743, E=0.9214,
KL=1.0801, wKL=0.0500]
Train E02: 32%|███▏ | 8/25 [00:11<00:24, 1.44s/batch, N=1.6743, E=0.9214,
KL=1.0801, wKL=0.0500]
Train E02: 32%|███▏ | 8/25 [00:12<00:24, 1.44s/batch, N=1.5113, E=0.9265,
KL=1.0507, wKL=0.0500]
Train E02: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.5113, E=0.9265,
KL=1.0507, wKL=0.0500]
Train E02: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.5577, E=0.9259,
KL=1.0311, wKL=0.0500]
Train E02: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.5577, E=0.9259,
KL=1.0311, wKL=0.0500]
Train E02: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.5992, E=0.9226,
KL=1.0194, wKL=0.0500]
Train E02: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5992, E=0.9226,
KL=1.0194, wKL=0.0500]
Train E02: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5659, E=0.9180,
KL=1.0132, wKL=0.0500]
Train E02: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.5659, E=0.9180,
KL=1.0132, wKL=0.0500]
Train E02: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.7059, E=0.9148,
KL=1.0194, wKL=0.0500]
Train E02: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.7059, E=0.9148,
KL=1.0194, wKL=0.0500]
Train E02: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5493, E=0.9080,
KL=1.0261, wKL=0.0500]
Train E02: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5493, E=0.9080,
KL=1.0261, wKL=0.0500]
Train E02: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5759, E=0.9085,
KL=1.0412, wKL=0.0500]
Train E02: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.5759, E=0.9085,
KL=1.0412, wKL=0.0500]
Train E02: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.5779, E=0.9083,
KL=1.0570, wKL=0.0500]
Train E02: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5779, E=0.9083,
KL=1.0570, wKL=0.0500]
Train E02: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.5443, E=0.9110,
KL=1.0723, wKL=0.0500]
Train E02: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.5443, E=0.9110,
KL=1.0723, wKL=0.0500]
Train E02: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.4236, E=0.9160,
KL=1.0849, wKL=0.0500]
Train E02: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4236, E=0.9160,
KL=1.0849, wKL=0.0500]
Train E02: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.5439, E=0.9182,
KL=1.0960, wKL=0.0500]
Train E02: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.5439, E=0.9182,
KL=1.0960, wKL=0.0500]
Train E02: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5512, E=0.9174,
KL=1.1042, wKL=0.0500]
Train E02: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.5512, E=0.9174,
KL=1.1042, wKL=0.0500]
Train E02: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5165, E=0.9108,
KL=1.1034, wKL=0.0500]
Train E02: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5165, E=0.9108,
KL=1.1034, wKL=0.0500]
Train E02: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.6609, E=0.9121,
KL=1.1046, wKL=0.0500]
Train E02: 88%|████████▊ | 22/25 [00:30<00:04, 1.40s/batch, N=1.6609, E=0.9121,
KL=1.1046, wKL=0.0500]
Train E02: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5487, E=0.9143,
KL=1.0942, wKL=0.0500]
Train E02: 92%|█████████▏| 23/25 [00:32<00:03, 1.58s/batch, N=1.5487, E=0.9143,
KL=1.0942, wKL=0.0500]
Train E02: 92%|█████████▏| 23/25 [00:34<00:03, 1.58s/batch, N=1.5304, E=0.9107,
KL=1.0848, wKL=0.0500]
Train E02: 96%|█████████▌| 24/25 [00:34<00:01, 1.53s/batch, N=1.5304, E=0.9107,
KL=1.0848, wKL=0.0500]
Train E02: 96%|█████████▌| 24/25 [00:35<00:01, 1.53s/batch, N=1.5503, E=0.9086,
KL=1.0723, wKL=0.0500]
Train E02: 100%|██████████| 25/25 [00:35<00:00, 1.25s/batch, N=1.5503, E=0.9086,
KL=1.0723, wKL=0.0500]
Train E02: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5503, E=0.9086,
KL=1.0723, wKL=0.0500]
126.1s 75 [Epoch 002] Total: 2.5408 | N: 1.5960 | E: 0.9173 | KL(0.05×0.5): 1.1023
160.5s 76 Train E03: 0%| | 0/25 [00:00<?, ?batch/s]
Train E03: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4887, E=0.9101, KL=1.0587,
wKL=0.0750]
Train E03: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4887, E=0.9101,
KL=1.0587, wKL=0.0750]
Train E03: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5675, E=0.9102,
KL=1.0493, wKL=0.0750]
Train E03: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5675, E=0.9102,
KL=1.0493, wKL=0.0750]
Train E03: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.7098, E=0.9097,
KL=1.0406, wKL=0.0750]
Train E03: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.7098, E=0.9097,
KL=1.0406, wKL=0.0750]
Train E03: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5659, E=0.9111,
KL=1.0251, wKL=0.0750]
Train E03: 16%|█▌ | 4/25 [00:05<00:30, 1.45s/batch, N=1.5659, E=0.9111,
KL=1.0251, wKL=0.0750]
Train E03: 16%|█▌ | 4/25 [00:07<00:30, 1.45s/batch, N=1.5190, E=0.9094,
KL=1.0161, wKL=0.0750]
Train E03: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.5190, E=0.9094,
KL=1.0161, wKL=0.0750]
Train E03: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.4910, E=0.9135,
KL=1.0106, wKL=0.0750]
Train E03: 24%|██▍ | 6/25 [00:08<00:27, 1.43s/batch, N=1.4910, E=0.9135,
KL=1.0106, wKL=0.0750]
Train E03: 24%|██▍ | 6/25 [00:09<00:27, 1.43s/batch, N=1.4577, E=0.9107,
KL=1.0042, wKL=0.0750]
Train E03: 28%|██▊ | 7/25 [00:09<00:25, 1.42s/batch, N=1.4577, E=0.9107,
KL=1.0042, wKL=0.0750]
Train E03: 28%|██▊ | 7/25 [00:11<00:25, 1.42s/batch, N=1.4954, E=0.9083,
KL=1.0020, wKL=0.0750]
Train E03: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.4954, E=0.9083,
KL=1.0020, wKL=0.0750]
Train E03: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.4463, E=0.9097,
KL=0.9997, wKL=0.0750]
Train E03: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4463, E=0.9097,
KL=0.9997, wKL=0.0750]
Train E03: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.4520, E=0.9138,
KL=0.9958, wKL=0.0750]
Train E03: 40%|████ | 10/25 [00:14<00:21, 1.40s/batch, N=1.4520, E=0.9138,
KL=0.9958, wKL=0.0750]
Train E03: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.5459, E=0.9066,
KL=0.9952, wKL=0.0750]
Train E03: 44%|████▍ | 11/25 [00:15<00:19, 1.42s/batch, N=1.5459, E=0.9066,
KL=0.9952, wKL=0.0750]
Train E03: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.5385, E=0.9100,
KL=0.9893, wKL=0.0750]
Train E03: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.5385, E=0.9100,
KL=0.9893, wKL=0.0750]
Train E03: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.5819, E=0.9125,
KL=0.9865, wKL=0.0750]
Train E03: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5819, E=0.9125,
KL=0.9865, wKL=0.0750]
Train E03: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.5051, E=0.9091,
KL=0.9792, wKL=0.0750]
Train E03: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5051, E=0.9091,
KL=0.9792, wKL=0.0750]
Train E03: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.6486, E=0.9072,
KL=0.9758, wKL=0.0750]
Train E03: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.6486, E=0.9072,
KL=0.9758, wKL=0.0750]
Train E03: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5611, E=0.9140,
KL=0.9682, wKL=0.0750]
Train E03: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.5611, E=0.9140,
KL=0.9682, wKL=0.0750]
Train E03: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5306, E=0.9119,
KL=0.9608, wKL=0.0750]
Train E03: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5306, E=0.9119,
KL=0.9608, wKL=0.0750]
Train E03: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4701, E=0.9095,
KL=0.9516, wKL=0.0750]
Train E03: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4701, E=0.9095,
KL=0.9516, wKL=0.0750]
Train E03: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.5197, E=0.9085,
KL=0.9443, wKL=0.0750]
Train E03: 76%|███████▌ | 19/25 [00:26<00:08, 1.41s/batch, N=1.5197, E=0.9085,
KL=0.9443, wKL=0.0750]
Train E03: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5303, E=0.9090,
KL=0.9365, wKL=0.0750]
Train E03: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5303, E=0.9090,
KL=0.9365, wKL=0.0750]
Train E03: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.5594, E=0.9114,
KL=0.9334, wKL=0.0750]
Train E03: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5594, E=0.9114,
KL=0.9334, wKL=0.0750]
Train E03: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5389, E=0.9087,
KL=0.9238, wKL=0.0750]
Train E03: 88%|████████▊ | 22/25 [00:30<00:04, 1.41s/batch, N=1.5389, E=0.9087,
KL=0.9238, wKL=0.0750]
Train E03: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5034, E=0.9084,
KL=0.9167, wKL=0.0750]
Train E03: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5034, E=0.9084,
KL=0.9167, wKL=0.0750]
Train E03: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.5665, E=0.9107,
KL=0.9115, wKL=0.0750]
Train E03: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.5665, E=0.9107,
KL=0.9115, wKL=0.0750]
Train E03: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4971, E=0.9087,
KL=0.9031, wKL=0.0750]
Train E03: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4971, E=0.9087,
KL=0.9031, wKL=0.0750]
Train E03: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.4971, E=0.9087,
KL=0.9031, wKL=0.0750]
160.5s 77 [Epoch 003] Total: 2.4794 | N: 1.5324 | E: 0.9101 | KL(0.07×0.5): 0.9809
195.8s 78 Train E04: 0%| | 0/25 [00:00<?, ?batch/s]
Train E04: 0%| | 0/25 [00:02<?, ?batch/s, N=1.4919, E=0.9081, KL=0.8946,
wKL=0.1000]
Train E04: 4%|▍ | 1/25 [00:02<00:50, 2.10s/batch, N=1.4919, E=0.9081,
KL=0.8946, wKL=0.1000]
Train E04: 4%|▍ | 1/25 [00:03<00:50, 2.10s/batch, N=1.5228, E=0.9096,
KL=0.8925, wKL=0.1000]
Train E04: 8%|▊ | 2/25 [00:03<00:40, 1.74s/batch, N=1.5228, E=0.9096,
KL=0.8925, wKL=0.1000]
Train E04: 8%|▊ | 2/25 [00:04<00:40, 1.74s/batch, N=1.6339, E=0.9111,
KL=0.8877, wKL=0.1000]
Train E04: 12%|█▏ | 3/25 [00:04<00:34, 1.58s/batch, N=1.6339, E=0.9111,
KL=0.8877, wKL=0.1000]
Train E04: 12%|█▏ | 3/25 [00:06<00:34, 1.58s/batch, N=1.4708, E=0.9086,
KL=0.8760, wKL=0.1000]
Train E04: 16%|█▌ | 4/25 [00:06<00:32, 1.53s/batch, N=1.4708, E=0.9086,
KL=0.8760, wKL=0.1000]
Train E04: 16%|█▌ | 4/25 [00:07<00:32, 1.53s/batch, N=1.5492, E=0.9121,
KL=0.8724, wKL=0.1000]
Train E04: 20%|██ | 5/25 [00:07<00:29, 1.47s/batch, N=1.5492, E=0.9121,
KL=0.8724, wKL=0.1000]
Train E04: 20%|██ | 5/25 [00:09<00:29, 1.47s/batch, N=1.5148, E=0.9088,
KL=0.8661, wKL=0.1000]
Train E04: 24%|██▍ | 6/25 [00:09<00:27, 1.45s/batch, N=1.5148, E=0.9088,
KL=0.8661, wKL=0.1000]
Train E04: 24%|██▍ | 6/25 [00:10<00:27, 1.45s/batch, N=1.6846, E=0.9087,
KL=0.8655, wKL=0.1000]
Train E04: 28%|██▊ | 7/25 [00:10<00:25, 1.43s/batch, N=1.6846, E=0.9087,
KL=0.8655, wKL=0.1000]
Train E04: 28%|██▊ | 7/25 [00:12<00:25, 1.43s/batch, N=1.4793, E=0.9089,
KL=0.8535, wKL=0.1000]
Train E04: 32%|███▏ | 8/25 [00:12<00:24, 1.44s/batch, N=1.4793, E=0.9089,
KL=0.8535, wKL=0.1000]
Train E04: 32%|███▏ | 8/25 [00:13<00:24, 1.44s/batch, N=1.6417, E=0.9090,
KL=0.8506, wKL=0.1000]
Train E04: 36%|███▌ | 9/25 [00:13<00:22, 1.43s/batch, N=1.6417, E=0.9090,
KL=0.8506, wKL=0.1000]
Train E04: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4429, E=0.9072,
KL=0.8400, wKL=0.1000]
Train E04: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.4429, E=0.9072,
KL=0.8400, wKL=0.1000]
Train E04: 40%|████ | 10/25 [00:16<00:21, 1.42s/batch, N=1.4622, E=0.9084,
KL=0.8355, wKL=0.1000]
Train E04: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4622, E=0.9084,
KL=0.8355, wKL=0.1000]
Train E04: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.5165, E=0.9050,
KL=0.8283, wKL=0.1000]
Train E04: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.5165, E=0.9050,
KL=0.8283, wKL=0.1000]
Train E04: 48%|████▊ | 12/25 [00:19<00:18, 1.41s/batch, N=1.5154, E=0.9093,
KL=0.8258, wKL=0.1000]
Train E04: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5154, E=0.9093,
KL=0.8258, wKL=0.1000]
Train E04: 52%|█████▏ | 13/25 [00:20<00:16, 1.40s/batch, N=1.5523, E=0.9083,
KL=0.8189, wKL=0.1000]
Train E04: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.5523, E=0.9083,
KL=0.8189, wKL=0.1000]
Train E04: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5825, E=0.9117,
KL=0.8147, wKL=0.1000]
Train E04: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.5825, E=0.9117,
KL=0.8147, wKL=0.1000]
Train E04: 60%|██████ | 15/25 [00:23<00:14, 1.40s/batch, N=1.5627, E=0.9093,
KL=0.8064, wKL=0.1000]
Train E04: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5627, E=0.9093,
KL=0.8064, wKL=0.1000]
Train E04: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4209, E=0.9089,
KL=0.7959, wKL=0.1000]
Train E04: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4209, E=0.9089,
KL=0.7959, wKL=0.1000]
Train E04: 68%|██████▊ | 17/25 [00:26<00:11, 1.40s/batch, N=1.4956, E=0.9084,
KL=0.7935, wKL=0.1000]
Train E04: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4956, E=0.9084,
KL=0.7935, wKL=0.1000]
Train E04: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5084, E=0.9093,
KL=0.7862, wKL=0.1000]
Train E04: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5084, E=0.9093,
KL=0.7862, wKL=0.1000]
Train E04: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4661, E=0.9107,
KL=0.7790, wKL=0.1000]
Train E04: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4661, E=0.9107,
KL=0.7790, wKL=0.1000]
Train E04: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.5435, E=0.9080,
KL=0.7766, wKL=0.1000]
Train E04: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5435, E=0.9080,
KL=0.7766, wKL=0.1000]
Train E04: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.6161, E=0.9101,
KL=0.7749, wKL=0.1000]
Train E04: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.6161, E=0.9101,
KL=0.7749, wKL=0.1000]
Train E04: 88%|████████▊ | 22/25 [00:33<00:04, 1.40s/batch, N=1.4536, E=0.9089,
KL=0.7679, wKL=0.1000]
Train E04: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4536, E=0.9089,
KL=0.7679, wKL=0.1000]
Train E04: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4589, E=0.9092,
KL=0.7637, wKL=0.1000]
Train E04: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.4589, E=0.9092,
KL=0.7637, wKL=0.1000]
Train E04: 96%|█████████▌| 24/25 [00:35<00:01, 1.44s/batch, N=1.5813, E=0.9093,
KL=0.7598, wKL=0.1000]
Train E04: 100%|██████████| 25/25 [00:35<00:00, 1.22s/batch, N=1.5813, E=0.9093,
KL=0.7598, wKL=0.1000]
Train E04: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5813, E=0.9093,
KL=0.7598, wKL=0.1000]
195.8s 79 [Epoch 004] Total: 2.4758 | N: 1.5254 | E: 0.9091 | KL(0.10×0.5): 0.8266
230.9s 80 Train E05: 0%| | 0/25 [00:00<?, ?batch/s]
Train E05: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5352, E=0.9091, KL=0.7598,
wKL=0.1250]
Train E05: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5352, E=0.9091,
KL=0.7598, wKL=0.1250]
Train E05: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4937, E=0.9088,
KL=0.7541, wKL=0.1250]
Train E05: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4937, E=0.9088,
KL=0.7541, wKL=0.1250]
Train E05: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5630, E=0.9097,
KL=0.7527, wKL=0.1250]
Train E05: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5630, E=0.9097,
KL=0.7527, wKL=0.1250]
Train E05: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5584, E=0.9069,
KL=0.7492, wKL=0.1250]
Train E05: 16%|█▌ | 4/25 [00:05<00:29, 1.40s/batch, N=1.5584, E=0.9069,
KL=0.7492, wKL=0.1250]
Train E05: 16%|█▌ | 4/25 [00:06<00:29, 1.40s/batch, N=1.4394, E=0.9086,
KL=0.7398, wKL=0.1250]
Train E05: 20%|██ | 5/25 [00:06<00:28, 1.40s/batch, N=1.4394, E=0.9086,
KL=0.7398, wKL=0.1250]
Train E05: 20%|██ | 5/25 [00:09<00:28, 1.40s/batch, N=1.4983, E=0.9105,
KL=0.7357, wKL=0.1250]
Train E05: 24%|██▍ | 6/25 [00:09<00:30, 1.62s/batch, N=1.4983, E=0.9105,
KL=0.7357, wKL=0.1250]
Train E05: 24%|██▍ | 6/25 [00:10<00:30, 1.62s/batch, N=1.5274, E=0.9065,
KL=0.7334, wKL=0.1250]
Train E05: 28%|██▊ | 7/25 [00:10<00:27, 1.54s/batch, N=1.5274, E=0.9065,
KL=0.7334, wKL=0.1250]
Train E05: 28%|██▊ | 7/25 [00:11<00:27, 1.54s/batch, N=1.4989, E=0.9075,
KL=0.7254, wKL=0.1250]
Train E05: 32%|███▏ | 8/25 [00:11<00:25, 1.49s/batch, N=1.4989, E=0.9075,
KL=0.7254, wKL=0.1250]
Train E05: 32%|███▏ | 8/25 [00:13<00:25, 1.49s/batch, N=1.6445, E=0.9059,
KL=0.7254, wKL=0.1250]
Train E05: 36%|███▌ | 9/25 [00:13<00:23, 1.46s/batch, N=1.6445, E=0.9059,
KL=0.7254, wKL=0.1250]
Train E05: 36%|███▌ | 9/25 [00:14<00:23, 1.46s/batch, N=1.5174, E=0.9107,
KL=0.7170, wKL=0.1250]
Train E05: 40%|████ | 10/25 [00:14<00:21, 1.45s/batch, N=1.5174, E=0.9107,
KL=0.7170, wKL=0.1250]
Train E05: 40%|████ | 10/25 [00:15<00:21, 1.45s/batch, N=1.4871, E=0.9095,
KL=0.7117, wKL=0.1250]
Train E05: 44%|████▍ | 11/25 [00:15<00:20, 1.43s/batch, N=1.4871, E=0.9095,
KL=0.7117, wKL=0.1250]
Train E05: 44%|████▍ | 11/25 [00:17<00:20, 1.43s/batch, N=1.4996, E=0.9079,
KL=0.7057, wKL=0.1250]
Train E05: 48%|████▊ | 12/25 [00:17<00:18, 1.42s/batch, N=1.4996, E=0.9079,
KL=0.7057, wKL=0.1250]
Train E05: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.5169, E=0.9118,
KL=0.7030, wKL=0.1250]
Train E05: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5169, E=0.9118,
KL=0.7030, wKL=0.1250]
Train E05: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.5501, E=0.9077,
KL=0.7008, wKL=0.1250]
Train E05: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.5501, E=0.9077,
KL=0.7008, wKL=0.1250]
Train E05: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.6121, E=0.9076,
KL=0.6999, wKL=0.1250]
Train E05: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.6121, E=0.9076,
KL=0.6999, wKL=0.1250]
Train E05: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4994, E=0.9103,
KL=0.6936, wKL=0.1250]
Train E05: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4994, E=0.9103,
KL=0.6936, wKL=0.1250]
Train E05: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4794, E=0.9101,
KL=0.6906, wKL=0.1250]
Train E05: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4794, E=0.9101,
KL=0.6906, wKL=0.1250]
Train E05: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4975, E=0.9085,
KL=0.6867, wKL=0.1250]
Train E05: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4975, E=0.9085,
KL=0.6867, wKL=0.1250]
Train E05: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4718, E=0.9120,
KL=0.6830, wKL=0.1250]
Train E05: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.4718, E=0.9120,
KL=0.6830, wKL=0.1250]
Train E05: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5897, E=0.9087,
KL=0.6813, wKL=0.1250]
Train E05: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5897, E=0.9087,
KL=0.6813, wKL=0.1250]
Train E05: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5195, E=0.9116,
KL=0.6753, wKL=0.1250]
Train E05: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5195, E=0.9116,
KL=0.6753, wKL=0.1250]
Train E05: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5650, E=0.9081,
KL=0.6727, wKL=0.1250]
Train E05: 88%|████████▊ | 22/25 [00:31<00:04, 1.47s/batch, N=1.5650, E=0.9081,
KL=0.6727, wKL=0.1250]
Train E05: 88%|████████▊ | 22/25 [00:32<00:04, 1.47s/batch, N=1.4760, E=0.9096,
KL=0.6654, wKL=0.1250]
Train E05: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.4760, E=0.9096,
KL=0.6654, wKL=0.1250]
Train E05: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.5494, E=0.9055,
KL=0.6638, wKL=0.1250]
Train E05: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.5494, E=0.9055,
KL=0.6638, wKL=0.1250]
Train E05: 96%|█████████▌| 24/25 [00:35<00:01, 1.43s/batch, N=1.3126, E=0.9043,
KL=0.6551, wKL=0.1250]
Train E05: 100%|██████████| 25/25 [00:35<00:00, 1.20s/batch, N=1.3126, E=0.9043,
KL=0.6551, wKL=0.1250]
Train E05: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.3126, E=0.9043,
KL=0.6551, wKL=0.1250]
230.9s 81 [Epoch 005] Total: 2.4741 | N: 1.5210 | E: 0.9088 | KL(0.12×0.5): 0.7085
266.1s 82 Train E06: 0%| | 0/25 [00:00<?, ?batch/s]
Train E06: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5278, E=0.9074, KL=0.6585,
wKL=0.1500]
Train E06: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5278, E=0.9074,
KL=0.6585, wKL=0.1500]
Train E06: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.5546, E=0.9078,
KL=0.6563, wKL=0.1500]
Train E06: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5546, E=0.9078,
KL=0.6563, wKL=0.1500]
Train E06: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.4643, E=0.9064,
KL=0.6520, wKL=0.1500]
Train E06: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4643, E=0.9064,
KL=0.6520, wKL=0.1500]
Train E06: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.6365, E=0.9066,
KL=0.6535, wKL=0.1500]
Train E06: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.6365, E=0.9066,
KL=0.6535, wKL=0.1500]
Train E06: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5112, E=0.9068,
KL=0.6485, wKL=0.1500]
Train E06: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5112, E=0.9068,
KL=0.6485, wKL=0.1500]
Train E06: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5094, E=0.9100,
KL=0.6439, wKL=0.1500]
Train E06: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5094, E=0.9100,
KL=0.6439, wKL=0.1500]
Train E06: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.6024, E=0.9074,
KL=0.6449, wKL=0.1500]
Train E06: 28%|██▊ | 7/25 [00:09<00:25, 1.43s/batch, N=1.6024, E=0.9074,
KL=0.6449, wKL=0.1500]
Train E06: 28%|██▊ | 7/25 [00:11<00:25, 1.43s/batch, N=1.5592, E=0.9093,
KL=0.6426, wKL=0.1500]
Train E06: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5592, E=0.9093,
KL=0.6426, wKL=0.1500]
Train E06: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.5442, E=0.9066,
KL=0.6378, wKL=0.1500]
Train E06: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.5442, E=0.9066,
KL=0.6378, wKL=0.1500]
Train E06: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5272, E=0.9080,
KL=0.6323, wKL=0.1500]
Train E06: 40%|████ | 10/25 [00:14<00:23, 1.60s/batch, N=1.5272, E=0.9080,
KL=0.6323, wKL=0.1500]
Train E06: 40%|████ | 10/25 [00:16<00:23, 1.60s/batch, N=1.5376, E=0.9090,
KL=0.6292, wKL=0.1500]
Train E06: 44%|████▍ | 11/25 [00:16<00:21, 1.53s/batch, N=1.5376, E=0.9090,
KL=0.6292, wKL=0.1500]
Train E06: 44%|████▍ | 11/25 [00:17<00:21, 1.53s/batch, N=1.5777, E=0.9072,
KL=0.6285, wKL=0.1500]
Train E06: 48%|████▊ | 12/25 [00:17<00:19, 1.50s/batch, N=1.5777, E=0.9072,
KL=0.6285, wKL=0.1500]
Train E06: 48%|████▊ | 12/25 [00:18<00:19, 1.50s/batch, N=1.5136, E=0.9070,
KL=0.6209, wKL=0.1500]
Train E06: 52%|█████▏ | 13/25 [00:18<00:17, 1.46s/batch, N=1.5136, E=0.9070,
KL=0.6209, wKL=0.1500]
Train E06: 52%|█████▏ | 13/25 [00:20<00:17, 1.46s/batch, N=1.4749, E=0.9081,
KL=0.6189, wKL=0.1500]
Train E06: 56%|█████▌ | 14/25 [00:20<00:16, 1.46s/batch, N=1.4749, E=0.9081,
KL=0.6189, wKL=0.1500]
Train E06: 56%|█████▌ | 14/25 [00:21<00:16, 1.46s/batch, N=1.5319, E=0.9099,
KL=0.6171, wKL=0.1500]
Train E06: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.5319, E=0.9099,
KL=0.6171, wKL=0.1500]
Train E06: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.5148, E=0.9138,
KL=0.6159, wKL=0.1500]
Train E06: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.5148, E=0.9138,
KL=0.6159, wKL=0.1500]
Train E06: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.5262, E=0.9109,
KL=0.6101, wKL=0.1500]
Train E06: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.5262, E=0.9109,
KL=0.6101, wKL=0.1500]
Train E06: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.4758, E=0.9099,
KL=0.6075, wKL=0.1500]
Train E06: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4758, E=0.9099,
KL=0.6075, wKL=0.1500]
Train E06: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.4526, E=0.9077,
KL=0.6067, wKL=0.1500]
Train E06: 76%|███████▌ | 19/25 [00:27<00:08, 1.46s/batch, N=1.4526, E=0.9077,
KL=0.6067, wKL=0.1500]
Train E06: 76%|███████▌ | 19/25 [00:28<00:08, 1.46s/batch, N=1.5113, E=0.9078,
KL=0.6046, wKL=0.1500]
Train E06: 80%|████████ | 20/25 [00:28<00:07, 1.44s/batch, N=1.5113, E=0.9078,
KL=0.6046, wKL=0.1500]
Train E06: 80%|████████ | 20/25 [00:30<00:07, 1.44s/batch, N=1.4894, E=0.9093,
KL=0.6019, wKL=0.1500]
Train E06: 84%|████████▍ | 21/25 [00:30<00:05, 1.44s/batch, N=1.4894, E=0.9093,
KL=0.6019, wKL=0.1500]
Train E06: 84%|████████▍ | 21/25 [00:31<00:05, 1.44s/batch, N=1.4704, E=0.9106,
KL=0.6000, wKL=0.1500]
Train E06: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.4704, E=0.9106,
KL=0.6000, wKL=0.1500]
Train E06: 88%|████████▊ | 22/25 [00:33<00:04, 1.44s/batch, N=1.4772, E=0.9106,
KL=0.5981, wKL=0.1500]
Train E06: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4772, E=0.9106,
KL=0.5981, wKL=0.1500]
Train E06: 92%|█████████▏| 23/25 [00:34<00:02, 1.42s/batch, N=1.4279, E=0.9098,
KL=0.5927, wKL=0.1500]
Train E06: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4279, E=0.9098,
KL=0.5927, wKL=0.1500]
Train E06: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.5154, E=0.9113,
KL=0.5940, wKL=0.1500]
Train E06: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.5154, E=0.9113,
KL=0.5940, wKL=0.1500]
Train E06: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5154, E=0.9113,
KL=0.5940, wKL=0.1500]
266.1s 83 [Epoch 006] Total: 2.4730 | N: 1.5174 | E: 0.9087 | KL(0.15×0.5): 0.6254
301.2s 84 Train E07: 0%| | 0/25 [00:00<?, ?batch/s]
Train E07: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5009, E=0.9096, KL=0.5899,
wKL=0.1750]
Train E07: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5009, E=0.9096,
KL=0.5899, wKL=0.1750]
Train E07: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5950, E=0.9089,
KL=0.5922, wKL=0.1750]
Train E07: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.5950, E=0.9089,
KL=0.5922, wKL=0.1750]
Train E07: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4751, E=0.9052,
KL=0.5861, wKL=0.1750]
Train E07: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4751, E=0.9052,
KL=0.5861, wKL=0.1750]
Train E07: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5119, E=0.9115,
KL=0.5841, wKL=0.1750]
Train E07: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5119, E=0.9115,
KL=0.5841, wKL=0.1750]
Train E07: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4745, E=0.9074,
KL=0.5823, wKL=0.1750]
Train E07: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4745, E=0.9074,
KL=0.5823, wKL=0.1750]
Train E07: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4651, E=0.9084,
KL=0.5792, wKL=0.1750]
Train E07: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4651, E=0.9084,
KL=0.5792, wKL=0.1750]
Train E07: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5603, E=0.9081,
KL=0.5796, wKL=0.1750]
Train E07: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.5603, E=0.9081,
KL=0.5796, wKL=0.1750]
Train E07: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.5033, E=0.9087,
KL=0.5755, wKL=0.1750]
Train E07: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5033, E=0.9087,
KL=0.5755, wKL=0.1750]
Train E07: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.5339, E=0.9093,
KL=0.5742, wKL=0.1750]
Train E07: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5339, E=0.9093,
KL=0.5742, wKL=0.1750]
Train E07: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5276, E=0.9080,
KL=0.5712, wKL=0.1750]
Train E07: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5276, E=0.9080,
KL=0.5712, wKL=0.1750]
Train E07: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5078, E=0.9102,
KL=0.5665, wKL=0.1750]
Train E07: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5078, E=0.9102,
KL=0.5665, wKL=0.1750]
Train E07: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4465, E=0.9064,
KL=0.5595, wKL=0.1750]
Train E07: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4465, E=0.9064,
KL=0.5595, wKL=0.1750]
Train E07: 48%|████▊ | 12/25 [00:18<00:17, 1.38s/batch, N=1.4720, E=0.9075,
KL=0.5578, wKL=0.1750]
Train E07: 52%|█████▏ | 13/25 [00:18<00:18, 1.58s/batch, N=1.4720, E=0.9075,
KL=0.5578, wKL=0.1750]
Train E07: 52%|█████▏ | 13/25 [00:20<00:18, 1.58s/batch, N=1.5465, E=0.9065,
KL=0.5582, wKL=0.1750]
Train E07: 56%|█████▌ | 14/25 [00:20<00:16, 1.54s/batch, N=1.5465, E=0.9065,
KL=0.5582, wKL=0.1750]
Train E07: 56%|█████▌ | 14/25 [00:21<00:16, 1.54s/batch, N=1.5190, E=0.9098,
KL=0.5540, wKL=0.1750]
Train E07: 60%|██████ | 15/25 [00:21<00:15, 1.51s/batch, N=1.5190, E=0.9098,
KL=0.5540, wKL=0.1750]
Train E07: 60%|██████ | 15/25 [00:22<00:15, 1.51s/batch, N=1.4534, E=0.9071,
KL=0.5502, wKL=0.1750]
Train E07: 64%|██████▍ | 16/25 [00:22<00:13, 1.48s/batch, N=1.4534, E=0.9071,
KL=0.5502, wKL=0.1750]
Train E07: 64%|██████▍ | 16/25 [00:24<00:13, 1.48s/batch, N=1.5150, E=0.9059,
KL=0.5491, wKL=0.1750]
Train E07: 68%|██████▊ | 17/25 [00:24<00:12, 1.54s/batch, N=1.5150, E=0.9059,
KL=0.5491, wKL=0.1750]
Train E07: 68%|██████▊ | 17/25 [00:26<00:12, 1.54s/batch, N=1.5505, E=0.9097,
KL=0.5493, wKL=0.1750]
Train E07: 72%|███████▏ | 18/25 [00:26<00:10, 1.50s/batch, N=1.5505, E=0.9097,
KL=0.5493, wKL=0.1750]
Train E07: 72%|███████▏ | 18/25 [00:27<00:10, 1.50s/batch, N=1.6138, E=0.9095,
KL=0.5517, wKL=0.1750]
Train E07: 76%|███████▌ | 19/25 [00:27<00:08, 1.48s/batch, N=1.6138, E=0.9095,
KL=0.5517, wKL=0.1750]
Train E07: 76%|███████▌ | 19/25 [00:28<00:08, 1.48s/batch, N=1.4713, E=0.9096,
KL=0.5429, wKL=0.1750]
Train E07: 80%|████████ | 20/25 [00:28<00:07, 1.46s/batch, N=1.4713, E=0.9096,
KL=0.5429, wKL=0.1750]
Train E07: 80%|████████ | 20/25 [00:30<00:07, 1.46s/batch, N=1.5617, E=0.9105,
KL=0.5440, wKL=0.1750]
Train E07: 84%|████████▍ | 21/25 [00:30<00:05, 1.44s/batch, N=1.5617, E=0.9105,
KL=0.5440, wKL=0.1750]
Train E07: 84%|████████▍ | 21/25 [00:31<00:05, 1.44s/batch, N=1.5187, E=0.9084,
KL=0.5384, wKL=0.1750]
Train E07: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.5187, E=0.9084,
KL=0.5384, wKL=0.1750]
Train E07: 88%|████████▊ | 22/25 [00:33<00:04, 1.44s/batch, N=1.5278, E=0.9096,
KL=0.5393, wKL=0.1750]
Train E07: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.5278, E=0.9096,
KL=0.5393, wKL=0.1750]
Train E07: 92%|█████████▏| 23/25 [00:34<00:02, 1.42s/batch, N=1.4801, E=0.9059,
KL=0.5331, wKL=0.1750]
Train E07: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4801, E=0.9059,
KL=0.5331, wKL=0.1750]
Train E07: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.5016, E=0.9108,
KL=0.5318, wKL=0.1750]
Train E07: 100%|██████████| 25/25 [00:35<00:00, 1.18s/batch, N=1.5016, E=0.9108,
KL=0.5318, wKL=0.1750]
Train E07: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5016, E=0.9108,
KL=0.5318, wKL=0.1750]
301.2s 85 [Epoch 007] Total: 2.4713 | N: 1.5136 | E: 0.9084 | KL(0.17×0.5): 0.5623
336.5s 86 Train E08: 0%| | 0/25 [00:00<?, ?batch/s]
Train E08: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5724, E=0.9097, KL=0.5331,
wKL=0.2000]
Train E08: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5724, E=0.9097,
KL=0.5331, wKL=0.2000]
Train E08: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5567, E=0.9082,
KL=0.5304, wKL=0.2000]
Train E08: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5567, E=0.9082,
KL=0.5304, wKL=0.2000]
Train E08: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4774, E=0.9098,
KL=0.5264, wKL=0.2000]
Train E08: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4774, E=0.9098,
KL=0.5264, wKL=0.2000]
Train E08: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4698, E=0.9063,
KL=0.5226, wKL=0.2000]
Train E08: 16%|█▌ | 4/25 [00:05<00:29, 1.40s/batch, N=1.4698, E=0.9063,
KL=0.5226, wKL=0.2000]
Train E08: 16%|█▌ | 4/25 [00:06<00:29, 1.40s/batch, N=1.5310, E=0.9075,
KL=0.5223, wKL=0.2000]
Train E08: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5310, E=0.9075,
KL=0.5223, wKL=0.2000]
Train E08: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.5091, E=0.9071,
KL=0.5220, wKL=0.2000]
Train E08: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5091, E=0.9071,
KL=0.5220, wKL=0.2000]
Train E08: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5232, E=0.9106,
KL=0.5167, wKL=0.2000]
Train E08: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.5232, E=0.9106,
KL=0.5167, wKL=0.2000]
Train E08: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.4647, E=0.9099,
KL=0.5146, wKL=0.2000]
Train E08: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4647, E=0.9099,
KL=0.5146, wKL=0.2000]
Train E08: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5403, E=0.9082,
KL=0.5144, wKL=0.2000]
Train E08: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5403, E=0.9082,
KL=0.5144, wKL=0.2000]
Train E08: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.5633, E=0.9072,
KL=0.5148, wKL=0.2000]
Train E08: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.5633, E=0.9072,
KL=0.5148, wKL=0.2000]
Train E08: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.5001, E=0.9083,
KL=0.5095, wKL=0.2000]
Train E08: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.5001, E=0.9083,
KL=0.5095, wKL=0.2000]
Train E08: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.5121, E=0.9070,
KL=0.5085, wKL=0.2000]
Train E08: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.5121, E=0.9070,
KL=0.5085, wKL=0.2000]
Train E08: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5549, E=0.9069,
KL=0.5061, wKL=0.2000]
Train E08: 52%|█████▏ | 13/25 [00:18<00:16, 1.42s/batch, N=1.5549, E=0.9069,
KL=0.5061, wKL=0.2000]
Train E08: 52%|█████▏ | 13/25 [00:19<00:16, 1.42s/batch, N=1.5448, E=0.9084,
KL=0.5035, wKL=0.2000]
Train E08: 56%|█████▌ | 14/25 [00:19<00:15, 1.43s/batch, N=1.5448, E=0.9084,
KL=0.5035, wKL=0.2000]
Train E08: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.5042, E=0.9080,
KL=0.5001, wKL=0.2000]
Train E08: 60%|██████ | 15/25 [00:21<00:14, 1.46s/batch, N=1.5042, E=0.9080,
KL=0.5001, wKL=0.2000]
Train E08: 60%|██████ | 15/25 [00:22<00:14, 1.46s/batch, N=1.4334, E=0.9122,
KL=0.4966, wKL=0.2000]
Train E08: 64%|██████▍ | 16/25 [00:22<00:13, 1.45s/batch, N=1.4334, E=0.9122,
KL=0.4966, wKL=0.2000]
Train E08: 64%|██████▍ | 16/25 [00:24<00:13, 1.45s/batch, N=1.4171, E=0.9093,
KL=0.4909, wKL=0.2000]
Train E08: 68%|██████▊ | 17/25 [00:24<00:13, 1.64s/batch, N=1.4171, E=0.9093,
KL=0.4909, wKL=0.2000]
Train E08: 68%|██████▊ | 17/25 [00:26<00:13, 1.64s/batch, N=1.5226, E=0.9073,
KL=0.4930, wKL=0.2000]
Train E08: 72%|███████▏ | 18/25 [00:26<00:11, 1.57s/batch, N=1.5226, E=0.9073,
KL=0.4930, wKL=0.2000]
Train E08: 72%|███████▏ | 18/25 [00:27<00:11, 1.57s/batch, N=1.5728, E=0.9074,
KL=0.4952, wKL=0.2000]
Train E08: 76%|███████▌ | 19/25 [00:27<00:09, 1.52s/batch, N=1.5728, E=0.9074,
KL=0.4952, wKL=0.2000]
Train E08: 76%|███████▌ | 19/25 [00:28<00:09, 1.52s/batch, N=1.5308, E=0.9089,
KL=0.4907, wKL=0.2000]
Train E08: 80%|████████ | 20/25 [00:28<00:07, 1.48s/batch, N=1.5308, E=0.9089,
KL=0.4907, wKL=0.2000]
Train E08: 80%|████████ | 20/25 [00:30<00:07, 1.48s/batch, N=1.4863, E=0.9056,
KL=0.4877, wKL=0.2000]
Train E08: 84%|████████▍ | 21/25 [00:30<00:05, 1.47s/batch, N=1.4863, E=0.9056,
KL=0.4877, wKL=0.2000]
Train E08: 84%|████████▍ | 21/25 [00:31<00:05, 1.47s/batch, N=1.5131, E=0.9081,
KL=0.4868, wKL=0.2000]
Train E08: 88%|████████▊ | 22/25 [00:31<00:04, 1.45s/batch, N=1.5131, E=0.9081,
KL=0.4868, wKL=0.2000]
Train E08: 88%|████████▊ | 22/25 [00:33<00:04, 1.45s/batch, N=1.5669, E=0.9091,
KL=0.4863, wKL=0.2000]
Train E08: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.5669, E=0.9091,
KL=0.4863, wKL=0.2000]
Train E08: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.5075, E=0.9076,
KL=0.4836, wKL=0.2000]
Train E08: 96%|█████████▌| 24/25 [00:34<00:01, 1.47s/batch, N=1.5075, E=0.9076,
KL=0.4836, wKL=0.2000]
Train E08: 96%|█████████▌| 24/25 [00:35<00:01, 1.47s/batch, N=1.3417, E=0.9077,
KL=0.4760, wKL=0.2000]
Train E08: 100%|██████████| 25/25 [00:35<00:00, 1.21s/batch, N=1.3417, E=0.9077,
KL=0.4760, wKL=0.2000]
Train E08: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.3417, E=0.9077,
KL=0.4760, wKL=0.2000]
336.5s 87 [Epoch 008] Total: 2.4715 | N: 1.5127 | E: 0.9083 | KL(0.20×0.5): 0.5060
371.7s 88 Train E09: 0%| | 0/25 [00:00<?, ?batch/s]
Train E09: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4321, E=0.9076, KL=0.4758,
wKL=0.2250]
Train E09: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4321, E=0.9076,
KL=0.4758, wKL=0.2250]
Train E09: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4787, E=0.9071,
KL=0.4755, wKL=0.2250]
Train E09: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4787, E=0.9071,
KL=0.4755, wKL=0.2250]
Train E09: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.5512, E=0.9064,
KL=0.4755, wKL=0.2250]
Train E09: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5512, E=0.9064,
KL=0.4755, wKL=0.2250]
Train E09: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5518, E=0.9114,
KL=0.4766, wKL=0.2250]
Train E09: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5518, E=0.9114,
KL=0.4766, wKL=0.2250]
Train E09: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5973, E=0.9080,
KL=0.4739, wKL=0.2250]
Train E09: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5973, E=0.9080,
KL=0.4739, wKL=0.2250]
Train E09: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.4820, E=0.9074,
KL=0.4710, wKL=0.2250]
Train E09: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4820, E=0.9074,
KL=0.4710, wKL=0.2250]
Train E09: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.4524, E=0.9067,
KL=0.4678, wKL=0.2250]
Train E09: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4524, E=0.9067,
KL=0.4678, wKL=0.2250]
Train E09: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.5318, E=0.9083,
KL=0.4692, wKL=0.2250]
Train E09: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5318, E=0.9083,
KL=0.4692, wKL=0.2250]
Train E09: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4136, E=0.9061,
KL=0.4603, wKL=0.2250]
Train E09: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4136, E=0.9061,
KL=0.4603, wKL=0.2250]
Train E09: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5337, E=0.9092,
KL=0.4624, wKL=0.2250]
Train E09: 40%|████ | 10/25 [00:13<00:21, 1.41s/batch, N=1.5337, E=0.9092,
KL=0.4624, wKL=0.2250]
Train E09: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.5145, E=0.9106,
KL=0.4593, wKL=0.2250]
Train E09: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5145, E=0.9106,
KL=0.4593, wKL=0.2250]
Train E09: 44%|████▍ | 11/25 [00:17<00:19, 1.41s/batch, N=1.5478, E=0.9084,
KL=0.4569, wKL=0.2250]
Train E09: 48%|████▊ | 12/25 [00:17<00:19, 1.50s/batch, N=1.5478, E=0.9084,
KL=0.4569, wKL=0.2250]
Train E09: 48%|████▊ | 12/25 [00:18<00:19, 1.50s/batch, N=1.3977, E=0.9088,
KL=0.4489, wKL=0.2250]
Train E09: 52%|█████▏ | 13/25 [00:18<00:17, 1.48s/batch, N=1.3977, E=0.9088,
KL=0.4489, wKL=0.2250]
Train E09: 52%|█████▏ | 13/25 [00:19<00:17, 1.48s/batch, N=1.5172, E=0.9097,
KL=0.4514, wKL=0.2250]
Train E09: 56%|█████▌ | 14/25 [00:19<00:15, 1.45s/batch, N=1.5172, E=0.9097,
KL=0.4514, wKL=0.2250]
Train E09: 56%|█████▌ | 14/25 [00:21<00:15, 1.45s/batch, N=1.6038, E=0.9063,
KL=0.4527, wKL=0.2250]
Train E09: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.6038, E=0.9063,
KL=0.4527, wKL=0.2250]
Train E09: 60%|██████ | 15/25 [00:22<00:14, 1.44s/batch, N=1.5834, E=0.9080,
KL=0.4499, wKL=0.2250]
Train E09: 64%|██████▍ | 16/25 [00:22<00:12, 1.43s/batch, N=1.5834, E=0.9080,
KL=0.4499, wKL=0.2250]
Train E09: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.6092, E=0.9077,
KL=0.4526, wKL=0.2250]
Train E09: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.6092, E=0.9077,
KL=0.4526, wKL=0.2250]
Train E09: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.5172, E=0.9066,
KL=0.4459, wKL=0.2250]
Train E09: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.5172, E=0.9066,
KL=0.4459, wKL=0.2250]
Train E09: 72%|███████▏ | 18/25 [00:27<00:10, 1.44s/batch, N=1.3617, E=0.9073,
KL=0.4378, wKL=0.2250]
Train E09: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.3617, E=0.9073,
KL=0.4378, wKL=0.2250]
Train E09: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5348, E=0.9109,
KL=0.4392, wKL=0.2250]
Train E09: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5348, E=0.9109,
KL=0.4392, wKL=0.2250]
Train E09: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.5045, E=0.9064,
KL=0.4363, wKL=0.2250]
Train E09: 84%|████████▍ | 21/25 [00:30<00:06, 1.59s/batch, N=1.5045, E=0.9064,
KL=0.4363, wKL=0.2250]
Train E09: 84%|████████▍ | 21/25 [00:31<00:06, 1.59s/batch, N=1.4986, E=0.9074,
KL=0.4331, wKL=0.2250]
Train E09: 88%|████████▊ | 22/25 [00:31<00:04, 1.53s/batch, N=1.4986, E=0.9074,
KL=0.4331, wKL=0.2250]
Train E09: 88%|████████▊ | 22/25 [00:33<00:04, 1.53s/batch, N=1.5344, E=0.9078,
KL=0.4335, wKL=0.2250]
Train E09: 92%|█████████▏| 23/25 [00:33<00:02, 1.49s/batch, N=1.5344, E=0.9078,
KL=0.4335, wKL=0.2250]
Train E09: 92%|█████████▏| 23/25 [00:34<00:02, 1.49s/batch, N=1.5347, E=0.9076,
KL=0.4352, wKL=0.2250]
Train E09: 96%|█████████▌| 24/25 [00:34<00:01, 1.46s/batch, N=1.5347, E=0.9076,
KL=0.4352, wKL=0.2250]
Train E09: 96%|█████████▌| 24/25 [00:35<00:01, 1.46s/batch, N=1.5151, E=0.9070,
KL=0.4336, wKL=0.2250]
Train E09: 100%|██████████| 25/25 [00:35<00:00, 1.21s/batch, N=1.5151, E=0.9070,
KL=0.4336, wKL=0.2250]
Train E09: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5151, E=0.9070,
KL=0.4336, wKL=0.2250]
371.7s 89 [Epoch 009] Total: 2.4711 | N: 1.5119 | E: 0.9080 | KL(0.23×0.5): 0.4555
406.2s 90 Train E10: 0%| | 0/25 [00:00<?, ?batch/s]
Train E10: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4690, E=0.9054, KL=0.4296,
wKL=0.2500]
Train E10: 4%|▍ | 1/25 [00:01<00:33, 1.40s/batch, N=1.4690, E=0.9054,
KL=0.4296, wKL=0.2500]
Train E10: 4%|▍ | 1/25 [00:02<00:33, 1.40s/batch, N=1.4634, E=0.9072,
KL=0.4265, wKL=0.2500]
Train E10: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4634, E=0.9072,
KL=0.4265, wKL=0.2500]
Train E10: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4797, E=0.9109,
KL=0.4262, wKL=0.2500]
Train E10: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4797, E=0.9109,
KL=0.4262, wKL=0.2500]
Train E10: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5110, E=0.9097,
KL=0.4245, wKL=0.2500]
Train E10: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5110, E=0.9097,
KL=0.4245, wKL=0.2500]
Train E10: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5853, E=0.9086,
KL=0.4267, wKL=0.2500]
Train E10: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5853, E=0.9086,
KL=0.4267, wKL=0.2500]
Train E10: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.6000, E=0.9051,
KL=0.4240, wKL=0.2500]
Train E10: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.6000, E=0.9051,
KL=0.4240, wKL=0.2500]
Train E10: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4176, E=0.9059,
KL=0.4163, wKL=0.2500]
Train E10: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4176, E=0.9059,
KL=0.4163, wKL=0.2500]
Train E10: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.4576, E=0.9106,
KL=0.4187, wKL=0.2500]
Train E10: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4576, E=0.9106,
KL=0.4187, wKL=0.2500]
Train E10: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5136, E=0.9086,
KL=0.4176, wKL=0.2500]
Train E10: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.5136, E=0.9086,
KL=0.4176, wKL=0.2500]
Train E10: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.5597, E=0.9090,
KL=0.4163, wKL=0.2500]
Train E10: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.5597, E=0.9090,
KL=0.4163, wKL=0.2500]
Train E10: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.5447, E=0.9070,
KL=0.4131, wKL=0.2500]
Train E10: 44%|████▍ | 11/25 [00:15<00:20, 1.45s/batch, N=1.5447, E=0.9070,
KL=0.4131, wKL=0.2500]
Train E10: 44%|████▍ | 11/25 [00:17<00:20, 1.45s/batch, N=1.4955, E=0.9083,
KL=0.4092, wKL=0.2500]
Train E10: 48%|████▊ | 12/25 [00:17<00:18, 1.43s/batch, N=1.4955, E=0.9083,
KL=0.4092, wKL=0.2500]
Train E10: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.4767, E=0.9067,
KL=0.4062, wKL=0.2500]
Train E10: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4767, E=0.9067,
KL=0.4062, wKL=0.2500]
Train E10: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.4844, E=0.9062,
KL=0.4063, wKL=0.2500]
Train E10: 56%|█████▌ | 14/25 [00:19<00:15, 1.42s/batch, N=1.4844, E=0.9062,
KL=0.4063, wKL=0.2500]
Train E10: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.5586, E=0.9089,
KL=0.4069, wKL=0.2500]
Train E10: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5586, E=0.9089,
KL=0.4069, wKL=0.2500]
Train E10: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.5103, E=0.9074,
KL=0.4062, wKL=0.2500]
Train E10: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.5103, E=0.9074,
KL=0.4062, wKL=0.2500]
Train E10: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5587, E=0.9036,
KL=0.4040, wKL=0.2500]
Train E10: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5587, E=0.9036,
KL=0.4040, wKL=0.2500]
Train E10: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.5260, E=0.9070,
KL=0.4021, wKL=0.2500]
Train E10: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5260, E=0.9070,
KL=0.4021, wKL=0.2500]
Train E10: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4618, E=0.9084,
KL=0.3976, wKL=0.2500]
Train E10: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4618, E=0.9084,
KL=0.3976, wKL=0.2500]
Train E10: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5870, E=0.9060,
KL=0.3978, wKL=0.2500]
Train E10: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5870, E=0.9060,
KL=0.3978, wKL=0.2500]
Train E10: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.5556, E=0.9082,
KL=0.3980, wKL=0.2500]
Train E10: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5556, E=0.9082,
KL=0.3980, wKL=0.2500]
Train E10: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5588, E=0.9043,
KL=0.3977, wKL=0.2500]
Train E10: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5588, E=0.9043,
KL=0.3977, wKL=0.2500]
Train E10: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4508, E=0.9054,
KL=0.3936, wKL=0.2500]
Train E10: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4508, E=0.9054,
KL=0.3936, wKL=0.2500]
Train E10: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4932, E=0.9082,
KL=0.3970, wKL=0.2500]
Train E10: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.4932, E=0.9082,
KL=0.3970, wKL=0.2500]
Train E10: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.3929, E=0.9011,
KL=0.3913, wKL=0.2500]
Train E10: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.3929, E=0.9011,
KL=0.3913, wKL=0.2500]
Train E10: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.3929, E=0.9011,
KL=0.3913, wKL=0.2500]
406.2s 91 [Epoch 010] Total: 2.4698 | N: 1.5113 | E: 0.9073 | KL(0.25×0.5): 0.4106
406.2s 92 Saved checkpoint: /kaggle/working/checkpoints/gvae_10_epoch010.pt
441.4s 93 Train E11: 0%| | 0/25 [00:00<?, ?batch/s]
Train E11: 0%| | 0/25 [00:02<?, ?batch/s, N=1.5345, E=0.9069, KL=0.3948,
wKL=0.2750]
Train E11: 4%|▍ | 1/25 [00:02<00:48, 2.00s/batch, N=1.5345, E=0.9069,
KL=0.3948, wKL=0.2750]
Train E11: 4%|▍ | 1/25 [00:03<00:48, 2.00s/batch, N=1.4803, E=0.9087,
KL=0.3910, wKL=0.2750]
Train E11: 8%|▊ | 2/25 [00:03<00:37, 1.63s/batch, N=1.4803, E=0.9087,
KL=0.3910, wKL=0.2750]
Train E11: 8%|▊ | 2/25 [00:04<00:37, 1.63s/batch, N=1.5300, E=0.9080,
KL=0.3887, wKL=0.2750]
Train E11: 12%|█▏ | 3/25 [00:04<00:34, 1.55s/batch, N=1.5300, E=0.9080,
KL=0.3887, wKL=0.2750]
Train E11: 12%|█▏ | 3/25 [00:06<00:34, 1.55s/batch, N=1.4870, E=0.9037,
KL=0.3850, wKL=0.2750]
Train E11: 16%|█▌ | 4/25 [00:06<00:30, 1.47s/batch, N=1.4870, E=0.9037,
KL=0.3850, wKL=0.2750]
Train E11: 16%|█▌ | 4/25 [00:07<00:30, 1.47s/batch, N=1.5864, E=0.9059,
KL=0.3862, wKL=0.2750]
Train E11: 20%|██ | 5/25 [00:07<00:28, 1.44s/batch, N=1.5864, E=0.9059,
KL=0.3862, wKL=0.2750]
Train E11: 20%|██ | 5/25 [00:08<00:28, 1.44s/batch, N=1.5561, E=0.9089,
KL=0.3830, wKL=0.2750]
Train E11: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.5561, E=0.9089,
KL=0.3830, wKL=0.2750]
Train E11: 24%|██▍ | 6/25 [00:10<00:26, 1.42s/batch, N=1.4536, E=0.9064,
KL=0.3812, wKL=0.2750]
Train E11: 28%|██▊ | 7/25 [00:10<00:26, 1.49s/batch, N=1.4536, E=0.9064,
KL=0.3812, wKL=0.2750]
Train E11: 28%|██▊ | 7/25 [00:12<00:26, 1.49s/batch, N=1.5450, E=0.9063,
KL=0.3824, wKL=0.2750]
Train E11: 32%|███▏ | 8/25 [00:12<00:25, 1.47s/batch, N=1.5450, E=0.9063,
KL=0.3824, wKL=0.2750]
Train E11: 32%|███▏ | 8/25 [00:13<00:25, 1.47s/batch, N=1.4679, E=0.9051,
KL=0.3778, wKL=0.2750]
Train E11: 36%|███▌ | 9/25 [00:13<00:23, 1.45s/batch, N=1.4679, E=0.9051,
KL=0.3778, wKL=0.2750]
Train E11: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.5206, E=0.9088,
KL=0.3784, wKL=0.2750]
Train E11: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.5206, E=0.9088,
KL=0.3784, wKL=0.2750]
Train E11: 40%|████ | 10/25 [00:16<00:21, 1.46s/batch, N=1.4415, E=0.9062,
KL=0.3729, wKL=0.2750]
Train E11: 44%|████▍ | 11/25 [00:16<00:20, 1.44s/batch, N=1.4415, E=0.9062,
KL=0.3729, wKL=0.2750]
Train E11: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.4638, E=0.9084,
KL=0.3720, wKL=0.2750]
Train E11: 48%|████▊ | 12/25 [00:17<00:18, 1.42s/batch, N=1.4638, E=0.9084,
KL=0.3720, wKL=0.2750]
Train E11: 48%|████▊ | 12/25 [00:19<00:18, 1.42s/batch, N=1.4451, E=0.9071,
KL=0.3712, wKL=0.2750]
Train E11: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4451, E=0.9071,
KL=0.3712, wKL=0.2750]
Train E11: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.5063, E=0.9057,
KL=0.3700, wKL=0.2750]
Train E11: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.5063, E=0.9057,
KL=0.3700, wKL=0.2750]
Train E11: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.5015, E=0.9045,
KL=0.3686, wKL=0.2750]
Train E11: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.5015, E=0.9045,
KL=0.3686, wKL=0.2750]
Train E11: 60%|██████ | 15/25 [00:23<00:14, 1.40s/batch, N=1.4404, E=0.9050,
KL=0.3650, wKL=0.2750]
Train E11: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4404, E=0.9050,
KL=0.3650, wKL=0.2750]
Train E11: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.5326, E=0.9045,
KL=0.3673, wKL=0.2750]
Train E11: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5326, E=0.9045,
KL=0.3673, wKL=0.2750]
Train E11: 68%|██████▊ | 17/25 [00:26<00:11, 1.44s/batch, N=1.4966, E=0.9034,
KL=0.3638, wKL=0.2750]
Train E11: 72%|███████▏ | 18/25 [00:26<00:09, 1.43s/batch, N=1.4966, E=0.9034,
KL=0.3638, wKL=0.2750]
Train E11: 72%|███████▏ | 18/25 [00:27<00:09, 1.43s/batch, N=1.5357, E=0.9048,
KL=0.3651, wKL=0.2750]
Train E11: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.5357, E=0.9048,
KL=0.3651, wKL=0.2750]
Train E11: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5272, E=0.9049,
KL=0.3638, wKL=0.2750]
Train E11: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5272, E=0.9049,
KL=0.3638, wKL=0.2750]
Train E11: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.5028, E=0.9075,
KL=0.3620, wKL=0.2750]
Train E11: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5028, E=0.9075,
KL=0.3620, wKL=0.2750]
Train E11: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5610, E=0.9036,
KL=0.3607, wKL=0.2750]
Train E11: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5610, E=0.9036,
KL=0.3607, wKL=0.2750]
Train E11: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.5753, E=0.9076,
KL=0.3611, wKL=0.2750]
Train E11: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.5753, E=0.9076,
KL=0.3611, wKL=0.2750]
Train E11: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5363, E=0.9063,
KL=0.3592, wKL=0.2750]
Train E11: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.5363, E=0.9063,
KL=0.3592, wKL=0.2750]
Train E11: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.5575, E=0.9084,
KL=0.3619, wKL=0.2750]
Train E11: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.5575, E=0.9084,
KL=0.3619, wKL=0.2750]
Train E11: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5575, E=0.9084,
KL=0.3619, wKL=0.2750]
441.4s 94 [Epoch 011] Total: 2.4679 | N: 1.5103 | E: 0.9062 | KL(0.28×0.5): 0.3736
476.3s 95 Train E12: 0%| | 0/25 [00:00<?, ?batch/s]
Train E12: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4128, E=0.9053, KL=0.3562,
wKL=0.3000]
Train E12: 4%|▍ | 1/25 [00:01<00:33, 1.40s/batch, N=1.4128, E=0.9053,
KL=0.3562, wKL=0.3000]
Train E12: 4%|▍ | 1/25 [00:02<00:33, 1.40s/batch, N=1.4161, E=0.9053,
KL=0.3549, wKL=0.3000]
Train E12: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.4161, E=0.9053,
KL=0.3549, wKL=0.3000]
Train E12: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.5206, E=0.9017,
KL=0.3554, wKL=0.3000]
Train E12: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5206, E=0.9017,
KL=0.3554, wKL=0.3000]
Train E12: 12%|█▏ | 3/25 [00:06<00:29, 1.36s/batch, N=1.4862, E=0.9044,
KL=0.3527, wKL=0.3000]
Train E12: 16%|█▌ | 4/25 [00:06<00:34, 1.64s/batch, N=1.4862, E=0.9044,
KL=0.3527, wKL=0.3000]
Train E12: 16%|█▌ | 4/25 [00:07<00:34, 1.64s/batch, N=1.5436, E=0.9061,
KL=0.3525, wKL=0.3000]
Train E12: 20%|██ | 5/25 [00:07<00:31, 1.60s/batch, N=1.5436, E=0.9061,
KL=0.3525, wKL=0.3000]
Train E12: 20%|██ | 5/25 [00:09<00:31, 1.60s/batch, N=1.4792, E=0.9045,
KL=0.3508, wKL=0.3000]
Train E12: 24%|██▍ | 6/25 [00:09<00:29, 1.53s/batch, N=1.4792, E=0.9045,
KL=0.3508, wKL=0.3000]
Train E12: 24%|██▍ | 6/25 [00:10<00:29, 1.53s/batch, N=1.5652, E=0.9024,
KL=0.3523, wKL=0.3000]
Train E12: 28%|██▊ | 7/25 [00:10<00:26, 1.50s/batch, N=1.5652, E=0.9024,
KL=0.3523, wKL=0.3000]
Train E12: 28%|██▊ | 7/25 [00:11<00:26, 1.50s/batch, N=1.4485, E=0.9041,
KL=0.3477, wKL=0.3000]
Train E12: 32%|███▏ | 8/25 [00:11<00:24, 1.47s/batch, N=1.4485, E=0.9041,
KL=0.3477, wKL=0.3000]
Train E12: 32%|███▏ | 8/25 [00:13<00:24, 1.47s/batch, N=1.5496, E=0.9024,
KL=0.3475, wKL=0.3000]
Train E12: 36%|███▌ | 9/25 [00:13<00:23, 1.45s/batch, N=1.5496, E=0.9024,
KL=0.3475, wKL=0.3000]
Train E12: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.5340, E=0.9005,
KL=0.3429, wKL=0.3000]
Train E12: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.5340, E=0.9005,
KL=0.3429, wKL=0.3000]
Train E12: 40%|████ | 10/25 [00:16<00:21, 1.44s/batch, N=1.5123, E=0.9035,
KL=0.3440, wKL=0.3000]
Train E12: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.5123, E=0.9035,
KL=0.3440, wKL=0.3000]
Train E12: 44%|████▍ | 11/25 [00:17<00:19, 1.42s/batch, N=1.4561, E=0.9068,
KL=0.3427, wKL=0.3000]
Train E12: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4561, E=0.9068,
KL=0.3427, wKL=0.3000]
Train E12: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5383, E=0.9027,
KL=0.3449, wKL=0.3000]
Train E12: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5383, E=0.9027,
KL=0.3449, wKL=0.3000]
Train E12: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.5454, E=0.9028,
KL=0.3441, wKL=0.3000]
Train E12: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.5454, E=0.9028,
KL=0.3441, wKL=0.3000]
Train E12: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4459, E=0.9030,
KL=0.3395, wKL=0.3000]
Train E12: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4459, E=0.9030,
KL=0.3395, wKL=0.3000]
Train E12: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.5298, E=0.9042,
KL=0.3398, wKL=0.3000]
Train E12: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.5298, E=0.9042,
KL=0.3398, wKL=0.3000]
Train E12: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.5703, E=0.9021,
KL=0.3371, wKL=0.3000]
Train E12: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5703, E=0.9021,
KL=0.3371, wKL=0.3000]
Train E12: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5296, E=0.9030,
KL=0.3369, wKL=0.3000]
Train E12: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5296, E=0.9030,
KL=0.3369, wKL=0.3000]
Train E12: 72%|███████▏ | 18/25 [00:27<00:09, 1.39s/batch, N=1.5632, E=0.9040,
KL=0.3377, wKL=0.3000]
Train E12: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.5632, E=0.9040,
KL=0.3377, wKL=0.3000]
Train E12: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5168, E=0.9033,
KL=0.3349, wKL=0.3000]
Train E12: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5168, E=0.9033,
KL=0.3349, wKL=0.3000]
Train E12: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5792, E=0.9022,
KL=0.3360, wKL=0.3000]
Train E12: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5792, E=0.9022,
KL=0.3360, wKL=0.3000]
Train E12: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5169, E=0.8997,
KL=0.3342, wKL=0.3000]
Train E12: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.5169, E=0.8997,
KL=0.3342, wKL=0.3000]
Train E12: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4897, E=0.9009,
KL=0.3325, wKL=0.3000]
Train E12: 92%|█████████▏| 23/25 [00:32<00:02, 1.39s/batch, N=1.4897, E=0.9009,
KL=0.3325, wKL=0.3000]
Train E12: 92%|█████████▏| 23/25 [00:34<00:02, 1.39s/batch, N=1.4535, E=0.9016,
KL=0.3297, wKL=0.3000]
Train E12: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4535, E=0.9016,
KL=0.3297, wKL=0.3000]
Train E12: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5575, E=0.8985,
KL=0.3337, wKL=0.3000]
Train E12: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.5575, E=0.8985,
KL=0.3337, wKL=0.3000]
Train E12: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5575, E=0.8985,
KL=0.3337, wKL=0.3000]
476.3s 96 [Epoch 012] Total: 2.4639 | N: 1.5093 | E: 0.9031 | KL(0.30×0.5): 0.3435
511.3s 97 Train E13: 0%| | 0/25 [00:00<?, ?batch/s]
Train E13: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5473, E=0.8999, KL=0.3347,
wKL=0.3250]
Train E13: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5473, E=0.8999,
KL=0.3347, wKL=0.3250]
Train E13: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5427, E=0.8995,
KL=0.3329, wKL=0.3250]
Train E13: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5427, E=0.8995,
KL=0.3329, wKL=0.3250]
Train E13: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4968, E=0.8979,
KL=0.3290, wKL=0.3250]
Train E13: 12%|█▏ | 3/25 [00:04<00:33, 1.51s/batch, N=1.4968, E=0.8979,
KL=0.3290, wKL=0.3250]
Train E13: 12%|█▏ | 3/25 [00:05<00:33, 1.51s/batch, N=1.4971, E=0.9007,
KL=0.3302, wKL=0.3250]
Train E13: 16%|█▌ | 4/25 [00:05<00:31, 1.48s/batch, N=1.4971, E=0.9007,
KL=0.3302, wKL=0.3250]
Train E13: 16%|█▌ | 4/25 [00:07<00:31, 1.48s/batch, N=1.5223, E=0.8996,
KL=0.3284, wKL=0.3250]
Train E13: 20%|██ | 5/25 [00:07<00:28, 1.44s/batch, N=1.5223, E=0.8996,
KL=0.3284, wKL=0.3250]
Train E13: 20%|██ | 5/25 [00:08<00:28, 1.44s/batch, N=1.4452, E=0.8981,
KL=0.3267, wKL=0.3250]
Train E13: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.4452, E=0.8981,
KL=0.3267, wKL=0.3250]
Train E13: 24%|██▍ | 6/25 [00:09<00:26, 1.42s/batch, N=1.4388, E=0.8965,
KL=0.3252, wKL=0.3250]
Train E13: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4388, E=0.8965,
KL=0.3252, wKL=0.3250]
Train E13: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5665, E=0.8962,
KL=0.3252, wKL=0.3250]
Train E13: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5665, E=0.8962,
KL=0.3252, wKL=0.3250]
Train E13: 32%|███▏ | 8/25 [00:13<00:23, 1.39s/batch, N=1.4660, E=0.8937,
KL=0.3210, wKL=0.3250]
Train E13: 36%|███▌ | 9/25 [00:13<00:25, 1.58s/batch, N=1.4660, E=0.8937,
KL=0.3210, wKL=0.3250]
Train E13: 36%|███▌ | 9/25 [00:14<00:25, 1.58s/batch, N=1.4529, E=0.8947,
KL=0.3185, wKL=0.3250]
Train E13: 40%|████ | 10/25 [00:14<00:23, 1.54s/batch, N=1.4529, E=0.8947,
KL=0.3185, wKL=0.3250]
Train E13: 40%|████ | 10/25 [00:16<00:23, 1.54s/batch, N=1.5188, E=0.8961,
KL=0.3200, wKL=0.3250]
Train E13: 44%|████▍ | 11/25 [00:16<00:20, 1.49s/batch, N=1.5188, E=0.8961,
KL=0.3200, wKL=0.3250]
Train E13: 44%|████▍ | 11/25 [00:17<00:20, 1.49s/batch, N=1.5467, E=0.8957,
KL=0.3198, wKL=0.3250]
Train E13: 48%|████▊ | 12/25 [00:17<00:18, 1.46s/batch, N=1.5467, E=0.8957,
KL=0.3198, wKL=0.3250]
Train E13: 48%|████▊ | 12/25 [00:18<00:18, 1.46s/batch, N=1.4774, E=0.8950,
KL=0.3192, wKL=0.3250]
Train E13: 52%|█████▏ | 13/25 [00:18<00:17, 1.43s/batch, N=1.4774, E=0.8950,
KL=0.3192, wKL=0.3250]
Train E13: 52%|█████▏ | 13/25 [00:20<00:17, 1.43s/batch, N=1.4142, E=0.8893,
KL=0.3181, wKL=0.3250]
Train E13: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4142, E=0.8893,
KL=0.3181, wKL=0.3250]
Train E13: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.5587, E=0.8889,
KL=0.3206, wKL=0.3250]
Train E13: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5587, E=0.8889,
KL=0.3206, wKL=0.3250]
Train E13: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4874, E=0.8876,
KL=0.3166, wKL=0.3250]
Train E13: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4874, E=0.8876,
KL=0.3166, wKL=0.3250]
Train E13: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.5069, E=0.8868,
KL=0.3164, wKL=0.3250]
Train E13: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.5069, E=0.8868,
KL=0.3164, wKL=0.3250]
Train E13: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.5411, E=0.8859,
KL=0.3183, wKL=0.3250]
Train E13: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5411, E=0.8859,
KL=0.3183, wKL=0.3250]
Train E13: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.6103, E=0.8889,
KL=0.3208, wKL=0.3250]
Train E13: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.6103, E=0.8889,
KL=0.3208, wKL=0.3250]
Train E13: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4265, E=0.8808,
KL=0.3167, wKL=0.3250]
Train E13: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4265, E=0.8808,
KL=0.3167, wKL=0.3250]
Train E13: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5063, E=0.8819,
KL=0.3182, wKL=0.3250]
Train E13: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5063, E=0.8819,
KL=0.3182, wKL=0.3250]
Train E13: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5402, E=0.8805,
KL=0.3210, wKL=0.3250]
Train E13: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.5402, E=0.8805,
KL=0.3210, wKL=0.3250]
Train E13: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4774, E=0.8791,
KL=0.3202, wKL=0.3250]
Train E13: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4774, E=0.8791,
KL=0.3202, wKL=0.3250]
Train E13: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5410, E=0.8765,
KL=0.3200, wKL=0.3250]
Train E13: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5410, E=0.8765,
KL=0.3200, wKL=0.3250]
Train E13: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5975, E=0.8798,
KL=0.3234, wKL=0.3250]
Train E13: 100%|██████████| 25/25 [00:34<00:00, 1.19s/batch, N=1.5975, E=0.8798,
KL=0.3234, wKL=0.3250]
Train E13: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5975, E=0.8798,
KL=0.3234, wKL=0.3250]
511.3s 98 [Epoch 013] Total: 2.4503 | N: 1.5069 | E: 0.8910 | KL(0.33×0.5): 0.3224
546.4s 99 Train E14: 0%| | 0/25 [00:00<?, ?batch/s]
Train E14: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5399, E=0.8757, KL=0.3247,
wKL=0.3500]
Train E14: 4%|▍ | 1/25 [00:01<00:36, 1.53s/batch, N=1.5399, E=0.8757,
KL=0.3247, wKL=0.3500]
Train E14: 4%|▍ | 1/25 [00:02<00:36, 1.53s/batch, N=1.4990, E=0.8706,
KL=0.3237, wKL=0.3500]
Train E14: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.4990, E=0.8706,
KL=0.3237, wKL=0.3500]
Train E14: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.5140, E=0.8673,
KL=0.3227, wKL=0.3500]
Train E14: 12%|█▏ | 3/25 [00:04<00:31, 1.41s/batch, N=1.5140, E=0.8673,
KL=0.3227, wKL=0.3500]
Train E14: 12%|█▏ | 3/25 [00:05<00:31, 1.41s/batch, N=1.4455, E=0.8663,
KL=0.3225, wKL=0.3500]
Train E14: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4455, E=0.8663,
KL=0.3225, wKL=0.3500]
Train E14: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4900, E=0.8637,
KL=0.3254, wKL=0.3500]
Train E14: 20%|██ | 5/25 [00:07<00:27, 1.40s/batch, N=1.4900, E=0.8637,
KL=0.3254, wKL=0.3500]
Train E14: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.5399, E=0.8619,
KL=0.3268, wKL=0.3500]
Train E14: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5399, E=0.8619,
KL=0.3268, wKL=0.3500]
Train E14: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4421, E=0.8567,
KL=0.3279, wKL=0.3500]
Train E14: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4421, E=0.8567,
KL=0.3279, wKL=0.3500]
Train E14: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4874, E=0.8567,
KL=0.3325, wKL=0.3500]
Train E14: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4874, E=0.8567,
KL=0.3325, wKL=0.3500]
Train E14: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.5812, E=0.8488,
KL=0.3320, wKL=0.3500]
Train E14: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5812, E=0.8488,
KL=0.3320, wKL=0.3500]
Train E14: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.5572, E=0.8514,
KL=0.3318, wKL=0.3500]
Train E14: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.5572, E=0.8514,
KL=0.3318, wKL=0.3500]
Train E14: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4397, E=0.8511,
KL=0.3325, wKL=0.3500]
Train E14: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4397, E=0.8511,
KL=0.3325, wKL=0.3500]
Train E14: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.4607, E=0.8474,
KL=0.3378, wKL=0.3500]
Train E14: 48%|████▊ | 12/25 [00:17<00:20, 1.59s/batch, N=1.4607, E=0.8474,
KL=0.3378, wKL=0.3500]
Train E14: 48%|████▊ | 12/25 [00:18<00:20, 1.59s/batch, N=1.5927, E=0.8502,
KL=0.3428, wKL=0.3500]
Train E14: 52%|█████▏ | 13/25 [00:18<00:18, 1.54s/batch, N=1.5927, E=0.8502,
KL=0.3428, wKL=0.3500]
Train E14: 52%|█████▏ | 13/25 [00:20<00:18, 1.54s/batch, N=1.5155, E=0.8485,
KL=0.3436, wKL=0.3500]
Train E14: 56%|█████▌ | 14/25 [00:20<00:16, 1.51s/batch, N=1.5155, E=0.8485,
KL=0.3436, wKL=0.3500]
Train E14: 56%|█████▌ | 14/25 [00:21<00:16, 1.51s/batch, N=1.4703, E=0.8405,
KL=0.3418, wKL=0.3500]
Train E14: 60%|██████ | 15/25 [00:21<00:14, 1.47s/batch, N=1.4703, E=0.8405,
KL=0.3418, wKL=0.3500]
Train E14: 60%|██████ | 15/25 [00:23<00:14, 1.47s/batch, N=1.4250, E=0.8418,
KL=0.3431, wKL=0.3500]
Train E14: 64%|██████▍ | 16/25 [00:23<00:13, 1.44s/batch, N=1.4250, E=0.8418,
KL=0.3431, wKL=0.3500]
Train E14: 64%|██████▍ | 16/25 [00:24<00:13, 1.44s/batch, N=1.4905, E=0.8388,
KL=0.3465, wKL=0.3500]
Train E14: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.4905, E=0.8388,
KL=0.3465, wKL=0.3500]
Train E14: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.5387, E=0.8415,
KL=0.3508, wKL=0.3500]
Train E14: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5387, E=0.8415,
KL=0.3508, wKL=0.3500]
Train E14: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.5076, E=0.8382,
KL=0.3525, wKL=0.3500]
Train E14: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5076, E=0.8382,
KL=0.3525, wKL=0.3500]
Train E14: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4468, E=0.8417,
KL=0.3493, wKL=0.3500]
Train E14: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4468, E=0.8417,
KL=0.3493, wKL=0.3500]
Train E14: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.4582, E=0.8392,
KL=0.3473, wKL=0.3500]
Train E14: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4582, E=0.8392,
KL=0.3473, wKL=0.3500]
Train E14: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.4248, E=0.8377,
KL=0.3458, wKL=0.3500]
Train E14: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.4248, E=0.8377,
KL=0.3458, wKL=0.3500]
Train E14: 88%|████████▊ | 22/25 [00:33<00:04, 1.44s/batch, N=1.5299, E=0.8386,
KL=0.3474, wKL=0.3500]
Train E14: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.5299, E=0.8386,
KL=0.3474, wKL=0.3500]
Train E14: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5121, E=0.8366,
KL=0.3468, wKL=0.3500]
Train E14: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.5121, E=0.8366,
KL=0.3468, wKL=0.3500]
Train E14: 96%|█████████▌| 24/25 [00:35<00:01, 1.44s/batch, N=1.6547, E=0.8339,
KL=0.3487, wKL=0.3500]
Train E14: 100%|██████████| 25/25 [00:35<00:00, 1.19s/batch, N=1.6547, E=0.8339,
KL=0.3487, wKL=0.3500]
Train E14: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.6547, E=0.8339,
KL=0.3487, wKL=0.3500]
546.4s 100 [Epoch 014] Total: 2.4081 | N: 1.4989 | E: 0.8502 | KL(0.35×0.5): 0.3376
581.5s 101 Train E15: 0%| | 0/25 [00:00<?, ?batch/s]
Train E15: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5293, E=0.8376, KL=0.3426,
wKL=0.3750]
Train E15: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.5293, E=0.8376,
KL=0.3426, wKL=0.3750]
Train E15: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.4947, E=0.8374,
KL=0.3395, wKL=0.3750]
Train E15: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4947, E=0.8374,
KL=0.3395, wKL=0.3750]
Train E15: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.5105, E=0.8318,
KL=0.3359, wKL=0.3750]
Train E15: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5105, E=0.8318,
KL=0.3359, wKL=0.3750]
Train E15: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4658, E=0.8380,
KL=0.3319, wKL=0.3750]
Train E15: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.4658, E=0.8380,
KL=0.3319, wKL=0.3750]
Train E15: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.4792, E=0.8344,
KL=0.3301, wKL=0.3750]
Train E15: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4792, E=0.8344,
KL=0.3301, wKL=0.3750]
Train E15: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5064, E=0.8337,
KL=0.3282, wKL=0.3750]
Train E15: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5064, E=0.8337,
KL=0.3282, wKL=0.3750]
Train E15: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5324, E=0.8321,
KL=0.3251, wKL=0.3750]
Train E15: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5324, E=0.8321,
KL=0.3251, wKL=0.3750]
Train E15: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4746, E=0.8305,
KL=0.3212, wKL=0.3750]
Train E15: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4746, E=0.8305,
KL=0.3212, wKL=0.3750]
Train E15: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5386, E=0.8388,
KL=0.3199, wKL=0.3750]
Train E15: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.5386, E=0.8388,
KL=0.3199, wKL=0.3750]
Train E15: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.6122, E=0.8324,
KL=0.3214, wKL=0.3750]
Train E15: 40%|████ | 10/25 [00:13<00:20, 1.37s/batch, N=1.6122, E=0.8324,
KL=0.3214, wKL=0.3750]
Train E15: 40%|████ | 10/25 [00:15<00:20, 1.37s/batch, N=1.4640, E=0.8278,
KL=0.3193, wKL=0.3750]
Train E15: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4640, E=0.8278,
KL=0.3193, wKL=0.3750]
Train E15: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4052, E=0.8340,
KL=0.3164, wKL=0.3750]
Train E15: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4052, E=0.8340,
KL=0.3164, wKL=0.3750]
Train E15: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4371, E=0.8316,
KL=0.3169, wKL=0.3750]
Train E15: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4371, E=0.8316,
KL=0.3169, wKL=0.3750]
Train E15: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4958, E=0.8341,
KL=0.3133, wKL=0.3750]
Train E15: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4958, E=0.8341,
KL=0.3133, wKL=0.3750]
Train E15: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.4404, E=0.8369,
KL=0.3131, wKL=0.3750]
Train E15: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.4404, E=0.8369,
KL=0.3131, wKL=0.3750]
Train E15: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4911, E=0.8272,
KL=0.3145, wKL=0.3750]
Train E15: 64%|██████▍ | 16/25 [00:22<00:14, 1.59s/batch, N=1.4911, E=0.8272,
KL=0.3145, wKL=0.3750]
Train E15: 64%|██████▍ | 16/25 [00:24<00:14, 1.59s/batch, N=1.5196, E=0.8321,
KL=0.3153, wKL=0.3750]
Train E15: 68%|██████▊ | 17/25 [00:24<00:12, 1.55s/batch, N=1.5196, E=0.8321,
KL=0.3153, wKL=0.3750]
Train E15: 68%|██████▊ | 17/25 [00:25<00:12, 1.55s/batch, N=1.4874, E=0.8312,
KL=0.3143, wKL=0.3750]
Train E15: 72%|███████▏ | 18/25 [00:25<00:10, 1.50s/batch, N=1.4874, E=0.8312,
KL=0.3143, wKL=0.3750]
Train E15: 72%|███████▏ | 18/25 [00:27<00:10, 1.50s/batch, N=1.5481, E=0.8335,
KL=0.3144, wKL=0.3750]
Train E15: 76%|███████▌ | 19/25 [00:27<00:08, 1.47s/batch, N=1.5481, E=0.8335,
KL=0.3144, wKL=0.3750]
Train E15: 76%|███████▌ | 19/25 [00:28<00:08, 1.47s/batch, N=1.4926, E=0.8245,
KL=0.3127, wKL=0.3750]
Train E15: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.4926, E=0.8245,
KL=0.3127, wKL=0.3750]
Train E15: 80%|████████ | 20/25 [00:30<00:07, 1.50s/batch, N=1.4364, E=0.8316,
KL=0.3108, wKL=0.3750]
Train E15: 84%|████████▍ | 21/25 [00:30<00:06, 1.52s/batch, N=1.4364, E=0.8316,
KL=0.3108, wKL=0.3750]
Train E15: 84%|████████▍ | 21/25 [00:31<00:06, 1.52s/batch, N=1.5324, E=0.8264,
KL=0.3127, wKL=0.3750]
Train E15: 88%|████████▊ | 22/25 [00:31<00:04, 1.48s/batch, N=1.5324, E=0.8264,
KL=0.3127, wKL=0.3750]
Train E15: 88%|████████▊ | 22/25 [00:32<00:04, 1.48s/batch, N=1.4754, E=0.8256,
KL=0.3123, wKL=0.3750]
Train E15: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.4754, E=0.8256,
KL=0.3123, wKL=0.3750]
Train E15: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5212, E=0.8307,
KL=0.3116, wKL=0.3750]
Train E15: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5212, E=0.8307,
KL=0.3116, wKL=0.3750]
Train E15: 96%|█████████▌| 24/25 [00:35<00:01, 1.45s/batch, N=1.5164, E=0.8324,
KL=0.3098, wKL=0.3750]
Train E15: 100%|██████████| 25/25 [00:35<00:00, 1.20s/batch, N=1.5164, E=0.8324,
KL=0.3098, wKL=0.3750]
Train E15: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5164, E=0.8324,
KL=0.3098, wKL=0.3750]
581.5s 102 [Epoch 015] Total: 2.3881 | N: 1.4958 | E: 0.8322 | KL(0.38×0.5): 0.3204
616.5s 103 Train E16: 0%| | 0/25 [00:00<?, ?batch/s]
Train E16: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4652, E=0.8296, KL=0.3083,
wKL=0.4000]
Train E16: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4652, E=0.8296,
KL=0.3083, wKL=0.4000]
Train E16: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5902, E=0.8318,
KL=0.3113, wKL=0.4000]
Train E16: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5902, E=0.8318,
KL=0.3113, wKL=0.4000]
Train E16: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4924, E=0.8271,
KL=0.3087, wKL=0.4000]
Train E16: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4924, E=0.8271,
KL=0.3087, wKL=0.4000]
Train E16: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4109, E=0.8294,
KL=0.3025, wKL=0.4000]
Train E16: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4109, E=0.8294,
KL=0.3025, wKL=0.4000]
Train E16: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5012, E=0.8313,
KL=0.3038, wKL=0.4000]
Train E16: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5012, E=0.8313,
KL=0.3038, wKL=0.4000]
Train E16: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.3029, wKL=0.4000]
Train E16: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.3029, wKL=0.4000]
Train E16: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4879, E=0.8262,
KL=0.3038, wKL=0.4000]
Train E16: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4879, E=0.8262,
KL=0.3038, wKL=0.4000]
Train E16: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5264, E=0.8323,
KL=0.2990, wKL=0.4000]
Train E16: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5264, E=0.8323,
KL=0.2990, wKL=0.4000]
Train E16: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.5038, E=0.8319,
KL=0.2969, wKL=0.4000]
Train E16: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5038, E=0.8319,
KL=0.2969, wKL=0.4000]
Train E16: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5893, E=0.8302,
KL=0.2981, wKL=0.4000]
Train E16: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.5893, E=0.8302,
KL=0.2981, wKL=0.4000]
Train E16: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4951, E=0.8289,
KL=0.2933, wKL=0.4000]
Train E16: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4951, E=0.8289,
KL=0.2933, wKL=0.4000]
Train E16: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4369, E=0.8354,
KL=0.2926, wKL=0.4000]
Train E16: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4369, E=0.8354,
KL=0.2926, wKL=0.4000]
Train E16: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4493, E=0.8349,
KL=0.2913, wKL=0.4000]
Train E16: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4493, E=0.8349,
KL=0.2913, wKL=0.4000]
Train E16: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5137, E=0.8328,
KL=0.2911, wKL=0.4000]
Train E16: 56%|█████▌ | 14/25 [00:19<00:15, 1.42s/batch, N=1.5137, E=0.8328,
KL=0.2911, wKL=0.4000]
Train E16: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.5055, E=0.8258,
KL=0.2906, wKL=0.4000]
Train E16: 60%|██████ | 15/25 [00:20<00:14, 1.43s/batch, N=1.5055, E=0.8258,
KL=0.2906, wKL=0.4000]
Train E16: 60%|██████ | 15/25 [00:22<00:14, 1.43s/batch, N=1.5913, E=0.8269,
KL=0.2929, wKL=0.4000]
Train E16: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.5913, E=0.8269,
KL=0.2929, wKL=0.4000]
Train E16: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.4800, E=0.8300,
KL=0.2900, wKL=0.4000]
Train E16: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.4800, E=0.8300,
KL=0.2900, wKL=0.4000]
Train E16: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5476, E=0.8323,
KL=0.2899, wKL=0.4000]
Train E16: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.5476, E=0.8323,
KL=0.2899, wKL=0.4000]
Train E16: 72%|███████▏ | 18/25 [00:27<00:10, 1.44s/batch, N=1.4325, E=0.8280,
KL=0.2881, wKL=0.4000]
Train E16: 76%|███████▌ | 19/25 [00:27<00:09, 1.64s/batch, N=1.4325, E=0.8280,
KL=0.2881, wKL=0.4000]
Train E16: 76%|███████▌ | 19/25 [00:28<00:09, 1.64s/batch, N=1.4304, E=0.8310,
KL=0.2873, wKL=0.4000]
Train E16: 80%|████████ | 20/25 [00:28<00:07, 1.58s/batch, N=1.4304, E=0.8310,
KL=0.2873, wKL=0.4000]
Train E16: 80%|████████ | 20/25 [00:30<00:07, 1.58s/batch, N=1.5011, E=0.8236,
KL=0.2911, wKL=0.4000]
Train E16: 84%|████████▍ | 21/25 [00:30<00:06, 1.53s/batch, N=1.5011, E=0.8236,
KL=0.2911, wKL=0.4000]
Train E16: 84%|████████▍ | 21/25 [00:31<00:06, 1.53s/batch, N=1.3503, E=0.8260,
KL=0.2881, wKL=0.4000]
Train E16: 88%|████████▊ | 22/25 [00:31<00:04, 1.49s/batch, N=1.3503, E=0.8260,
KL=0.2881, wKL=0.4000]
Train E16: 88%|████████▊ | 22/25 [00:33<00:04, 1.49s/batch, N=1.5412, E=0.8293,
KL=0.2905, wKL=0.4000]
Train E16: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.5412, E=0.8293,
KL=0.2905, wKL=0.4000]
Train E16: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5358, E=0.8297,
KL=0.2887, wKL=0.4000]
Train E16: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.5358, E=0.8297,
KL=0.2887, wKL=0.4000]
Train E16: 96%|█████████▌| 24/25 [00:35<00:01, 1.44s/batch, N=1.5065, E=0.8332,
KL=0.2890, wKL=0.4000]
Train E16: 100%|██████████| 25/25 [00:35<00:00, 1.21s/batch, N=1.5065, E=0.8332,
KL=0.2890, wKL=0.4000]
Train E16: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5065, E=0.8332,
KL=0.2890, wKL=0.4000]
616.5s 104 [Epoch 016] Total: 2.3838 | N: 1.4951 | E: 0.8295 | KL(0.40×0.5): 0.2958
651.5s 105 Train E17: 0%| | 0/25 [00:00<?, ?batch/s]
Train E17: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5101, E=0.8297, KL=0.2871,
wKL=0.4250]
Train E17: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5101, E=0.8297,
KL=0.2871, wKL=0.4250]
Train E17: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5180, E=0.8264,
KL=0.2894, wKL=0.4250]
Train E17: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5180, E=0.8264,
KL=0.2894, wKL=0.4250]
Train E17: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5914, E=0.8280,
KL=0.2868, wKL=0.4250]
Train E17: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5914, E=0.8280,
KL=0.2868, wKL=0.4250]
Train E17: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5319, E=0.8271,
KL=0.2854, wKL=0.4250]
Train E17: 16%|█▌ | 4/25 [00:05<00:29, 1.40s/batch, N=1.5319, E=0.8271,
KL=0.2854, wKL=0.4250]
Train E17: 16%|█▌ | 4/25 [00:06<00:29, 1.40s/batch, N=1.5482, E=0.8274,
KL=0.2864, wKL=0.4250]
Train E17: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5482, E=0.8274,
KL=0.2864, wKL=0.4250]
Train E17: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.3949, E=0.8273,
KL=0.2783, wKL=0.4250]
Train E17: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.3949, E=0.8273,
KL=0.2783, wKL=0.4250]
Train E17: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5255, E=0.8298,
KL=0.2778, wKL=0.4250]
Train E17: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5255, E=0.8298,
KL=0.2778, wKL=0.4250]
Train E17: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4098, E=0.8272,
KL=0.2756, wKL=0.4250]
Train E17: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4098, E=0.8272,
KL=0.2756, wKL=0.4250]
Train E17: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4573, E=0.8311,
KL=0.2768, wKL=0.4250]
Train E17: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4573, E=0.8311,
KL=0.2768, wKL=0.4250]
Train E17: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5705, E=0.8246,
KL=0.2774, wKL=0.4250]
Train E17: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.5705, E=0.8246,
KL=0.2774, wKL=0.4250]
Train E17: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4136, E=0.8305,
KL=0.2737, wKL=0.4250]
Train E17: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4136, E=0.8305,
KL=0.2737, wKL=0.4250]
Train E17: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4269, E=0.8278,
KL=0.2703, wKL=0.4250]
Train E17: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4269, E=0.8278,
KL=0.2703, wKL=0.4250]
Train E17: 48%|████▊ | 12/25 [00:18<00:17, 1.38s/batch, N=1.4903, E=0.8246,
KL=0.2703, wKL=0.4250]
Train E17: 52%|█████▏ | 13/25 [00:18<00:16, 1.38s/batch, N=1.4903, E=0.8246,
KL=0.2703, wKL=0.4250]
Train E17: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.4956, E=0.8299,
KL=0.2703, wKL=0.4250]
Train E17: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.4956, E=0.8299,
KL=0.2703, wKL=0.4250]
Train E17: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4749, E=0.8302,
KL=0.2700, wKL=0.4250]
Train E17: 60%|██████ | 15/25 [00:20<00:13, 1.37s/batch, N=1.4749, E=0.8302,
KL=0.2700, wKL=0.4250]
Train E17: 60%|██████ | 15/25 [00:22<00:13, 1.37s/batch, N=1.4780, E=0.8297,
KL=0.2687, wKL=0.4250]
Train E17: 64%|██████▍ | 16/25 [00:22<00:13, 1.46s/batch, N=1.4780, E=0.8297,
KL=0.2687, wKL=0.4250]
Train E17: 64%|██████▍ | 16/25 [00:23<00:13, 1.46s/batch, N=1.4482, E=0.8310,
KL=0.2663, wKL=0.4250]
Train E17: 68%|██████▊ | 17/25 [00:23<00:11, 1.45s/batch, N=1.4482, E=0.8310,
KL=0.2663, wKL=0.4250]
Train E17: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.4167, E=0.8253,
KL=0.2665, wKL=0.4250]
Train E17: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.4167, E=0.8253,
KL=0.2665, wKL=0.4250]
Train E17: 72%|███████▏ | 18/25 [00:26<00:10, 1.44s/batch, N=1.6420, E=0.8272,
KL=0.2701, wKL=0.4250]
Train E17: 76%|███████▌ | 19/25 [00:26<00:08, 1.43s/batch, N=1.6420, E=0.8272,
KL=0.2701, wKL=0.4250]
Train E17: 76%|███████▌ | 19/25 [00:28<00:08, 1.43s/batch, N=1.4691, E=0.8249,
KL=0.2689, wKL=0.4250]
Train E17: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4691, E=0.8249,
KL=0.2689, wKL=0.4250]
Train E17: 80%|████████ | 20/25 [00:29<00:07, 1.42s/batch, N=1.5187, E=0.8294,
KL=0.2655, wKL=0.4250]
Train E17: 84%|████████▍ | 21/25 [00:29<00:05, 1.42s/batch, N=1.5187, E=0.8294,
KL=0.2655, wKL=0.4250]
Train E17: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5358, E=0.8260,
KL=0.2660, wKL=0.4250]
Train E17: 88%|████████▊ | 22/25 [00:30<00:04, 1.40s/batch, N=1.5358, E=0.8260,
KL=0.2660, wKL=0.4250]
Train E17: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4772, E=0.8265,
KL=0.2688, wKL=0.4250]
Train E17: 92%|█████████▏| 23/25 [00:32<00:03, 1.59s/batch, N=1.4772, E=0.8265,
KL=0.2688, wKL=0.4250]
Train E17: 92%|█████████▏| 23/25 [00:34<00:03, 1.59s/batch, N=1.5117, E=0.8327,
KL=0.2681, wKL=0.4250]
Train E17: 96%|█████████▌| 24/25 [00:34<00:01, 1.54s/batch, N=1.5117, E=0.8327,
KL=0.2681, wKL=0.4250]
Train E17: 96%|█████████▌| 24/25 [00:34<00:01, 1.54s/batch, N=1.4888, E=0.8280,
KL=0.2671, wKL=0.4250]
Train E17: 100%|██████████| 25/25 [00:34<00:00, 1.26s/batch, N=1.4888, E=0.8280,
KL=0.2671, wKL=0.4250]
Train E17: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4888, E=0.8280,
KL=0.2671, wKL=0.4250]
651.5s 106 [Epoch 017] Total: 2.3803 | N: 1.4939 | E: 0.8281 | KL(0.42×0.5): 0.2742
685.8s 107 Train E18: 0%| | 0/25 [00:00<?, ?batch/s]
Train E18: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4879, E=0.8247, KL=0.2658,
wKL=0.4500]
Train E18: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4879, E=0.8247,
KL=0.2658, wKL=0.4500]
Train E18: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4226, E=0.8264,
KL=0.2647, wKL=0.4500]
Train E18: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4226, E=0.8264,
KL=0.2647, wKL=0.4500]
Train E18: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5227, E=0.8289,
KL=0.2678, wKL=0.4500]
Train E18: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.5227, E=0.8289,
KL=0.2678, wKL=0.4500]
Train E18: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5000, E=0.8261,
KL=0.2642, wKL=0.4500]
Train E18: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5000, E=0.8261,
KL=0.2642, wKL=0.4500]
Train E18: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5024, E=0.8298,
KL=0.2628, wKL=0.4500]
Train E18: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5024, E=0.8298,
KL=0.2628, wKL=0.4500]
Train E18: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.5569, E=0.8267,
KL=0.2598, wKL=0.4500]
Train E18: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5569, E=0.8267,
KL=0.2598, wKL=0.4500]
Train E18: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4495, E=0.8282,
KL=0.2607, wKL=0.4500]
Train E18: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4495, E=0.8282,
KL=0.2607, wKL=0.4500]
Train E18: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4642, E=0.8282,
KL=0.2584, wKL=0.4500]
Train E18: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4642, E=0.8282,
KL=0.2584, wKL=0.4500]
Train E18: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5023, E=0.8306,
KL=0.2546, wKL=0.4500]
Train E18: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.5023, E=0.8306,
KL=0.2546, wKL=0.4500]
Train E18: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.4571, E=0.8248,
KL=0.2521, wKL=0.4500]
Train E18: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4571, E=0.8248,
KL=0.2521, wKL=0.4500]
Train E18: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.4748, E=0.8270,
KL=0.2542, wKL=0.4500]
Train E18: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4748, E=0.8270,
KL=0.2542, wKL=0.4500]
Train E18: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5611, E=0.8239,
KL=0.2546, wKL=0.4500]
Train E18: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5611, E=0.8239,
KL=0.2546, wKL=0.4500]
Train E18: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4592, E=0.8272,
KL=0.2513, wKL=0.4500]
Train E18: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4592, E=0.8272,
KL=0.2513, wKL=0.4500]
Train E18: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5321, E=0.8307,
KL=0.2512, wKL=0.4500]
Train E18: 56%|█████▌ | 14/25 [00:19<00:16, 1.47s/batch, N=1.5321, E=0.8307,
KL=0.2512, wKL=0.4500]
Train E18: 56%|█████▌ | 14/25 [00:20<00:16, 1.47s/batch, N=1.4314, E=0.8231,
KL=0.2524, wKL=0.4500]
Train E18: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.4314, E=0.8231,
KL=0.2524, wKL=0.4500]
Train E18: 60%|██████ | 15/25 [00:22<00:14, 1.44s/batch, N=1.4881, E=0.8294,
KL=0.2504, wKL=0.4500]
Train E18: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.4881, E=0.8294,
KL=0.2504, wKL=0.4500]
Train E18: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.5106, E=0.8253,
KL=0.2510, wKL=0.4500]
Train E18: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.5106, E=0.8253,
KL=0.2510, wKL=0.4500]
Train E18: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5054, E=0.8284,
KL=0.2513, wKL=0.4500]
Train E18: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5054, E=0.8284,
KL=0.2513, wKL=0.4500]
Train E18: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.4670, E=0.8261,
KL=0.2505, wKL=0.4500]
Train E18: 76%|███████▌ | 19/25 [00:26<00:08, 1.42s/batch, N=1.4670, E=0.8261,
KL=0.2505, wKL=0.4500]
Train E18: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5448, E=0.8259,
KL=0.2506, wKL=0.4500]
Train E18: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.5448, E=0.8259,
KL=0.2506, wKL=0.4500]
Train E18: 80%|████████ | 20/25 [00:29<00:07, 1.43s/batch, N=1.4607, E=0.8246,
KL=0.2483, wKL=0.4500]
Train E18: 84%|████████▍ | 21/25 [00:29<00:05, 1.43s/batch, N=1.4607, E=0.8246,
KL=0.2483, wKL=0.4500]
Train E18: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4392, E=0.8265,
KL=0.2488, wKL=0.4500]
Train E18: 88%|████████▊ | 22/25 [00:30<00:04, 1.42s/batch, N=1.4392, E=0.8265,
KL=0.2488, wKL=0.4500]
Train E18: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5339, E=0.8325,
KL=0.2493, wKL=0.4500]
Train E18: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.5339, E=0.8325,
KL=0.2493, wKL=0.4500]
Train E18: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.5196, E=0.8241,
KL=0.2475, wKL=0.4500]
Train E18: 96%|█████████▌| 24/25 [00:33<00:01, 1.41s/batch, N=1.5196, E=0.8241,
KL=0.2475, wKL=0.4500]
Train E18: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5215, E=0.8312,
KL=0.2472, wKL=0.4500]
Train E18: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5215, E=0.8312,
KL=0.2472, wKL=0.4500]
Train E18: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.5215, E=0.8312,
KL=0.2472, wKL=0.4500]
685.8s 108 [Epoch 018] Total: 2.3764 | N: 1.4919 | E: 0.8271 | KL(0.45×0.5): 0.2550
721.0s 109 Train E19: 0%| | 0/25 [00:00<?, ?batch/s]
Train E19: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5769, E=0.8271, KL=0.2476,
wKL=0.4750]
Train E19: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5769, E=0.8271,
KL=0.2476, wKL=0.4750]
Train E19: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4629, E=0.8268,
KL=0.2465, wKL=0.4750]
Train E19: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4629, E=0.8268,
KL=0.2465, wKL=0.4750]
Train E19: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4440, E=0.8297,
KL=0.2440, wKL=0.4750]
Train E19: 12%|█▏ | 3/25 [00:04<00:37, 1.72s/batch, N=1.4440, E=0.8297,
KL=0.2440, wKL=0.4750]
Train E19: 12%|█▏ | 3/25 [00:06<00:37, 1.72s/batch, N=1.4732, E=0.8238,
KL=0.2456, wKL=0.4750]
Train E19: 16%|█▌ | 4/25 [00:06<00:33, 1.58s/batch, N=1.4732, E=0.8238,
KL=0.2456, wKL=0.4750]
Train E19: 16%|█▌ | 4/25 [00:07<00:33, 1.58s/batch, N=1.4542, E=0.8244,
KL=0.2437, wKL=0.4750]
Train E19: 20%|██ | 5/25 [00:07<00:30, 1.50s/batch, N=1.4542, E=0.8244,
KL=0.2437, wKL=0.4750]
Train E19: 20%|██ | 5/25 [00:09<00:30, 1.50s/batch, N=1.5483, E=0.8246,
KL=0.2437, wKL=0.4750]
Train E19: 24%|██▍ | 6/25 [00:09<00:27, 1.47s/batch, N=1.5483, E=0.8246,
KL=0.2437, wKL=0.4750]
Train E19: 24%|██▍ | 6/25 [00:10<00:27, 1.47s/batch, N=1.4934, E=0.8268,
KL=0.2413, wKL=0.4750]
Train E19: 28%|██▊ | 7/25 [00:10<00:25, 1.44s/batch, N=1.4934, E=0.8268,
KL=0.2413, wKL=0.4750]
Train E19: 28%|██▊ | 7/25 [00:11<00:25, 1.44s/batch, N=1.5205, E=0.8303,
KL=0.2385, wKL=0.4750]
Train E19: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5205, E=0.8303,
KL=0.2385, wKL=0.4750]
Train E19: 32%|███▏ | 8/25 [00:13<00:24, 1.42s/batch, N=1.5378, E=0.8236,
KL=0.2394, wKL=0.4750]
Train E19: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.5378, E=0.8236,
KL=0.2394, wKL=0.4750]
Train E19: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.3987, E=0.8264,
KL=0.2364, wKL=0.4750]
Train E19: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.3987, E=0.8264,
KL=0.2364, wKL=0.4750]
Train E19: 40%|████ | 10/25 [00:16<00:21, 1.41s/batch, N=1.4580, E=0.8238,
KL=0.2371, wKL=0.4750]
Train E19: 44%|████▍ | 11/25 [00:16<00:20, 1.49s/batch, N=1.4580, E=0.8238,
KL=0.2371, wKL=0.4750]
Train E19: 44%|████▍ | 11/25 [00:17<00:20, 1.49s/batch, N=1.4793, E=0.8267,
KL=0.2371, wKL=0.4750]
Train E19: 48%|████▊ | 12/25 [00:17<00:19, 1.47s/batch, N=1.4793, E=0.8267,
KL=0.2371, wKL=0.4750]
Train E19: 48%|████▊ | 12/25 [00:19<00:19, 1.47s/batch, N=1.5524, E=0.8266,
KL=0.2352, wKL=0.4750]
Train E19: 52%|█████▏ | 13/25 [00:19<00:17, 1.44s/batch, N=1.5524, E=0.8266,
KL=0.2352, wKL=0.4750]
Train E19: 52%|█████▏ | 13/25 [00:20<00:17, 1.44s/batch, N=1.4741, E=0.8267,
KL=0.2346, wKL=0.4750]
Train E19: 56%|█████▌ | 14/25 [00:20<00:15, 1.44s/batch, N=1.4741, E=0.8267,
KL=0.2346, wKL=0.4750]
Train E19: 56%|█████▌ | 14/25 [00:21<00:15, 1.44s/batch, N=1.3984, E=0.8248,
KL=0.2339, wKL=0.4750]
Train E19: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.3984, E=0.8248,
KL=0.2339, wKL=0.4750]
Train E19: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.4899, E=0.8275,
KL=0.2346, wKL=0.4750]
Train E19: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.4899, E=0.8275,
KL=0.2346, wKL=0.4750]
Train E19: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.4708, E=0.8295,
KL=0.2338, wKL=0.4750]
Train E19: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.4708, E=0.8295,
KL=0.2338, wKL=0.4750]
Train E19: 68%|██████▊ | 17/25 [00:26<00:11, 1.43s/batch, N=1.5463, E=0.8269,
KL=0.2338, wKL=0.4750]
Train E19: 72%|███████▏ | 18/25 [00:26<00:09, 1.42s/batch, N=1.5463, E=0.8269,
KL=0.2338, wKL=0.4750]
Train E19: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5078, E=0.8283,
KL=0.2335, wKL=0.4750]
Train E19: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5078, E=0.8283,
KL=0.2335, wKL=0.4750]
Train E19: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5204, E=0.8328,
KL=0.2330, wKL=0.4750]
Train E19: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5204, E=0.8328,
KL=0.2330, wKL=0.4750]
Train E19: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.4661, E=0.8239,
KL=0.2334, wKL=0.4750]
Train E19: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.4661, E=0.8239,
KL=0.2334, wKL=0.4750]
Train E19: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5584, E=0.8301,
KL=0.2326, wKL=0.4750]
Train E19: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.5584, E=0.8301,
KL=0.2326, wKL=0.4750]
Train E19: 88%|████████▊ | 22/25 [00:33<00:04, 1.40s/batch, N=1.4515, E=0.8221,
KL=0.2326, wKL=0.4750]
Train E19: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4515, E=0.8221,
KL=0.2326, wKL=0.4750]
Train E19: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5171, E=0.8242,
KL=0.2332, wKL=0.4750]
Train E19: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5171, E=0.8242,
KL=0.2332, wKL=0.4750]
Train E19: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.4290, E=0.8331,
KL=0.2307, wKL=0.4750]
Train E19: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.4290, E=0.8331,
KL=0.2307, wKL=0.4750]
Train E19: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4290, E=0.8331,
KL=0.2307, wKL=0.4750]
721.0s 110 [Epoch 019] Total: 2.3737 | N: 1.4906 | E: 0.8267 | KL(0.47×0.5): 0.2376
755.9s 111 Train E20: 0%| | 0/25 [00:00<?, ?batch/s]
Train E20: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4677, E=0.8248, KL=0.2314,
wKL=0.5000]
Train E20: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4677, E=0.8248,
KL=0.2314, wKL=0.5000]
Train E20: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5378, E=0.8258,
KL=0.2315, wKL=0.5000]
Train E20: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5378, E=0.8258,
KL=0.2315, wKL=0.5000]
Train E20: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4674, E=0.8247,
KL=0.2297, wKL=0.5000]
Train E20: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4674, E=0.8247,
KL=0.2297, wKL=0.5000]
Train E20: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5557, E=0.8224,
KL=0.2296, wKL=0.5000]
Train E20: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5557, E=0.8224,
KL=0.2296, wKL=0.5000]
Train E20: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4961, E=0.8286,
KL=0.2284, wKL=0.5000]
Train E20: 20%|██ | 5/25 [00:06<00:27, 1.36s/batch, N=1.4961, E=0.8286,
KL=0.2284, wKL=0.5000]
Train E20: 20%|██ | 5/25 [00:08<00:27, 1.36s/batch, N=1.5197, E=0.8290,
KL=0.2270, wKL=0.5000]
Train E20: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.5197, E=0.8290,
KL=0.2270, wKL=0.5000]
Train E20: 24%|██▍ | 6/25 [00:10<00:25, 1.37s/batch, N=1.6077, E=0.8287,
KL=0.2248, wKL=0.5000]
Train E20: 28%|██▊ | 7/25 [00:10<00:28, 1.58s/batch, N=1.6077, E=0.8287,
KL=0.2248, wKL=0.5000]
Train E20: 28%|██▊ | 7/25 [00:11<00:28, 1.58s/batch, N=1.3944, E=0.8268,
KL=0.2233, wKL=0.5000]
Train E20: 32%|███▏ | 8/25 [00:11<00:25, 1.52s/batch, N=1.3944, E=0.8268,
KL=0.2233, wKL=0.5000]
Train E20: 32%|███▏ | 8/25 [00:13<00:25, 1.52s/batch, N=1.5787, E=0.8268,
KL=0.2245, wKL=0.5000]
Train E20: 36%|███▌ | 9/25 [00:13<00:25, 1.58s/batch, N=1.5787, E=0.8268,
KL=0.2245, wKL=0.5000]
Train E20: 36%|███▌ | 9/25 [00:14<00:25, 1.58s/batch, N=1.5136, E=0.8272,
KL=0.2224, wKL=0.5000]
Train E20: 40%|████ | 10/25 [00:14<00:22, 1.52s/batch, N=1.5136, E=0.8272,
KL=0.2224, wKL=0.5000]
Train E20: 40%|████ | 10/25 [00:16<00:22, 1.52s/batch, N=1.5782, E=0.8290,
KL=0.2225, wKL=0.5000]
Train E20: 44%|████▍ | 11/25 [00:16<00:20, 1.48s/batch, N=1.5782, E=0.8290,
KL=0.2225, wKL=0.5000]
Train E20: 44%|████▍ | 11/25 [00:17<00:20, 1.48s/batch, N=1.4295, E=0.8279,
KL=0.2208, wKL=0.5000]
Train E20: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.4295, E=0.8279,
KL=0.2208, wKL=0.5000]
Train E20: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.4425, E=0.8260,
KL=0.2219, wKL=0.5000]
Train E20: 52%|█████▏ | 13/25 [00:18<00:17, 1.44s/batch, N=1.4425, E=0.8260,
KL=0.2219, wKL=0.5000]
Train E20: 52%|█████▏ | 13/25 [00:20<00:17, 1.44s/batch, N=1.5109, E=0.8263,
KL=0.2234, wKL=0.5000]
Train E20: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.5109, E=0.8263,
KL=0.2234, wKL=0.5000]
Train E20: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.4230, E=0.8272,
KL=0.2203, wKL=0.5000]
Train E20: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4230, E=0.8272,
KL=0.2203, wKL=0.5000]
Train E20: 60%|██████ | 15/25 [00:23<00:14, 1.42s/batch, N=1.4053, E=0.8277,
KL=0.2223, wKL=0.5000]
Train E20: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.4053, E=0.8277,
KL=0.2223, wKL=0.5000]
Train E20: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.5393, E=0.8283,
KL=0.2215, wKL=0.5000]
Train E20: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5393, E=0.8283,
KL=0.2215, wKL=0.5000]
Train E20: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5004, E=0.8275,
KL=0.2212, wKL=0.5000]
Train E20: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5004, E=0.8275,
KL=0.2212, wKL=0.5000]
Train E20: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4836, E=0.8266,
KL=0.2210, wKL=0.5000]
Train E20: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.4836, E=0.8266,
KL=0.2210, wKL=0.5000]
Train E20: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4796, E=0.8254,
KL=0.2208, wKL=0.5000]
Train E20: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4796, E=0.8254,
KL=0.2208, wKL=0.5000]
Train E20: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.4548, E=0.8230,
KL=0.2215, wKL=0.5000]
Train E20: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4548, E=0.8230,
KL=0.2215, wKL=0.5000]
Train E20: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4718, E=0.8208,
KL=0.2196, wKL=0.5000]
Train E20: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4718, E=0.8208,
KL=0.2196, wKL=0.5000]
Train E20: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4813, E=0.8267,
KL=0.2173, wKL=0.5000]
Train E20: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4813, E=0.8267,
KL=0.2173, wKL=0.5000]
Train E20: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4811, E=0.8236,
KL=0.2188, wKL=0.5000]
Train E20: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4811, E=0.8236,
KL=0.2188, wKL=0.5000]
Train E20: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.3494, E=0.8218,
KL=0.2192, wKL=0.5000]
Train E20: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.3494, E=0.8218,
KL=0.2192, wKL=0.5000]
Train E20: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.3494, E=0.8218,
KL=0.2192, wKL=0.5000]
755.9s 112 [Epoch 020] Total: 2.3722 | N: 1.4901 | E: 0.8262 | KL(0.50×0.5): 0.2235
755.9s 113 Saved checkpoint: /kaggle/working/checkpoints/gvae_20_epoch020.pt
790.7s 114 Train E21: 0%| | 0/25 [00:00<?, ?batch/s]
Train E21: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5098, E=0.8210, KL=0.2180,
wKL=0.5250]
Train E21: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5098, E=0.8210,
KL=0.2180, wKL=0.5250]
Train E21: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5177, E=0.8287,
KL=0.2175, wKL=0.5250]
Train E21: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.5177, E=0.8287,
KL=0.2175, wKL=0.5250]
Train E21: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.4634, E=0.8311,
KL=0.2162, wKL=0.5250]
Train E21: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4634, E=0.8311,
KL=0.2162, wKL=0.5250]
Train E21: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5852, E=0.8255,
KL=0.2175, wKL=0.5250]
Train E21: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5852, E=0.8255,
KL=0.2175, wKL=0.5250]
Train E21: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4773, E=0.8272,
KL=0.2139, wKL=0.5250]
Train E21: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4773, E=0.8272,
KL=0.2139, wKL=0.5250]
Train E21: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5737, E=0.8219,
KL=0.2151, wKL=0.5250]
Train E21: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5737, E=0.8219,
KL=0.2151, wKL=0.5250]
Train E21: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5300, E=0.8267,
KL=0.2153, wKL=0.5250]
Train E21: 28%|██▊ | 7/25 [00:09<00:26, 1.47s/batch, N=1.5300, E=0.8267,
KL=0.2153, wKL=0.5250]
Train E21: 28%|██▊ | 7/25 [00:11<00:26, 1.47s/batch, N=1.5196, E=0.8270,
KL=0.2132, wKL=0.5250]
Train E21: 32%|███▏ | 8/25 [00:11<00:24, 1.44s/batch, N=1.5196, E=0.8270,
KL=0.2132, wKL=0.5250]
Train E21: 32%|███▏ | 8/25 [00:13<00:24, 1.44s/batch, N=1.4657, E=0.8232,
KL=0.2128, wKL=0.5250]
Train E21: 36%|███▌ | 9/25 [00:13<00:25, 1.62s/batch, N=1.4657, E=0.8232,
KL=0.2128, wKL=0.5250]
Train E21: 36%|███▌ | 9/25 [00:14<00:25, 1.62s/batch, N=1.4895, E=0.8277,
KL=0.2118, wKL=0.5250]
Train E21: 40%|████ | 10/25 [00:14<00:23, 1.55s/batch, N=1.4895, E=0.8277,
KL=0.2118, wKL=0.5250]
Train E21: 40%|████ | 10/25 [00:16<00:23, 1.55s/batch, N=1.4332, E=0.8272,
KL=0.2106, wKL=0.5250]
Train E21: 44%|████▍ | 11/25 [00:16<00:21, 1.51s/batch, N=1.4332, E=0.8272,
KL=0.2106, wKL=0.5250]
Train E21: 44%|████▍ | 11/25 [00:17<00:21, 1.51s/batch, N=1.5168, E=0.8265,
KL=0.2120, wKL=0.5250]
Train E21: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.5168, E=0.8265,
KL=0.2120, wKL=0.5250]
Train E21: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.5849, E=0.8252,
KL=0.2115, wKL=0.5250]
Train E21: 52%|█████▏ | 13/25 [00:18<00:17, 1.45s/batch, N=1.5849, E=0.8252,
KL=0.2115, wKL=0.5250]
Train E21: 52%|█████▏ | 13/25 [00:20<00:17, 1.45s/batch, N=1.5195, E=0.8266,
KL=0.2101, wKL=0.5250]
Train E21: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.5195, E=0.8266,
KL=0.2101, wKL=0.5250]
Train E21: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.4705, E=0.8262,
KL=0.2129, wKL=0.5250]
Train E21: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4705, E=0.8262,
KL=0.2129, wKL=0.5250]
Train E21: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.4954, E=0.8276,
KL=0.2114, wKL=0.5250]
Train E21: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4954, E=0.8276,
KL=0.2114, wKL=0.5250]
Train E21: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4817, E=0.8210,
KL=0.2115, wKL=0.5250]
Train E21: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4817, E=0.8210,
KL=0.2115, wKL=0.5250]
Train E21: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.4048, E=0.8283,
KL=0.2103, wKL=0.5250]
Train E21: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4048, E=0.8283,
KL=0.2103, wKL=0.5250]
Train E21: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.3990, E=0.8223,
KL=0.2099, wKL=0.5250]
Train E21: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.3990, E=0.8223,
KL=0.2099, wKL=0.5250]
Train E21: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4736, E=0.8249,
KL=0.2106, wKL=0.5250]
Train E21: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4736, E=0.8249,
KL=0.2106, wKL=0.5250]
Train E21: 80%|████████ | 20/25 [00:29<00:06, 1.40s/batch, N=1.5337, E=0.8247,
KL=0.2112, wKL=0.5250]
Train E21: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.5337, E=0.8247,
KL=0.2112, wKL=0.5250]
Train E21: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.4584, E=0.8301,
KL=0.2099, wKL=0.5250]
Train E21: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4584, E=0.8301,
KL=0.2099, wKL=0.5250]
Train E21: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4004, E=0.8241,
KL=0.2091, wKL=0.5250]
Train E21: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.4004, E=0.8241,
KL=0.2091, wKL=0.5250]
Train E21: 92%|█████████▏| 23/25 [00:34<00:02, 1.38s/batch, N=1.4414, E=0.8279,
KL=0.2077, wKL=0.5250]
Train E21: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4414, E=0.8279,
KL=0.2077, wKL=0.5250]
Train E21: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5041, E=0.8297,
KL=0.2090, wKL=0.5250]
Train E21: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5041, E=0.8297,
KL=0.2090, wKL=0.5250]
Train E21: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5041, E=0.8297,
KL=0.2090, wKL=0.5250]
790.7s 115 [Epoch 021] Total: 2.3714 | N: 1.4896 | E: 0.8260 | KL(0.53×0.5): 0.2124
825.7s 116 Train E22: 0%| | 0/25 [00:00<?, ?batch/s]
Train E22: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4450, E=0.8256, KL=0.2083,
wKL=0.5500]
Train E22: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4450, E=0.8256,
KL=0.2083, wKL=0.5500]
Train E22: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5185, E=0.8231,
KL=0.2093, wKL=0.5500]
Train E22: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5185, E=0.8231,
KL=0.2093, wKL=0.5500]
Train E22: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4773, E=0.8272,
KL=0.2076, wKL=0.5500]
Train E22: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4773, E=0.8272,
KL=0.2076, wKL=0.5500]
Train E22: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4963, E=0.8232,
KL=0.2067, wKL=0.5500]
Train E22: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4963, E=0.8232,
KL=0.2067, wKL=0.5500]
Train E22: 16%|█▌ | 4/25 [00:07<00:28, 1.37s/batch, N=1.4462, E=0.8232,
KL=0.2074, wKL=0.5500]
Train E22: 20%|██ | 5/25 [00:07<00:29, 1.46s/batch, N=1.4462, E=0.8232,
KL=0.2074, wKL=0.5500]
Train E22: 20%|██ | 5/25 [00:08<00:29, 1.46s/batch, N=1.5894, E=0.8296,
KL=0.2035, wKL=0.5500]
Train E22: 24%|██▍ | 6/25 [00:08<00:27, 1.43s/batch, N=1.5894, E=0.8296,
KL=0.2035, wKL=0.5500]
Train E22: 24%|██▍ | 6/25 [00:09<00:27, 1.43s/batch, N=1.5221, E=0.8278,
KL=0.2037, wKL=0.5500]
Train E22: 28%|██▊ | 7/25 [00:09<00:25, 1.42s/batch, N=1.5221, E=0.8278,
KL=0.2037, wKL=0.5500]
Train E22: 28%|██▊ | 7/25 [00:11<00:25, 1.42s/batch, N=1.4845, E=0.8295,
KL=0.2022, wKL=0.5500]
Train E22: 32%|███▏ | 8/25 [00:11<00:24, 1.41s/batch, N=1.4845, E=0.8295,
KL=0.2022, wKL=0.5500]
Train E22: 32%|███▏ | 8/25 [00:12<00:24, 1.41s/batch, N=1.3684, E=0.8345,
KL=0.2019, wKL=0.5500]
Train E22: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.3684, E=0.8345,
KL=0.2019, wKL=0.5500]
Train E22: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.5234, E=0.8283,
KL=0.2015, wKL=0.5500]
Train E22: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.5234, E=0.8283,
KL=0.2015, wKL=0.5500]
Train E22: 40%|████ | 10/25 [00:15<00:21, 1.42s/batch, N=1.4563, E=0.8228,
KL=0.2007, wKL=0.5500]
Train E22: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4563, E=0.8228,
KL=0.2007, wKL=0.5500]
Train E22: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.6448, E=0.8277,
KL=0.2030, wKL=0.5500]
Train E22: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.6448, E=0.8277,
KL=0.2030, wKL=0.5500]
Train E22: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4910, E=0.8219,
KL=0.2035, wKL=0.5500]
Train E22: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4910, E=0.8219,
KL=0.2035, wKL=0.5500]
Train E22: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.4948, E=0.8256,
KL=0.2007, wKL=0.5500]
Train E22: 56%|█████▌ | 14/25 [00:20<00:17, 1.61s/batch, N=1.4948, E=0.8256,
KL=0.2007, wKL=0.5500]
Train E22: 56%|█████▌ | 14/25 [00:21<00:17, 1.61s/batch, N=1.4372, E=0.8277,
KL=0.2027, wKL=0.5500]
Train E22: 60%|██████ | 15/25 [00:21<00:15, 1.54s/batch, N=1.4372, E=0.8277,
KL=0.2027, wKL=0.5500]
Train E22: 60%|██████ | 15/25 [00:23<00:15, 1.54s/batch, N=1.4764, E=0.8237,
KL=0.2005, wKL=0.5500]
Train E22: 64%|██████▍ | 16/25 [00:23<00:13, 1.50s/batch, N=1.4764, E=0.8237,
KL=0.2005, wKL=0.5500]
Train E22: 64%|██████▍ | 16/25 [00:24<00:13, 1.50s/batch, N=1.5053, E=0.8242,
KL=0.2018, wKL=0.5500]
Train E22: 68%|██████▊ | 17/25 [00:24<00:11, 1.47s/batch, N=1.5053, E=0.8242,
KL=0.2018, wKL=0.5500]
Train E22: 68%|██████▊ | 17/25 [00:25<00:11, 1.47s/batch, N=1.4938, E=0.8268,
KL=0.1999, wKL=0.5500]
Train E22: 72%|███████▏ | 18/25 [00:25<00:10, 1.45s/batch, N=1.4938, E=0.8268,
KL=0.1999, wKL=0.5500]
Train E22: 72%|███████▏ | 18/25 [00:27<00:10, 1.45s/batch, N=1.4765, E=0.8251,
KL=0.1995, wKL=0.5500]
Train E22: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.4765, E=0.8251,
KL=0.1995, wKL=0.5500]
Train E22: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.5090, E=0.8242,
KL=0.2015, wKL=0.5500]
Train E22: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5090, E=0.8242,
KL=0.2015, wKL=0.5500]
Train E22: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4856, E=0.8252,
KL=0.1976, wKL=0.5500]
Train E22: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4856, E=0.8252,
KL=0.1976, wKL=0.5500]
Train E22: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.4631, E=0.8223,
KL=0.2002, wKL=0.5500]
Train E22: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.4631, E=0.8223,
KL=0.2002, wKL=0.5500]
Train E22: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5016, E=0.8244,
KL=0.1999, wKL=0.5500]
Train E22: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5016, E=0.8244,
KL=0.1999, wKL=0.5500]
Train E22: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4187, E=0.8249,
KL=0.1979, wKL=0.5500]
Train E22: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4187, E=0.8249,
KL=0.1979, wKL=0.5500]
Train E22: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5344, E=0.8275,
KL=0.1993, wKL=0.5500]
Train E22: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5344, E=0.8275,
KL=0.1993, wKL=0.5500]
Train E22: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5344, E=0.8275,
KL=0.1993, wKL=0.5500]
825.7s 117 [Epoch 022] Total: 2.3708 | N: 1.4893 | E: 0.8258 | KL(0.55×0.5): 0.2025
860.9s 118 Train E23: 0%| | 0/25 [00:00<?, ?batch/s]
Train E23: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5300, E=0.8231, KL=0.1997,
wKL=0.5750]
Train E23: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5300, E=0.8231,
KL=0.1997, wKL=0.5750]
Train E23: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5106, E=0.8261,
KL=0.2004, wKL=0.5750]
Train E23: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.5106, E=0.8261,
KL=0.2004, wKL=0.5750]
Train E23: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.4288, E=0.8297,
KL=0.1955, wKL=0.5750]
Train E23: 12%|█▏ | 3/25 [00:04<00:32, 1.47s/batch, N=1.4288, E=0.8297,
KL=0.1955, wKL=0.5750]
Train E23: 12%|█▏ | 3/25 [00:05<00:32, 1.47s/batch, N=1.4177, E=0.8252,
KL=0.1966, wKL=0.5750]
Train E23: 16%|█▌ | 4/25 [00:05<00:30, 1.44s/batch, N=1.4177, E=0.8252,
KL=0.1966, wKL=0.5750]
Train E23: 16%|█▌ | 4/25 [00:07<00:30, 1.44s/batch, N=1.4072, E=0.8228,
KL=0.1960, wKL=0.5750]
Train E23: 20%|██ | 5/25 [00:07<00:28, 1.42s/batch, N=1.4072, E=0.8228,
KL=0.1960, wKL=0.5750]
Train E23: 20%|██ | 5/25 [00:08<00:28, 1.42s/batch, N=1.3849, E=0.8260,
KL=0.1940, wKL=0.5750]
Train E23: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.3849, E=0.8260,
KL=0.1940, wKL=0.5750]
Train E23: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5069, E=0.8195,
KL=0.1950, wKL=0.5750]
Train E23: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5069, E=0.8195,
KL=0.1950, wKL=0.5750]
Train E23: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4894, E=0.8278,
KL=0.1940, wKL=0.5750]
Train E23: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4894, E=0.8278,
KL=0.1940, wKL=0.5750]
Train E23: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5203, E=0.8227,
KL=0.1948, wKL=0.5750]
Train E23: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5203, E=0.8227,
KL=0.1948, wKL=0.5750]
Train E23: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.5059, E=0.8244,
KL=0.1938, wKL=0.5750]
Train E23: 40%|████ | 10/25 [00:14<00:20, 1.40s/batch, N=1.5059, E=0.8244,
KL=0.1938, wKL=0.5750]
Train E23: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4490, E=0.8256,
KL=0.1900, wKL=0.5750]
Train E23: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4490, E=0.8256,
KL=0.1900, wKL=0.5750]
Train E23: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4577, E=0.8250,
KL=0.1937, wKL=0.5750]
Train E23: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4577, E=0.8250,
KL=0.1937, wKL=0.5750]
Train E23: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.5089, E=0.8263,
KL=0.1905, wKL=0.5750]
Train E23: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5089, E=0.8263,
KL=0.1905, wKL=0.5750]
Train E23: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5315, E=0.8321,
KL=0.1929, wKL=0.5750]
Train E23: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5315, E=0.8321,
KL=0.1929, wKL=0.5750]
Train E23: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5533, E=0.8261,
KL=0.1915, wKL=0.5750]
Train E23: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5533, E=0.8261,
KL=0.1915, wKL=0.5750]
Train E23: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4476, E=0.8238,
KL=0.1910, wKL=0.5750]
Train E23: 64%|██████▍ | 16/25 [00:23<00:14, 1.59s/batch, N=1.4476, E=0.8238,
KL=0.1910, wKL=0.5750]
Train E23: 64%|██████▍ | 16/25 [00:24<00:14, 1.59s/batch, N=1.5785, E=0.8269,
KL=0.1936, wKL=0.5750]
Train E23: 68%|██████▊ | 17/25 [00:24<00:12, 1.53s/batch, N=1.5785, E=0.8269,
KL=0.1936, wKL=0.5750]
Train E23: 68%|██████▊ | 17/25 [00:26<00:12, 1.53s/batch, N=1.5131, E=0.8269,
KL=0.1897, wKL=0.5750]
Train E23: 72%|███████▏ | 18/25 [00:26<00:10, 1.51s/batch, N=1.5131, E=0.8269,
KL=0.1897, wKL=0.5750]
Train E23: 72%|███████▏ | 18/25 [00:27<00:10, 1.51s/batch, N=1.4914, E=0.8276,
KL=0.1936, wKL=0.5750]
Train E23: 76%|███████▌ | 19/25 [00:27<00:08, 1.48s/batch, N=1.4914, E=0.8276,
KL=0.1936, wKL=0.5750]
Train E23: 76%|███████▌ | 19/25 [00:28<00:08, 1.48s/batch, N=1.5348, E=0.8265,
KL=0.1907, wKL=0.5750]
Train E23: 80%|████████ | 20/25 [00:28<00:07, 1.45s/batch, N=1.5348, E=0.8265,
KL=0.1907, wKL=0.5750]
Train E23: 80%|████████ | 20/25 [00:30<00:07, 1.45s/batch, N=1.5551, E=0.8236,
KL=0.1917, wKL=0.5750]
Train E23: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.5551, E=0.8236,
KL=0.1917, wKL=0.5750]
Train E23: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.4673, E=0.8266,
KL=0.1926, wKL=0.5750]
Train E23: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4673, E=0.8266,
KL=0.1926, wKL=0.5750]
Train E23: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.3957, E=0.8260,
KL=0.1887, wKL=0.5750]
Train E23: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.3957, E=0.8260,
KL=0.1887, wKL=0.5750]
Train E23: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5596, E=0.8263,
KL=0.1921, wKL=0.5750]
Train E23: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.5596, E=0.8263,
KL=0.1921, wKL=0.5750]
Train E23: 96%|█████████▌| 24/25 [00:35<00:01, 1.43s/batch, N=1.4980, E=0.8318,
KL=0.1911, wKL=0.5750]
Train E23: 100%|██████████| 25/25 [00:35<00:00, 1.24s/batch, N=1.4980, E=0.8318,
KL=0.1911, wKL=0.5750]
Train E23: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4980, E=0.8318,
KL=0.1911, wKL=0.5750]
860.9s 119 [Epoch 023] Total: 2.3709 | N: 1.4895 | E: 0.8258 | KL(0.57×0.5): 0.1934
895.8s 120 Train E24: 0%| | 0/25 [00:00<?, ?batch/s]
Train E24: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5214, E=0.8300, KL=0.1897,
wKL=0.6000]
Train E24: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5214, E=0.8300,
KL=0.1897, wKL=0.6000]
Train E24: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4791, E=0.8293,
KL=0.1908, wKL=0.6000]
Train E24: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4791, E=0.8293,
KL=0.1908, wKL=0.6000]
Train E24: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.5493, E=0.8232,
KL=0.1886, wKL=0.6000]
Train E24: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5493, E=0.8232,
KL=0.1886, wKL=0.6000]
Train E24: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5567, E=0.8266,
KL=0.1891, wKL=0.6000]
Train E24: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.5567, E=0.8266,
KL=0.1891, wKL=0.6000]
Train E24: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4458, E=0.8216,
KL=0.1898, wKL=0.6000]
Train E24: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4458, E=0.8216,
KL=0.1898, wKL=0.6000]
Train E24: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4449, E=0.8214,
KL=0.1841, wKL=0.6000]
Train E24: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4449, E=0.8214,
KL=0.1841, wKL=0.6000]
Train E24: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4670, E=0.8253,
KL=0.1860, wKL=0.6000]
Train E24: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4670, E=0.8253,
KL=0.1860, wKL=0.6000]
Train E24: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4734, E=0.8256,
KL=0.1853, wKL=0.6000]
Train E24: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4734, E=0.8256,
KL=0.1853, wKL=0.6000]
Train E24: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5170, E=0.8210,
KL=0.1827, wKL=0.6000]
Train E24: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5170, E=0.8210,
KL=0.1827, wKL=0.6000]
Train E24: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4594, E=0.8295,
KL=0.1843, wKL=0.6000]
Train E24: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4594, E=0.8295,
KL=0.1843, wKL=0.6000]
Train E24: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.5517, E=0.8258,
KL=0.1840, wKL=0.6000]
Train E24: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.5517, E=0.8258,
KL=0.1840, wKL=0.6000]
Train E24: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4509, E=0.8292,
KL=0.1810, wKL=0.6000]
Train E24: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4509, E=0.8292,
KL=0.1810, wKL=0.6000]
Train E24: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4944, E=0.8252,
KL=0.1848, wKL=0.6000]
Train E24: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4944, E=0.8252,
KL=0.1848, wKL=0.6000]
Train E24: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5482, E=0.8180,
KL=0.1844, wKL=0.6000]
Train E24: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5482, E=0.8180,
KL=0.1844, wKL=0.6000]
Train E24: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.5802, E=0.8256,
KL=0.1864, wKL=0.6000]
Train E24: 60%|██████ | 15/25 [00:20<00:14, 1.41s/batch, N=1.5802, E=0.8256,
KL=0.1864, wKL=0.6000]
Train E24: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.4072, E=0.8256,
KL=0.1835, wKL=0.6000]
Train E24: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4072, E=0.8256,
KL=0.1835, wKL=0.6000]
Train E24: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.3967, E=0.8277,
KL=0.1825, wKL=0.6000]
Train E24: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.3967, E=0.8277,
KL=0.1825, wKL=0.6000]
Train E24: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4961, E=0.8282,
KL=0.1856, wKL=0.6000]
Train E24: 72%|███████▏ | 18/25 [00:24<00:09, 1.39s/batch, N=1.4961, E=0.8282,
KL=0.1856, wKL=0.6000]
Train E24: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4918, E=0.8258,
KL=0.1834, wKL=0.6000]
Train E24: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4918, E=0.8258,
KL=0.1834, wKL=0.6000]
Train E24: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5006, E=0.8242,
KL=0.1837, wKL=0.6000]
Train E24: 80%|████████ | 20/25 [00:28<00:07, 1.58s/batch, N=1.5006, E=0.8242,
KL=0.1837, wKL=0.6000]
Train E24: 80%|████████ | 20/25 [00:29<00:07, 1.58s/batch, N=1.4928, E=0.8284,
KL=0.1838, wKL=0.6000]
Train E24: 84%|████████▍ | 21/25 [00:29<00:06, 1.54s/batch, N=1.4928, E=0.8284,
KL=0.1838, wKL=0.6000]
Train E24: 84%|████████▍ | 21/25 [00:31<00:06, 1.54s/batch, N=1.4682, E=0.8226,
KL=0.1824, wKL=0.6000]
Train E24: 88%|████████▊ | 22/25 [00:31<00:04, 1.54s/batch, N=1.4682, E=0.8226,
KL=0.1824, wKL=0.6000]
Train E24: 88%|████████▊ | 22/25 [00:32<00:04, 1.54s/batch, N=1.4533, E=0.8305,
KL=0.1819, wKL=0.6000]
Train E24: 92%|█████████▏| 23/25 [00:32<00:03, 1.51s/batch, N=1.4533, E=0.8305,
KL=0.1819, wKL=0.6000]
Train E24: 92%|█████████▏| 23/25 [00:34<00:03, 1.51s/batch, N=1.5288, E=0.8235,
KL=0.1823, wKL=0.6000]
Train E24: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.5288, E=0.8235,
KL=0.1823, wKL=0.6000]
Train E24: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.4011, E=0.8268,
KL=0.1785, wKL=0.6000]
Train E24: 100%|██████████| 25/25 [00:34<00:00, 1.22s/batch, N=1.4011, E=0.8268,
KL=0.1785, wKL=0.6000]
Train E24: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4011, E=0.8268,
KL=0.1785, wKL=0.6000]
895.8s 121 [Epoch 024] Total: 2.3702 | N: 1.4891 | E: 0.8256 | KL(0.60×0.5): 0.1849
930.8s 122 Train E25: 0%| | 0/25 [00:00<?, ?batch/s]
Train E25: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5353, E=0.8301, KL=0.1801,
wKL=0.6250]
Train E25: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5353, E=0.8301,
KL=0.1801, wKL=0.6250]
Train E25: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5337, E=0.8260,
KL=0.1806, wKL=0.6250]
Train E25: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5337, E=0.8260,
KL=0.1806, wKL=0.6250]
Train E25: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4798, E=0.8257,
KL=0.1783, wKL=0.6250]
Train E25: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4798, E=0.8257,
KL=0.1783, wKL=0.6250]
Train E25: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5227, E=0.8266,
KL=0.1803, wKL=0.6250]
Train E25: 16%|█▌ | 4/25 [00:05<00:29, 1.42s/batch, N=1.5227, E=0.8266,
KL=0.1803, wKL=0.6250]
Train E25: 16%|█▌ | 4/25 [00:07<00:29, 1.42s/batch, N=1.4464, E=0.8266,
KL=0.1784, wKL=0.6250]
Train E25: 20%|██ | 5/25 [00:07<00:28, 1.41s/batch, N=1.4464, E=0.8266,
KL=0.1784, wKL=0.6250]
Train E25: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.4993, E=0.8181,
KL=0.1793, wKL=0.6250]
Train E25: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4993, E=0.8181,
KL=0.1793, wKL=0.6250]
Train E25: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.4512, E=0.8302,
KL=0.1785, wKL=0.6250]
Train E25: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4512, E=0.8302,
KL=0.1785, wKL=0.6250]
Train E25: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4525, E=0.8227,
KL=0.1783, wKL=0.6250]
Train E25: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4525, E=0.8227,
KL=0.1783, wKL=0.6250]
Train E25: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4172, E=0.8237,
KL=0.1772, wKL=0.6250]
Train E25: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4172, E=0.8237,
KL=0.1772, wKL=0.6250]
Train E25: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.4498, E=0.8272,
KL=0.1769, wKL=0.6250]
Train E25: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4498, E=0.8272,
KL=0.1769, wKL=0.6250]
Train E25: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5047, E=0.8297,
KL=0.1745, wKL=0.6250]
Train E25: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5047, E=0.8297,
KL=0.1745, wKL=0.6250]
Train E25: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5093, E=0.8220,
KL=0.1783, wKL=0.6250]
Train E25: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5093, E=0.8220,
KL=0.1783, wKL=0.6250]
Train E25: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4370, E=0.8274,
KL=0.1743, wKL=0.6250]
Train E25: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4370, E=0.8274,
KL=0.1743, wKL=0.6250]
Train E25: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.6212, E=0.8199,
KL=0.1776, wKL=0.6250]
Train E25: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.6212, E=0.8199,
KL=0.1776, wKL=0.6250]
Train E25: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5419, E=0.8209,
KL=0.1768, wKL=0.6250]
Train E25: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.5419, E=0.8209,
KL=0.1768, wKL=0.6250]
Train E25: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4988, E=0.8204,
KL=0.1756, wKL=0.6250]
Train E25: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4988, E=0.8204,
KL=0.1756, wKL=0.6250]
Train E25: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.5101, E=0.8248,
KL=0.1758, wKL=0.6250]
Train E25: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5101, E=0.8248,
KL=0.1758, wKL=0.6250]
Train E25: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4803, E=0.8256,
KL=0.1742, wKL=0.6250]
Train E25: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4803, E=0.8256,
KL=0.1742, wKL=0.6250]
Train E25: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4426, E=0.8228,
KL=0.1753, wKL=0.6250]
Train E25: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4426, E=0.8228,
KL=0.1753, wKL=0.6250]
Train E25: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4549, E=0.8227,
KL=0.1766, wKL=0.6250]
Train E25: 80%|████████ | 20/25 [00:28<00:07, 1.44s/batch, N=1.4549, E=0.8227,
KL=0.1766, wKL=0.6250]
Train E25: 80%|████████ | 20/25 [00:29<00:07, 1.44s/batch, N=1.5083, E=0.8313,
KL=0.1723, wKL=0.6250]
Train E25: 84%|████████▍ | 21/25 [00:29<00:05, 1.46s/batch, N=1.5083, E=0.8313,
KL=0.1723, wKL=0.6250]
Train E25: 84%|████████▍ | 21/25 [00:30<00:05, 1.46s/batch, N=1.4604, E=0.8243,
KL=0.1754, wKL=0.6250]
Train E25: 88%|████████▊ | 22/25 [00:30<00:04, 1.44s/batch, N=1.4604, E=0.8243,
KL=0.1754, wKL=0.6250]
Train E25: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.5875, E=0.8262,
KL=0.1762, wKL=0.6250]
Train E25: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.5875, E=0.8262,
KL=0.1762, wKL=0.6250]
Train E25: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.3709, E=0.8264,
KL=0.1742, wKL=0.6250]
Train E25: 96%|█████████▌| 24/25 [00:34<00:01, 1.61s/batch, N=1.3709, E=0.8264,
KL=0.1742, wKL=0.6250]
Train E25: 96%|█████████▌| 24/25 [00:35<00:01, 1.61s/batch, N=1.5278, E=0.8294,
KL=0.1759, wKL=0.6250]
Train E25: 100%|██████████| 25/25 [00:35<00:00, 1.32s/batch, N=1.5278, E=0.8294,
KL=0.1759, wKL=0.6250]
Train E25: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5278, E=0.8294,
KL=0.1759, wKL=0.6250]
930.8s 123 [Epoch 025] Total: 2.3692 | N: 1.4888 | E: 0.8251 | KL(0.62×0.5): 0.1769
965.3s 124 Train E26: 0%| | 0/25 [00:00<?, ?batch/s]
Train E26: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4241, E=0.8307, KL=0.1734,
wKL=0.6500]
Train E26: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4241, E=0.8307,
KL=0.1734, wKL=0.6500]
Train E26: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4363, E=0.8242,
KL=0.1743, wKL=0.6500]
Train E26: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4363, E=0.8242,
KL=0.1743, wKL=0.6500]
Train E26: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.5535, E=0.8239,
KL=0.1720, wKL=0.6500]
Train E26: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5535, E=0.8239,
KL=0.1720, wKL=0.6500]
Train E26: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4711, E=0.8247,
KL=0.1716, wKL=0.6500]
Train E26: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4711, E=0.8247,
KL=0.1716, wKL=0.6500]
Train E26: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.6623, E=0.8266,
KL=0.1748, wKL=0.6500]
Train E26: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.6623, E=0.8266,
KL=0.1748, wKL=0.6500]
Train E26: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4884, E=0.8230,
KL=0.1702, wKL=0.6500]
Train E26: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4884, E=0.8230,
KL=0.1702, wKL=0.6500]
Train E26: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4805, E=0.8266,
KL=0.1710, wKL=0.6500]
Train E26: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4805, E=0.8266,
KL=0.1710, wKL=0.6500]
Train E26: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4945, E=0.8230,
KL=0.1707, wKL=0.6500]
Train E26: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.4945, E=0.8230,
KL=0.1707, wKL=0.6500]
Train E26: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.4994, E=0.8272,
KL=0.1689, wKL=0.6500]
Train E26: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.4994, E=0.8272,
KL=0.1689, wKL=0.6500]
Train E26: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.4202, E=0.8284,
KL=0.1706, wKL=0.6500]
Train E26: 40%|████ | 10/25 [00:13<00:21, 1.41s/batch, N=1.4202, E=0.8284,
KL=0.1706, wKL=0.6500]
Train E26: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4906, E=0.8257,
KL=0.1703, wKL=0.6500]
Train E26: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4906, E=0.8257,
KL=0.1703, wKL=0.6500]
Train E26: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4588, E=0.8290,
KL=0.1689, wKL=0.6500]
Train E26: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4588, E=0.8290,
KL=0.1689, wKL=0.6500]
Train E26: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5497, E=0.8289,
KL=0.1709, wKL=0.6500]
Train E26: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5497, E=0.8289,
KL=0.1709, wKL=0.6500]
Train E26: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4802, E=0.8266,
KL=0.1679, wKL=0.6500]
Train E26: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4802, E=0.8266,
KL=0.1679, wKL=0.6500]
Train E26: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5013, E=0.8279,
KL=0.1706, wKL=0.6500]
Train E26: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.5013, E=0.8279,
KL=0.1706, wKL=0.6500]
Train E26: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5297, E=0.8234,
KL=0.1699, wKL=0.6500]
Train E26: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5297, E=0.8234,
KL=0.1699, wKL=0.6500]
Train E26: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4244, E=0.8228,
KL=0.1680, wKL=0.6500]
Train E26: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4244, E=0.8228,
KL=0.1680, wKL=0.6500]
Train E26: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5109, E=0.8195,
KL=0.1701, wKL=0.6500]
Train E26: 72%|███████▏ | 18/25 [00:25<00:10, 1.45s/batch, N=1.5109, E=0.8195,
KL=0.1701, wKL=0.6500]
Train E26: 72%|███████▏ | 18/25 [00:26<00:10, 1.45s/batch, N=1.4531, E=0.8178,
KL=0.1668, wKL=0.6500]
Train E26: 76%|███████▌ | 19/25 [00:26<00:08, 1.45s/batch, N=1.4531, E=0.8178,
KL=0.1668, wKL=0.6500]
Train E26: 76%|███████▌ | 19/25 [00:28<00:08, 1.45s/batch, N=1.4466, E=0.8198,
KL=0.1667, wKL=0.6500]
Train E26: 80%|████████ | 20/25 [00:28<00:07, 1.44s/batch, N=1.4466, E=0.8198,
KL=0.1667, wKL=0.6500]
Train E26: 80%|████████ | 20/25 [00:29<00:07, 1.44s/batch, N=1.4801, E=0.8170,
KL=0.1695, wKL=0.6500]
Train E26: 84%|████████▍ | 21/25 [00:29<00:05, 1.43s/batch, N=1.4801, E=0.8170,
KL=0.1695, wKL=0.6500]
Train E26: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4348, E=0.8250,
KL=0.1663, wKL=0.6500]
Train E26: 88%|████████▊ | 22/25 [00:30<00:04, 1.44s/batch, N=1.4348, E=0.8250,
KL=0.1663, wKL=0.6500]
Train E26: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.6050, E=0.8220,
KL=0.1687, wKL=0.6500]
Train E26: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.6050, E=0.8220,
KL=0.1687, wKL=0.6500]
Train E26: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4663, E=0.8265,
KL=0.1667, wKL=0.6500]
Train E26: 96%|█████████▌| 24/25 [00:33<00:01, 1.42s/batch, N=1.4663, E=0.8265,
KL=0.1667, wKL=0.6500]
Train E26: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4224, E=0.8277,
KL=0.1662, wKL=0.6500]
Train E26: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.4224, E=0.8277,
KL=0.1662, wKL=0.6500]
Train E26: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4224, E=0.8277,
KL=0.1662, wKL=0.6500]
965.3s 125 [Epoch 026] Total: 2.3688 | N: 1.4889 | E: 0.8246 | KL(0.65×0.5): 0.1699
1000.4s 126 Train E27: 0%| | 0/25 [00:00<?, ?batch/s]
Train E27: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5394, E=0.8218, KL=0.1679,
wKL=0.6750]
Train E27: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5394, E=0.8218,
KL=0.1679, wKL=0.6750]
Train E27: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4821, E=0.8224,
KL=0.1674, wKL=0.6750]
Train E27: 8%|▊ | 2/25 [00:02<00:32, 1.39s/batch, N=1.4821, E=0.8224,
KL=0.1674, wKL=0.6750]
Train E27: 8%|▊ | 2/25 [00:04<00:32, 1.39s/batch, N=1.5124, E=0.8247,
KL=0.1666, wKL=0.6750]
Train E27: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5124, E=0.8247,
KL=0.1666, wKL=0.6750]
Train E27: 12%|█▏ | 3/25 [00:06<00:30, 1.39s/batch, N=1.5776, E=0.8220,
KL=0.1668, wKL=0.6750]
Train E27: 16%|█▌ | 4/25 [00:06<00:35, 1.68s/batch, N=1.5776, E=0.8220,
KL=0.1668, wKL=0.6750]
Train E27: 16%|█▌ | 4/25 [00:07<00:35, 1.68s/batch, N=1.4154, E=0.8297,
KL=0.1645, wKL=0.6750]
Train E27: 20%|██ | 5/25 [00:07<00:31, 1.59s/batch, N=1.4154, E=0.8297,
KL=0.1645, wKL=0.6750]
Train E27: 20%|██ | 5/25 [00:09<00:31, 1.59s/batch, N=1.4853, E=0.8218,
KL=0.1644, wKL=0.6750]
Train E27: 24%|██▍ | 6/25 [00:09<00:28, 1.52s/batch, N=1.4853, E=0.8218,
KL=0.1644, wKL=0.6750]
Train E27: 24%|██▍ | 6/25 [00:10<00:28, 1.52s/batch, N=1.4476, E=0.8222,
KL=0.1640, wKL=0.6750]
Train E27: 28%|██▊ | 7/25 [00:10<00:26, 1.48s/batch, N=1.4476, E=0.8222,
KL=0.1640, wKL=0.6750]
Train E27: 28%|██▊ | 7/25 [00:11<00:26, 1.48s/batch, N=1.5324, E=0.8213,
KL=0.1652, wKL=0.6750]
Train E27: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.5324, E=0.8213,
KL=0.1652, wKL=0.6750]
Train E27: 32%|███▏ | 8/25 [00:13<00:24, 1.45s/batch, N=1.5175, E=0.8235,
KL=0.1642, wKL=0.6750]
Train E27: 36%|███▌ | 9/25 [00:13<00:22, 1.43s/batch, N=1.5175, E=0.8235,
KL=0.1642, wKL=0.6750]
Train E27: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4774, E=0.8272,
KL=0.1622, wKL=0.6750]
Train E27: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.4774, E=0.8272,
KL=0.1622, wKL=0.6750]
Train E27: 40%|████ | 10/25 [00:16<00:21, 1.42s/batch, N=1.4666, E=0.8253,
KL=0.1639, wKL=0.6750]
Train E27: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.4666, E=0.8253,
KL=0.1639, wKL=0.6750]
Train E27: 44%|████▍ | 11/25 [00:17<00:19, 1.42s/batch, N=1.4312, E=0.8232,
KL=0.1627, wKL=0.6750]
Train E27: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4312, E=0.8232,
KL=0.1627, wKL=0.6750]
Train E27: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4510, E=0.8269,
KL=0.1630, wKL=0.6750]
Train E27: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4510, E=0.8269,
KL=0.1630, wKL=0.6750]
Train E27: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.4915, E=0.8250,
KL=0.1641, wKL=0.6750]
Train E27: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4915, E=0.8250,
KL=0.1641, wKL=0.6750]
Train E27: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4872, E=0.8192,
KL=0.1654, wKL=0.6750]
Train E27: 60%|██████ | 15/25 [00:21<00:14, 1.45s/batch, N=1.4872, E=0.8192,
KL=0.1654, wKL=0.6750]
Train E27: 60%|██████ | 15/25 [00:23<00:14, 1.45s/batch, N=1.5341, E=0.8243,
KL=0.1633, wKL=0.6750]
Train E27: 64%|██████▍ | 16/25 [00:23<00:13, 1.45s/batch, N=1.5341, E=0.8243,
KL=0.1633, wKL=0.6750]
Train E27: 64%|██████▍ | 16/25 [00:24<00:13, 1.45s/batch, N=1.4783, E=0.8250,
KL=0.1645, wKL=0.6750]
Train E27: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.4783, E=0.8250,
KL=0.1645, wKL=0.6750]
Train E27: 68%|██████▊ | 17/25 [00:26<00:11, 1.44s/batch, N=1.4938, E=0.8233,
KL=0.1624, wKL=0.6750]
Train E27: 72%|███████▏ | 18/25 [00:26<00:10, 1.44s/batch, N=1.4938, E=0.8233,
KL=0.1624, wKL=0.6750]
Train E27: 72%|███████▏ | 18/25 [00:27<00:10, 1.44s/batch, N=1.5440, E=0.8236,
KL=0.1649, wKL=0.6750]
Train E27: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.5440, E=0.8236,
KL=0.1649, wKL=0.6750]
Train E27: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.4384, E=0.8155,
KL=0.1632, wKL=0.6750]
Train E27: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4384, E=0.8155,
KL=0.1632, wKL=0.6750]
Train E27: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4082, E=0.8276,
KL=0.1638, wKL=0.6750]
Train E27: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4082, E=0.8276,
KL=0.1638, wKL=0.6750]
Train E27: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5292, E=0.8252,
KL=0.1646, wKL=0.6750]
Train E27: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5292, E=0.8252,
KL=0.1646, wKL=0.6750]
Train E27: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.5363, E=0.8221,
KL=0.1621, wKL=0.6750]
Train E27: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.5363, E=0.8221,
KL=0.1621, wKL=0.6750]
Train E27: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4207, E=0.8267,
KL=0.1626, wKL=0.6750]
Train E27: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4207, E=0.8267,
KL=0.1626, wKL=0.6750]
Train E27: 96%|█████████▌| 24/25 [00:35<00:01, 1.41s/batch, N=1.5373, E=0.8203,
KL=0.1621, wKL=0.6750]
Train E27: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.5373, E=0.8203,
KL=0.1621, wKL=0.6750]
Train E27: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5373, E=0.8203,
KL=0.1621, wKL=0.6750]
1000.4s 127 [Epoch 027] Total: 2.3674 | N: 1.4882 | E: 0.8237 | KL(0.68×0.5):
0.1643
1035.2s 128 Train E28: 0%| | 0/25 [00:00<?, ?batch/s]
Train E28: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4465, E=0.8288, KL=0.1615,
wKL=0.7000]
Train E28: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4465, E=0.8288,
KL=0.1615, wKL=0.7000]
Train E28: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4221, E=0.8226,
KL=0.1611, wKL=0.7000]
Train E28: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4221, E=0.8226,
KL=0.1611, wKL=0.7000]
Train E28: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4262, E=0.8246,
KL=0.1607, wKL=0.7000]
Train E28: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4262, E=0.8246,
KL=0.1607, wKL=0.7000]
Train E28: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4637, E=0.8223,
KL=0.1621, wKL=0.7000]
Train E28: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4637, E=0.8223,
KL=0.1621, wKL=0.7000]
Train E28: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5387, E=0.8274,
KL=0.1594, wKL=0.7000]
Train E28: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5387, E=0.8274,
KL=0.1594, wKL=0.7000]
Train E28: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4756, E=0.8190,
KL=0.1613, wKL=0.7000]
Train E28: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.4756, E=0.8190,
KL=0.1613, wKL=0.7000]
Train E28: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.5217, E=0.8216,
KL=0.1604, wKL=0.7000]
Train E28: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.5217, E=0.8216,
KL=0.1604, wKL=0.7000]
Train E28: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4858, E=0.8216,
KL=0.1607, wKL=0.7000]
Train E28: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4858, E=0.8216,
KL=0.1607, wKL=0.7000]
Train E28: 32%|███▏ | 8/25 [00:13<00:23, 1.40s/batch, N=1.4428, E=0.8249,
KL=0.1578, wKL=0.7000]
Train E28: 36%|███▌ | 9/25 [00:13<00:25, 1.59s/batch, N=1.4428, E=0.8249,
KL=0.1578, wKL=0.7000]
Train E28: 36%|███▌ | 9/25 [00:14<00:25, 1.59s/batch, N=1.5075, E=0.8192,
KL=0.1592, wKL=0.7000]
Train E28: 40%|████ | 10/25 [00:14<00:22, 1.53s/batch, N=1.5075, E=0.8192,
KL=0.1592, wKL=0.7000]
Train E28: 40%|████ | 10/25 [00:15<00:22, 1.53s/batch, N=1.4390, E=0.8229,
KL=0.1590, wKL=0.7000]
Train E28: 44%|████▍ | 11/25 [00:15<00:20, 1.48s/batch, N=1.4390, E=0.8229,
KL=0.1590, wKL=0.7000]
Train E28: 44%|████▍ | 11/25 [00:17<00:20, 1.48s/batch, N=1.4187, E=0.8218,
KL=0.1563, wKL=0.7000]
Train E28: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.4187, E=0.8218,
KL=0.1563, wKL=0.7000]
Train E28: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.5216, E=0.8252,
KL=0.1581, wKL=0.7000]
Train E28: 52%|█████▏ | 13/25 [00:18<00:17, 1.49s/batch, N=1.5216, E=0.8252,
KL=0.1581, wKL=0.7000]
Train E28: 52%|█████▏ | 13/25 [00:20<00:17, 1.49s/batch, N=1.4731, E=0.8204,
KL=0.1575, wKL=0.7000]
Train E28: 56%|█████▌ | 14/25 [00:20<00:16, 1.49s/batch, N=1.4731, E=0.8204,
KL=0.1575, wKL=0.7000]
Train E28: 56%|█████▌ | 14/25 [00:21<00:16, 1.49s/batch, N=1.5030, E=0.8283,
KL=0.1587, wKL=0.7000]
Train E28: 60%|██████ | 15/25 [00:21<00:14, 1.46s/batch, N=1.5030, E=0.8283,
KL=0.1587, wKL=0.7000]
Train E28: 60%|██████ | 15/25 [00:23<00:14, 1.46s/batch, N=1.5571, E=0.8262,
KL=0.1585, wKL=0.7000]
Train E28: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.5571, E=0.8262,
KL=0.1585, wKL=0.7000]
Train E28: 64%|██████▍ | 16/25 [00:24<00:12, 1.44s/batch, N=1.5183, E=0.8212,
KL=0.1586, wKL=0.7000]
Train E28: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.5183, E=0.8212,
KL=0.1586, wKL=0.7000]
Train E28: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.6235, E=0.8213,
KL=0.1600, wKL=0.7000]
Train E28: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.6235, E=0.8213,
KL=0.1600, wKL=0.7000]
Train E28: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4393, E=0.8235,
KL=0.1575, wKL=0.7000]
Train E28: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.4393, E=0.8235,
KL=0.1575, wKL=0.7000]
Train E28: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4900, E=0.8218,
KL=0.1594, wKL=0.7000]
Train E28: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4900, E=0.8218,
KL=0.1594, wKL=0.7000]
Train E28: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5360, E=0.8193,
KL=0.1586, wKL=0.7000]
Train E28: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5360, E=0.8193,
KL=0.1586, wKL=0.7000]
Train E28: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5623, E=0.8235,
KL=0.1598, wKL=0.7000]
Train E28: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5623, E=0.8235,
KL=0.1598, wKL=0.7000]
Train E28: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.3578, E=0.8217,
KL=0.1563, wKL=0.7000]
Train E28: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.3578, E=0.8217,
KL=0.1563, wKL=0.7000]
Train E28: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5508, E=0.8266,
KL=0.1567, wKL=0.7000]
Train E28: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5508, E=0.8266,
KL=0.1567, wKL=0.7000]
Train E28: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4833, E=0.8317,
KL=0.1573, wKL=0.7000]
Train E28: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4833, E=0.8317,
KL=0.1573, wKL=0.7000]
Train E28: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4833, E=0.8317,
KL=0.1573, wKL=0.7000]
1035.2s 129 [Epoch 028] Total: 2.3673 | N: 1.4883 | E: 0.8233 | KL(0.70×0.5):
0.1591
1070.0s 130 Train E29: 0%| | 0/25 [00:00<?, ?batch/s]
Train E29: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4347, E=0.8272, KL=0.1542,
wKL=0.7250]
Train E29: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4347, E=0.8272,
KL=0.1542, wKL=0.7250]
Train E29: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5497, E=0.8231,
KL=0.1580, wKL=0.7250]
Train E29: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.5497, E=0.8231,
KL=0.1580, wKL=0.7250]
Train E29: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4634, E=0.8214,
KL=0.1549, wKL=0.7250]
Train E29: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4634, E=0.8214,
KL=0.1549, wKL=0.7250]
Train E29: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5020, E=0.8177,
KL=0.1548, wKL=0.7250]
Train E29: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5020, E=0.8177,
KL=0.1548, wKL=0.7250]
Train E29: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4169, E=0.8187,
KL=0.1554, wKL=0.7250]
Train E29: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4169, E=0.8187,
KL=0.1554, wKL=0.7250]
Train E29: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4167, E=0.8264,
KL=0.1530, wKL=0.7250]
Train E29: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4167, E=0.8264,
KL=0.1530, wKL=0.7250]
Train E29: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5830, E=0.8255,
KL=0.1540, wKL=0.7250]
Train E29: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.5830, E=0.8255,
KL=0.1540, wKL=0.7250]
Train E29: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5142, E=0.8186,
KL=0.1543, wKL=0.7250]
Train E29: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.5142, E=0.8186,
KL=0.1543, wKL=0.7250]
Train E29: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4767, E=0.8210,
KL=0.1531, wKL=0.7250]
Train E29: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4767, E=0.8210,
KL=0.1531, wKL=0.7250]
Train E29: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.5523, E=0.8236,
KL=0.1538, wKL=0.7250]
Train E29: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5523, E=0.8236,
KL=0.1538, wKL=0.7250]
Train E29: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.3959, E=0.8267,
KL=0.1541, wKL=0.7250]
Train E29: 44%|████▍ | 11/25 [00:15<00:19, 1.42s/batch, N=1.3959, E=0.8267,
KL=0.1541, wKL=0.7250]
Train E29: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.5864, E=0.8208,
KL=0.1568, wKL=0.7250]
Train E29: 48%|████▊ | 12/25 [00:16<00:18, 1.44s/batch, N=1.5864, E=0.8208,
KL=0.1568, wKL=0.7250]
Train E29: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.4578, E=0.8226,
KL=0.1543, wKL=0.7250]
Train E29: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4578, E=0.8226,
KL=0.1543, wKL=0.7250]
Train E29: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.4437, E=0.8224,
KL=0.1547, wKL=0.7250]
Train E29: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.4437, E=0.8224,
KL=0.1547, wKL=0.7250]
Train E29: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4298, E=0.8221,
KL=0.1549, wKL=0.7250]
Train E29: 60%|██████ | 15/25 [00:21<00:15, 1.58s/batch, N=1.4298, E=0.8221,
KL=0.1549, wKL=0.7250]
Train E29: 60%|██████ | 15/25 [00:22<00:15, 1.58s/batch, N=1.5296, E=0.8273,
KL=0.1540, wKL=0.7250]
Train E29: 64%|██████▍ | 16/25 [00:22<00:13, 1.52s/batch, N=1.5296, E=0.8273,
KL=0.1540, wKL=0.7250]
Train E29: 64%|██████▍ | 16/25 [00:24<00:13, 1.52s/batch, N=1.5725, E=0.8252,
KL=0.1565, wKL=0.7250]
Train E29: 68%|██████▊ | 17/25 [00:24<00:11, 1.49s/batch, N=1.5725, E=0.8252,
KL=0.1565, wKL=0.7250]
Train E29: 68%|██████▊ | 17/25 [00:25<00:11, 1.49s/batch, N=1.5650, E=0.8186,
KL=0.1532, wKL=0.7250]
Train E29: 72%|███████▏ | 18/25 [00:25<00:10, 1.47s/batch, N=1.5650, E=0.8186,
KL=0.1532, wKL=0.7250]
Train E29: 72%|███████▏ | 18/25 [00:27<00:10, 1.47s/batch, N=1.4760, E=0.8249,
KL=0.1528, wKL=0.7250]
Train E29: 76%|███████▌ | 19/25 [00:27<00:08, 1.45s/batch, N=1.4760, E=0.8249,
KL=0.1528, wKL=0.7250]
Train E29: 76%|███████▌ | 19/25 [00:28<00:08, 1.45s/batch, N=1.4987, E=0.8262,
KL=0.1540, wKL=0.7250]
Train E29: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4987, E=0.8262,
KL=0.1540, wKL=0.7250]
Train E29: 80%|████████ | 20/25 [00:29<00:07, 1.42s/batch, N=1.4728, E=0.8222,
KL=0.1514, wKL=0.7250]
Train E29: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4728, E=0.8222,
KL=0.1514, wKL=0.7250]
Train E29: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5235, E=0.8316,
KL=0.1531, wKL=0.7250]
Train E29: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5235, E=0.8316,
KL=0.1531, wKL=0.7250]
Train E29: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.3917, E=0.8257,
KL=0.1507, wKL=0.7250]
Train E29: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.3917, E=0.8257,
KL=0.1507, wKL=0.7250]
Train E29: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4670, E=0.8277,
KL=0.1517, wKL=0.7250]
Train E29: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4670, E=0.8277,
KL=0.1517, wKL=0.7250]
Train E29: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5261, E=0.8165,
KL=0.1537, wKL=0.7250]
Train E29: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5261, E=0.8165,
KL=0.1537, wKL=0.7250]
Train E29: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5261, E=0.8165,
KL=0.1537, wKL=0.7250]
1070.0s 131 [Epoch 029] Total: 2.3683 | N: 1.4890 | E: 0.8235 | KL(0.72×0.5):
0.1541
1104.9s 132 Train E30: 0%| | 0/25 [00:00<?, ?batch/s]
Train E30: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4621, E=0.8241, KL=0.1524,
wKL=0.7500]
Train E30: 4%|▍ | 1/25 [00:01<00:33, 1.39s/batch, N=1.4621, E=0.8241,
KL=0.1524, wKL=0.7500]
Train E30: 4%|▍ | 1/25 [00:02<00:33, 1.39s/batch, N=1.4448, E=0.8217,
KL=0.1525, wKL=0.7500]
Train E30: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4448, E=0.8217,
KL=0.1525, wKL=0.7500]
Train E30: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.6167, E=0.8271,
KL=0.1520, wKL=0.7500]
Train E30: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.6167, E=0.8271,
KL=0.1520, wKL=0.7500]
Train E30: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.4834, E=0.8184,
KL=0.1531, wKL=0.7500]
Train E30: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4834, E=0.8184,
KL=0.1531, wKL=0.7500]
Train E30: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4240, E=0.8199,
KL=0.1509, wKL=0.7500]
Train E30: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4240, E=0.8199,
KL=0.1509, wKL=0.7500]
Train E30: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4628, E=0.8240,
KL=0.1518, wKL=0.7500]
Train E30: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4628, E=0.8240,
KL=0.1518, wKL=0.7500]
Train E30: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5116, E=0.8237,
KL=0.1500, wKL=0.7500]
Train E30: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5116, E=0.8237,
KL=0.1500, wKL=0.7500]
Train E30: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.5467, E=0.8247,
KL=0.1492, wKL=0.7500]
Train E30: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5467, E=0.8247,
KL=0.1492, wKL=0.7500]
Train E30: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4071, E=0.8291,
KL=0.1479, wKL=0.7500]
Train E30: 36%|███▌ | 9/25 [00:12<00:23, 1.46s/batch, N=1.4071, E=0.8291,
KL=0.1479, wKL=0.7500]
Train E30: 36%|███▌ | 9/25 [00:14<00:23, 1.46s/batch, N=1.4695, E=0.8215,
KL=0.1489, wKL=0.7500]
Train E30: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.4695, E=0.8215,
KL=0.1489, wKL=0.7500]
Train E30: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4986, E=0.8220,
KL=0.1488, wKL=0.7500]
Train E30: 44%|████▍ | 11/25 [00:15<00:20, 1.43s/batch, N=1.4986, E=0.8220,
KL=0.1488, wKL=0.7500]
Train E30: 44%|████▍ | 11/25 [00:16<00:20, 1.43s/batch, N=1.4979, E=0.8262,
KL=0.1494, wKL=0.7500]
Train E30: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.4979, E=0.8262,
KL=0.1494, wKL=0.7500]
Train E30: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.4749, E=0.8277,
KL=0.1483, wKL=0.7500]
Train E30: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4749, E=0.8277,
KL=0.1483, wKL=0.7500]
Train E30: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4977, E=0.8231,
KL=0.1496, wKL=0.7500]
Train E30: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4977, E=0.8231,
KL=0.1496, wKL=0.7500]
Train E30: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4391, E=0.8221,
KL=0.1491, wKL=0.7500]
Train E30: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4391, E=0.8221,
KL=0.1491, wKL=0.7500]
Train E30: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.5263, E=0.8217,
KL=0.1499, wKL=0.7500]
Train E30: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5263, E=0.8217,
KL=0.1499, wKL=0.7500]
Train E30: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.5557, E=0.8228,
KL=0.1508, wKL=0.7500]
Train E30: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5557, E=0.8228,
KL=0.1508, wKL=0.7500]
Train E30: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5575, E=0.8170,
KL=0.1476, wKL=0.7500]
Train E30: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5575, E=0.8170,
KL=0.1476, wKL=0.7500]
Train E30: 72%|███████▏ | 18/25 [00:27<00:09, 1.39s/batch, N=1.5374, E=0.8224,
KL=0.1490, wKL=0.7500]
Train E30: 76%|███████▌ | 19/25 [00:27<00:09, 1.58s/batch, N=1.5374, E=0.8224,
KL=0.1490, wKL=0.7500]
Train E30: 76%|███████▌ | 19/25 [00:28<00:09, 1.58s/batch, N=1.5458, E=0.8247,
KL=0.1476, wKL=0.7500]
Train E30: 80%|████████ | 20/25 [00:28<00:07, 1.52s/batch, N=1.5458, E=0.8247,
KL=0.1476, wKL=0.7500]
Train E30: 80%|████████ | 20/25 [00:30<00:07, 1.52s/batch, N=1.4827, E=0.8263,
KL=0.1478, wKL=0.7500]
Train E30: 84%|████████▍ | 21/25 [00:30<00:05, 1.48s/batch, N=1.4827, E=0.8263,
KL=0.1478, wKL=0.7500]
Train E30: 84%|████████▍ | 21/25 [00:31<00:05, 1.48s/batch, N=1.3661, E=0.8217,
KL=0.1486, wKL=0.7500]
Train E30: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.3661, E=0.8217,
KL=0.1486, wKL=0.7500]
Train E30: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.4920, E=0.8242,
KL=0.1460, wKL=0.7500]
Train E30: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.4920, E=0.8242,
KL=0.1460, wKL=0.7500]
Train E30: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.4350, E=0.8216,
KL=0.1485, wKL=0.7500]
Train E30: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.4350, E=0.8216,
KL=0.1485, wKL=0.7500]
Train E30: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.4819, E=0.8261,
KL=0.1479, wKL=0.7500]
Train E30: 100%|██████████| 25/25 [00:34<00:00, 1.19s/batch, N=1.4819, E=0.8261,
KL=0.1479, wKL=0.7500]
Train E30: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4819, E=0.8261,
KL=0.1479, wKL=0.7500]
1104.9s 133 [Epoch 030] Total: 2.3682 | N: 1.4889 | E: 0.8233 | KL(0.75×0.5):
0.1495
1104.9s 134 Saved checkpoint: /kaggle/working/checkpoints/gvae_30_epoch030.pt
1139.7s 135 Train E31: 0%| | 0/25 [00:00<?, ?batch/s]
Train E31: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5405, E=0.8219, KL=0.1472,
wKL=0.7750]
Train E31: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5405, E=0.8219,
KL=0.1472, wKL=0.7750]
Train E31: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5836, E=0.8256,
KL=0.1474, wKL=0.7750]
Train E31: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5836, E=0.8256,
KL=0.1474, wKL=0.7750]
Train E31: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4920, E=0.8246,
KL=0.1458, wKL=0.7750]
Train E31: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4920, E=0.8246,
KL=0.1458, wKL=0.7750]
Train E31: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4810, E=0.8253,
KL=0.1452, wKL=0.7750]
Train E31: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4810, E=0.8253,
KL=0.1452, wKL=0.7750]
Train E31: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4772, E=0.8285,
KL=0.1448, wKL=0.7750]
Train E31: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4772, E=0.8285,
KL=0.1448, wKL=0.7750]
Train E31: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4420, E=0.8184,
KL=0.1446, wKL=0.7750]
Train E31: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4420, E=0.8184,
KL=0.1446, wKL=0.7750]
Train E31: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5350, E=0.8218,
KL=0.1454, wKL=0.7750]
Train E31: 28%|██▊ | 7/25 [00:09<00:26, 1.49s/batch, N=1.5350, E=0.8218,
KL=0.1454, wKL=0.7750]
Train E31: 28%|██▊ | 7/25 [00:11<00:26, 1.49s/batch, N=1.5586, E=0.8255,
KL=0.1449, wKL=0.7750]
Train E31: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.5586, E=0.8255,
KL=0.1449, wKL=0.7750]
Train E31: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.4219, E=0.8225,
KL=0.1441, wKL=0.7750]
Train E31: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.4219, E=0.8225,
KL=0.1441, wKL=0.7750]
Train E31: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.4970, E=0.8234,
KL=0.1458, wKL=0.7750]
Train E31: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4970, E=0.8234,
KL=0.1458, wKL=0.7750]
Train E31: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4820, E=0.8245,
KL=0.1445, wKL=0.7750]
Train E31: 44%|████▍ | 11/25 [00:15<00:19, 1.42s/batch, N=1.4820, E=0.8245,
KL=0.1445, wKL=0.7750]
Train E31: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.4467, E=0.8221,
KL=0.1452, wKL=0.7750]
Train E31: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4467, E=0.8221,
KL=0.1452, wKL=0.7750]
Train E31: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.3781, E=0.8219,
KL=0.1446, wKL=0.7750]
Train E31: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.3781, E=0.8219,
KL=0.1446, wKL=0.7750]
Train E31: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5280, E=0.8291,
KL=0.1446, wKL=0.7750]
Train E31: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5280, E=0.8291,
KL=0.1446, wKL=0.7750]
Train E31: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4462, E=0.8217,
KL=0.1455, wKL=0.7750]
Train E31: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.4462, E=0.8217,
KL=0.1455, wKL=0.7750]
Train E31: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.4947, E=0.8237,
KL=0.1445, wKL=0.7750]
Train E31: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4947, E=0.8237,
KL=0.1445, wKL=0.7750]
Train E31: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5019, E=0.8227,
KL=0.1449, wKL=0.7750]
Train E31: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5019, E=0.8227,
KL=0.1449, wKL=0.7750]
Train E31: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.5066, E=0.8258,
KL=0.1443, wKL=0.7750]
Train E31: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5066, E=0.8258,
KL=0.1443, wKL=0.7750]
Train E31: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4949, E=0.8243,
KL=0.1441, wKL=0.7750]
Train E31: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4949, E=0.8243,
KL=0.1441, wKL=0.7750]
Train E31: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5473, E=0.8259,
KL=0.1454, wKL=0.7750]
Train E31: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.5473, E=0.8259,
KL=0.1454, wKL=0.7750]
Train E31: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5141, E=0.8215,
KL=0.1445, wKL=0.7750]
Train E31: 84%|████████▍ | 21/25 [00:29<00:05, 1.38s/batch, N=1.5141, E=0.8215,
KL=0.1445, wKL=0.7750]
Train E31: 84%|████████▍ | 21/25 [00:30<00:05, 1.38s/batch, N=1.3829, E=0.8229,
KL=0.1457, wKL=0.7750]
Train E31: 88%|████████▊ | 22/25 [00:30<00:04, 1.38s/batch, N=1.3829, E=0.8229,
KL=0.1457, wKL=0.7750]
Train E31: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.4962, E=0.8160,
KL=0.1466, wKL=0.7750]
Train E31: 92%|█████████▏| 23/25 [00:32<00:03, 1.57s/batch, N=1.4962, E=0.8160,
KL=0.1466, wKL=0.7750]
Train E31: 92%|█████████▏| 23/25 [00:34<00:03, 1.57s/batch, N=1.5284, E=0.8235,
KL=0.1466, wKL=0.7750]
Train E31: 96%|█████████▌| 24/25 [00:34<00:01, 1.51s/batch, N=1.5284, E=0.8235,
KL=0.1466, wKL=0.7750]
Train E31: 96%|█████████▌| 24/25 [00:34<00:01, 1.51s/batch, N=1.3842, E=0.8176,
KL=0.1451, wKL=0.7750]
Train E31: 100%|██████████| 25/25 [00:34<00:00, 1.24s/batch, N=1.3842, E=0.8176,
KL=0.1451, wKL=0.7750]
Train E31: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.3842, E=0.8176,
KL=0.1451, wKL=0.7750]
1139.7s 136 [Epoch 031] Total: 2.3685 | N: 1.4889 | E: 0.8234 | KL(0.78×0.5):
0.1453
1173.9s 137 Train E32: 0%| | 0/25 [00:00<?, ?batch/s]
Train E32: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5335, E=0.8253, KL=0.1451,
wKL=0.8000]
Train E32: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5335, E=0.8253,
KL=0.1451, wKL=0.8000]
Train E32: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.5083, E=0.8255,
KL=0.1444, wKL=0.8000]
Train E32: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5083, E=0.8255,
KL=0.1444, wKL=0.8000]
Train E32: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.5043, E=0.8250,
KL=0.1433, wKL=0.8000]
Train E32: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5043, E=0.8250,
KL=0.1433, wKL=0.8000]
Train E32: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4564, E=0.8267,
KL=0.1401, wKL=0.8000]
Train E32: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.4564, E=0.8267,
KL=0.1401, wKL=0.8000]
Train E32: 16%|█▌ | 4/25 [00:07<00:29, 1.38s/batch, N=1.4805, E=0.8281,
KL=0.1402, wKL=0.8000]
Train E32: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.4805, E=0.8281,
KL=0.1402, wKL=0.8000]
Train E32: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.5015, E=0.8239,
KL=0.1380, wKL=0.8000]
Train E32: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.5015, E=0.8239,
KL=0.1380, wKL=0.8000]
Train E32: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5169, E=0.8256,
KL=0.1377, wKL=0.8000]
Train E32: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5169, E=0.8256,
KL=0.1377, wKL=0.8000]
Train E32: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4322, E=0.8234,
KL=0.1377, wKL=0.8000]
Train E32: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.4322, E=0.8234,
KL=0.1377, wKL=0.8000]
Train E32: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.5228, E=0.8248,
KL=0.1396, wKL=0.8000]
Train E32: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5228, E=0.8248,
KL=0.1396, wKL=0.8000]
Train E32: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4262, E=0.8217,
KL=0.1389, wKL=0.8000]
Train E32: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4262, E=0.8217,
KL=0.1389, wKL=0.8000]
Train E32: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.3619, E=0.8251,
KL=0.1399, wKL=0.8000]
Train E32: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.3619, E=0.8251,
KL=0.1399, wKL=0.8000]
Train E32: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5627, E=0.8233,
KL=0.1428, wKL=0.8000]
Train E32: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5627, E=0.8233,
KL=0.1428, wKL=0.8000]
Train E32: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5259, E=0.8187,
KL=0.1431, wKL=0.8000]
Train E32: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5259, E=0.8187,
KL=0.1431, wKL=0.8000]
Train E32: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5005, E=0.8238,
KL=0.1429, wKL=0.8000]
Train E32: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5005, E=0.8238,
KL=0.1429, wKL=0.8000]
Train E32: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4982, E=0.8253,
KL=0.1437, wKL=0.8000]
Train E32: 60%|██████ | 15/25 [00:20<00:14, 1.40s/batch, N=1.4982, E=0.8253,
KL=0.1437, wKL=0.8000]
Train E32: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4044, E=0.8217,
KL=0.1415, wKL=0.8000]
Train E32: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4044, E=0.8217,
KL=0.1415, wKL=0.8000]
Train E32: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5273, E=0.8208,
KL=0.1418, wKL=0.8000]
Train E32: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5273, E=0.8208,
KL=0.1418, wKL=0.8000]
Train E32: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5201, E=0.8204,
KL=0.1425, wKL=0.8000]
Train E32: 72%|███████▏ | 18/25 [00:25<00:09, 1.38s/batch, N=1.5201, E=0.8204,
KL=0.1425, wKL=0.8000]
Train E32: 72%|███████▏ | 18/25 [00:26<00:09, 1.38s/batch, N=1.5206, E=0.8213,
KL=0.1406, wKL=0.8000]
Train E32: 76%|███████▌ | 19/25 [00:26<00:08, 1.38s/batch, N=1.5206, E=0.8213,
KL=0.1406, wKL=0.8000]
Train E32: 76%|███████▌ | 19/25 [00:27<00:08, 1.38s/batch, N=1.4744, E=0.8250,
KL=0.1391, wKL=0.8000]
Train E32: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.4744, E=0.8250,
KL=0.1391, wKL=0.8000]
Train E32: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5144, E=0.8198,
KL=0.1381, wKL=0.8000]
Train E32: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5144, E=0.8198,
KL=0.1381, wKL=0.8000]
Train E32: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.5535, E=0.8204,
KL=0.1367, wKL=0.8000]
Train E32: 88%|████████▊ | 22/25 [00:30<00:04, 1.39s/batch, N=1.5535, E=0.8204,
KL=0.1367, wKL=0.8000]
Train E32: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.3901, E=0.8202,
KL=0.1382, wKL=0.8000]
Train E32: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.3901, E=0.8202,
KL=0.1382, wKL=0.8000]
Train E32: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4896, E=0.8242,
KL=0.1377, wKL=0.8000]
Train E32: 96%|█████████▌| 24/25 [00:33<00:01, 1.42s/batch, N=1.4896, E=0.8242,
KL=0.1377, wKL=0.8000]
Train E32: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.5109, E=0.8219,
KL=0.1382, wKL=0.8000]
Train E32: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.5109, E=0.8219,
KL=0.1382, wKL=0.8000]
Train E32: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.5109, E=0.8219,
KL=0.1382, wKL=0.8000]
1173.9s 138 [Epoch 032] Total: 2.3685 | N: 1.4890 | E: 0.8233 | KL(0.80×0.5):
0.1405
1209.1s 139 Train E33: 0%| | 0/25 [00:00<?, ?batch/s]
Train E33: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5660, E=0.8236, KL=0.1380,
wKL=0.8250]
Train E33: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5660, E=0.8236,
KL=0.1380, wKL=0.8250]
Train E33: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.3950, E=0.8211,
KL=0.1381, wKL=0.8250]
Train E33: 8%|▊ | 2/25 [00:02<00:32, 1.40s/batch, N=1.3950, E=0.8211,
KL=0.1381, wKL=0.8250]
Train E33: 8%|▊ | 2/25 [00:04<00:32, 1.40s/batch, N=1.5396, E=0.8229,
KL=0.1400, wKL=0.8250]
Train E33: 12%|█▏ | 3/25 [00:04<00:32, 1.49s/batch, N=1.5396, E=0.8229,
KL=0.1400, wKL=0.8250]
Train E33: 12%|█▏ | 3/25 [00:05<00:32, 1.49s/batch, N=1.4796, E=0.8253,
KL=0.1382, wKL=0.8250]
Train E33: 16%|█▌ | 4/25 [00:05<00:30, 1.47s/batch, N=1.4796, E=0.8253,
KL=0.1382, wKL=0.8250]
Train E33: 16%|█▌ | 4/25 [00:07<00:30, 1.47s/batch, N=1.4762, E=0.8303,
KL=0.1379, wKL=0.8250]
Train E33: 20%|██ | 5/25 [00:07<00:29, 1.45s/batch, N=1.4762, E=0.8303,
KL=0.1379, wKL=0.8250]
Train E33: 20%|██ | 5/25 [00:08<00:29, 1.45s/batch, N=1.4490, E=0.8192,
KL=0.1388, wKL=0.8250]
Train E33: 24%|██▍ | 6/25 [00:08<00:27, 1.44s/batch, N=1.4490, E=0.8192,
KL=0.1388, wKL=0.8250]
Train E33: 24%|██▍ | 6/25 [00:10<00:27, 1.44s/batch, N=1.4558, E=0.8242,
KL=0.1362, wKL=0.8250]
Train E33: 28%|██▊ | 7/25 [00:10<00:29, 1.62s/batch, N=1.4558, E=0.8242,
KL=0.1362, wKL=0.8250]
Train E33: 28%|██▊ | 7/25 [00:12<00:29, 1.62s/batch, N=1.5475, E=0.8254,
KL=0.1370, wKL=0.8250]
Train E33: 32%|███▏ | 8/25 [00:12<00:26, 1.54s/batch, N=1.5475, E=0.8254,
KL=0.1370, wKL=0.8250]
Train E33: 32%|███▏ | 8/25 [00:13<00:26, 1.54s/batch, N=1.4838, E=0.8240,
KL=0.1365, wKL=0.8250]
Train E33: 36%|███▌ | 9/25 [00:13<00:23, 1.49s/batch, N=1.4838, E=0.8240,
KL=0.1365, wKL=0.8250]
Train E33: 36%|███▌ | 9/25 [00:14<00:23, 1.49s/batch, N=1.4609, E=0.8262,
KL=0.1357, wKL=0.8250]
Train E33: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.4609, E=0.8262,
KL=0.1357, wKL=0.8250]
Train E33: 40%|████ | 10/25 [00:16<00:21, 1.46s/batch, N=1.4215, E=0.8265,
KL=0.1349, wKL=0.8250]
Train E33: 44%|████▍ | 11/25 [00:16<00:20, 1.43s/batch, N=1.4215, E=0.8265,
KL=0.1349, wKL=0.8250]
Train E33: 44%|████▍ | 11/25 [00:17<00:20, 1.43s/batch, N=1.4652, E=0.8216,
KL=0.1357, wKL=0.8250]
Train E33: 48%|████▊ | 12/25 [00:17<00:18, 1.44s/batch, N=1.4652, E=0.8216,
KL=0.1357, wKL=0.8250]
Train E33: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.4590, E=0.8262,
KL=0.1341, wKL=0.8250]
Train E33: 52%|█████▏ | 13/25 [00:18<00:16, 1.42s/batch, N=1.4590, E=0.8262,
KL=0.1341, wKL=0.8250]
Train E33: 52%|█████▏ | 13/25 [00:20<00:16, 1.42s/batch, N=1.5590, E=0.8273,
KL=0.1348, wKL=0.8250]
Train E33: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.5590, E=0.8273,
KL=0.1348, wKL=0.8250]
Train E33: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.5220, E=0.8222,
KL=0.1342, wKL=0.8250]
Train E33: 60%|██████ | 15/25 [00:21<00:13, 1.39s/batch, N=1.5220, E=0.8222,
KL=0.1342, wKL=0.8250]
Train E33: 60%|██████ | 15/25 [00:23<00:13, 1.39s/batch, N=1.4700, E=0.8246,
KL=0.1346, wKL=0.8250]
Train E33: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4700, E=0.8246,
KL=0.1346, wKL=0.8250]
Train E33: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.5006, E=0.8253,
KL=0.1350, wKL=0.8250]
Train E33: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5006, E=0.8253,
KL=0.1350, wKL=0.8250]
Train E33: 68%|██████▊ | 17/25 [00:26<00:11, 1.40s/batch, N=1.4063, E=0.8239,
KL=0.1335, wKL=0.8250]
Train E33: 72%|███████▏ | 18/25 [00:26<00:10, 1.43s/batch, N=1.4063, E=0.8239,
KL=0.1335, wKL=0.8250]
Train E33: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.4765, E=0.8208,
KL=0.1358, wKL=0.8250]
Train E33: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.4765, E=0.8208,
KL=0.1358, wKL=0.8250]
Train E33: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.5272, E=0.8236,
KL=0.1348, wKL=0.8250]
Train E33: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5272, E=0.8236,
KL=0.1348, wKL=0.8250]
Train E33: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.5352, E=0.8216,
KL=0.1350, wKL=0.8250]
Train E33: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5352, E=0.8216,
KL=0.1350, wKL=0.8250]
Train E33: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4673, E=0.8185,
KL=0.1357, wKL=0.8250]
Train E33: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.4673, E=0.8185,
KL=0.1357, wKL=0.8250]
Train E33: 88%|████████▊ | 22/25 [00:33<00:04, 1.40s/batch, N=1.5042, E=0.8231,
KL=0.1336, wKL=0.8250]
Train E33: 92%|█████████▏| 23/25 [00:33<00:02, 1.39s/batch, N=1.5042, E=0.8231,
KL=0.1336, wKL=0.8250]
Train E33: 92%|█████████▏| 23/25 [00:34<00:02, 1.39s/batch, N=1.5281, E=0.8194,
KL=0.1355, wKL=0.8250]
Train E33: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5281, E=0.8194,
KL=0.1355, wKL=0.8250]
Train E33: 96%|█████████▌| 24/25 [00:35<00:01, 1.39s/batch, N=1.6140, E=0.8276,
KL=0.1353, wKL=0.8250]
Train E33: 100%|██████████| 25/25 [00:35<00:00, 1.19s/batch, N=1.6140, E=0.8276,
KL=0.1353, wKL=0.8250]
Train E33: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.6140, E=0.8276,
KL=0.1353, wKL=0.8250]
1209.1s 140 [Epoch 033] Total: 2.3692 | N: 1.4895 | E: 0.8237 | KL(0.82×0.5):
0.1360
1243.8s 141 Train E34: 0%| | 0/25 [00:00<?, ?batch/s]
Train E34: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5060, E=0.8251, KL=0.1327,
wKL=0.8500]
Train E34: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.5060, E=0.8251,
KL=0.1327, wKL=0.8500]
Train E34: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.4803, E=0.8254,
KL=0.1340, wKL=0.8500]
Train E34: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4803, E=0.8254,
KL=0.1340, wKL=0.8500]
Train E34: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4526, E=0.8243,
KL=0.1328, wKL=0.8500]
Train E34: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4526, E=0.8243,
KL=0.1328, wKL=0.8500]
Train E34: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5122, E=0.8203,
KL=0.1327, wKL=0.8500]
Train E34: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5122, E=0.8203,
KL=0.1327, wKL=0.8500]
Train E34: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5363, E=0.8263,
KL=0.1334, wKL=0.8500]
Train E34: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5363, E=0.8263,
KL=0.1334, wKL=0.8500]
Train E34: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4758, E=0.8253,
KL=0.1319, wKL=0.8500]
Train E34: 24%|██▍ | 6/25 [00:08<00:25, 1.36s/batch, N=1.4758, E=0.8253,
KL=0.1319, wKL=0.8500]
Train E34: 24%|██▍ | 6/25 [00:09<00:25, 1.36s/batch, N=1.4749, E=0.8255,
KL=0.1312, wKL=0.8500]
Train E34: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4749, E=0.8255,
KL=0.1312, wKL=0.8500]
Train E34: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4804, E=0.8176,
KL=0.1317, wKL=0.8500]
Train E34: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.4804, E=0.8176,
KL=0.1317, wKL=0.8500]
Train E34: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.4409, E=0.8231,
KL=0.1311, wKL=0.8500]
Train E34: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4409, E=0.8231,
KL=0.1311, wKL=0.8500]
Train E34: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.4400, E=0.8221,
KL=0.1311, wKL=0.8500]
Train E34: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4400, E=0.8221,
KL=0.1311, wKL=0.8500]
Train E34: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.4646, E=0.8242,
KL=0.1313, wKL=0.8500]
Train E34: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.4646, E=0.8242,
KL=0.1313, wKL=0.8500]
Train E34: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.5191, E=0.8253,
KL=0.1314, wKL=0.8500]
Train E34: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.5191, E=0.8253,
KL=0.1314, wKL=0.8500]
Train E34: 48%|████▊ | 12/25 [00:18<00:17, 1.38s/batch, N=1.5061, E=0.8197,
KL=0.1312, wKL=0.8500]
Train E34: 52%|█████▏ | 13/25 [00:18<00:18, 1.57s/batch, N=1.5061, E=0.8197,
KL=0.1312, wKL=0.8500]
Train E34: 52%|█████▏ | 13/25 [00:19<00:18, 1.57s/batch, N=1.4309, E=0.8260,
KL=0.1295, wKL=0.8500]
Train E34: 56%|█████▌ | 14/25 [00:19<00:16, 1.52s/batch, N=1.4309, E=0.8260,
KL=0.1295, wKL=0.8500]
Train E34: 56%|█████▌ | 14/25 [00:21<00:16, 1.52s/batch, N=1.5514, E=0.8220,
KL=0.1337, wKL=0.8500]
Train E34: 60%|██████ | 15/25 [00:21<00:14, 1.49s/batch, N=1.5514, E=0.8220,
KL=0.1337, wKL=0.8500]
Train E34: 60%|██████ | 15/25 [00:22<00:14, 1.49s/batch, N=1.5622, E=0.8240,
KL=0.1315, wKL=0.8500]
Train E34: 64%|██████▍ | 16/25 [00:22<00:13, 1.47s/batch, N=1.5622, E=0.8240,
KL=0.1315, wKL=0.8500]
Train E34: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5277, E=0.8278,
KL=0.1307, wKL=0.8500]
Train E34: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5277, E=0.8278,
KL=0.1307, wKL=0.8500]
Train E34: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4747, E=0.8235,
KL=0.1339, wKL=0.8500]
Train E34: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4747, E=0.8235,
KL=0.1339, wKL=0.8500]
Train E34: 72%|███████▏ | 18/25 [00:26<00:09, 1.42s/batch, N=1.5264, E=0.8245,
KL=0.1317, wKL=0.8500]
Train E34: 76%|███████▌ | 19/25 [00:26<00:08, 1.41s/batch, N=1.5264, E=0.8245,
KL=0.1317, wKL=0.8500]
Train E34: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4576, E=0.8191,
KL=0.1305, wKL=0.8500]
Train E34: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.4576, E=0.8191,
KL=0.1305, wKL=0.8500]
Train E34: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.4281, E=0.8215,
KL=0.1317, wKL=0.8500]
Train E34: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4281, E=0.8215,
KL=0.1317, wKL=0.8500]
Train E34: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5200, E=0.8273,
KL=0.1291, wKL=0.8500]
Train E34: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.5200, E=0.8273,
KL=0.1291, wKL=0.8500]
Train E34: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5702, E=0.8251,
KL=0.1286, wKL=0.8500]
Train E34: 92%|█████████▏| 23/25 [00:32<00:02, 1.47s/batch, N=1.5702, E=0.8251,
KL=0.1286, wKL=0.8500]
Train E34: 92%|█████████▏| 23/25 [00:34<00:02, 1.47s/batch, N=1.3967, E=0.8190,
KL=0.1278, wKL=0.8500]
Train E34: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.3967, E=0.8190,
KL=0.1278, wKL=0.8500]
Train E34: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5163, E=0.8254,
KL=0.1268, wKL=0.8500]
Train E34: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.5163, E=0.8254,
KL=0.1268, wKL=0.8500]
Train E34: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5163, E=0.8254,
KL=0.1268, wKL=0.8500]
1243.8s 142 [Epoch 034] Total: 2.3688 | N: 1.4894 | E: 0.8235 | KL(0.85×0.5):
0.1314
1278.5s 143 Train E35: 0%| | 0/25 [00:00<?, ?batch/s]
Train E35: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4119, E=0.8222, KL=0.1271,
wKL=0.8750]
Train E35: 4%|▍ | 1/25 [00:01<00:32, 1.33s/batch, N=1.4119, E=0.8222,
KL=0.1271, wKL=0.8750]
Train E35: 4%|▍ | 1/25 [00:02<00:32, 1.33s/batch, N=1.3558, E=0.8200,
KL=0.1256, wKL=0.8750]
Train E35: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.3558, E=0.8200,
KL=0.1256, wKL=0.8750]
Train E35: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4430, E=0.8227,
KL=0.1269, wKL=0.8750]
Train E35: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4430, E=0.8227,
KL=0.1269, wKL=0.8750]
Train E35: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4491, E=0.8281,
KL=0.1276, wKL=0.8750]
Train E35: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4491, E=0.8281,
KL=0.1276, wKL=0.8750]
Train E35: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4885, E=0.8313,
KL=0.1279, wKL=0.8750]
Train E35: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4885, E=0.8313,
KL=0.1279, wKL=0.8750]
Train E35: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4725, E=0.8245,
KL=0.1279, wKL=0.8750]
Train E35: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.4725, E=0.8245,
KL=0.1279, wKL=0.8750]
Train E35: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.4273, E=0.8281,
KL=0.1280, wKL=0.8750]
Train E35: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4273, E=0.8281,
KL=0.1280, wKL=0.8750]
Train E35: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.5760, E=0.8268,
KL=0.1300, wKL=0.8750]
Train E35: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.5760, E=0.8268,
KL=0.1300, wKL=0.8750]
Train E35: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.5736, E=0.8220,
KL=0.1293, wKL=0.8750]
Train E35: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.5736, E=0.8220,
KL=0.1293, wKL=0.8750]
Train E35: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.5052, E=0.8220,
KL=0.1286, wKL=0.8750]
Train E35: 40%|████ | 10/25 [00:13<00:20, 1.37s/batch, N=1.5052, E=0.8220,
KL=0.1286, wKL=0.8750]
Train E35: 40%|████ | 10/25 [00:15<00:20, 1.37s/batch, N=1.5273, E=0.8229,
KL=0.1287, wKL=0.8750]
Train E35: 44%|████▍ | 11/25 [00:15<00:19, 1.37s/batch, N=1.5273, E=0.8229,
KL=0.1287, wKL=0.8750]
Train E35: 44%|████▍ | 11/25 [00:16<00:19, 1.37s/batch, N=1.4564, E=0.8216,
KL=0.1291, wKL=0.8750]
Train E35: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4564, E=0.8216,
KL=0.1291, wKL=0.8750]
Train E35: 48%|████▊ | 12/25 [00:17<00:17, 1.38s/batch, N=1.5736, E=0.8234,
KL=0.1274, wKL=0.8750]
Train E35: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.5736, E=0.8234,
KL=0.1274, wKL=0.8750]
Train E35: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4625, E=0.8197,
KL=0.1275, wKL=0.8750]
Train E35: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4625, E=0.8197,
KL=0.1275, wKL=0.8750]
Train E35: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5224, E=0.8275,
KL=0.1267, wKL=0.8750]
Train E35: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.5224, E=0.8275,
KL=0.1267, wKL=0.8750]
Train E35: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5614, E=0.8171,
KL=0.1269, wKL=0.8750]
Train E35: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5614, E=0.8171,
KL=0.1269, wKL=0.8750]
Train E35: 64%|██████▍ | 16/25 [00:24<00:12, 1.39s/batch, N=1.5558, E=0.8228,
KL=0.1270, wKL=0.8750]
Train E35: 68%|██████▊ | 17/25 [00:24<00:12, 1.57s/batch, N=1.5558, E=0.8228,
KL=0.1270, wKL=0.8750]
Train E35: 68%|██████▊ | 17/25 [00:25<00:12, 1.57s/batch, N=1.4441, E=0.8234,
KL=0.1248, wKL=0.8750]
Train E35: 72%|███████▏ | 18/25 [00:25<00:10, 1.52s/batch, N=1.4441, E=0.8234,
KL=0.1248, wKL=0.8750]
Train E35: 72%|███████▏ | 18/25 [00:26<00:10, 1.52s/batch, N=1.4841, E=0.8242,
KL=0.1254, wKL=0.8750]
Train E35: 76%|███████▌ | 19/25 [00:26<00:09, 1.52s/batch, N=1.4841, E=0.8242,
KL=0.1254, wKL=0.8750]
Train E35: 76%|███████▌ | 19/25 [00:28<00:09, 1.52s/batch, N=1.5467, E=0.8268,
KL=0.1266, wKL=0.8750]
Train E35: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.5467, E=0.8268,
KL=0.1266, wKL=0.8750]
Train E35: 80%|████████ | 20/25 [00:29<00:07, 1.50s/batch, N=1.5198, E=0.8200,
KL=0.1253, wKL=0.8750]
Train E35: 84%|████████▍ | 21/25 [00:29<00:06, 1.51s/batch, N=1.5198, E=0.8200,
KL=0.1253, wKL=0.8750]
Train E35: 84%|████████▍ | 21/25 [00:31<00:06, 1.51s/batch, N=1.4474, E=0.8262,
KL=0.1250, wKL=0.8750]
Train E35: 88%|████████▊ | 22/25 [00:31<00:04, 1.47s/batch, N=1.4474, E=0.8262,
KL=0.1250, wKL=0.8750]
Train E35: 88%|████████▊ | 22/25 [00:32<00:04, 1.47s/batch, N=1.4772, E=0.8219,
KL=0.1249, wKL=0.8750]
Train E35: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.4772, E=0.8219,
KL=0.1249, wKL=0.8750]
Train E35: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.4919, E=0.8213,
KL=0.1238, wKL=0.8750]
Train E35: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4919, E=0.8213,
KL=0.1238, wKL=0.8750]
Train E35: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4301, E=0.8243,
KL=0.1248, wKL=0.8750]
Train E35: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4301, E=0.8243,
KL=0.1248, wKL=0.8750]
Train E35: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4301, E=0.8243,
KL=0.1248, wKL=0.8750]
1278.5s 144 [Epoch 035] Total: 2.3687 | N: 1.4895 | E: 0.8236 | KL(0.88×0.5):
0.1270
1313.3s 145 Train E36: 0%| | 0/25 [00:00<?, ?batch/s]
Train E36: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5319, E=0.8262, KL=0.1229,
wKL=0.9000]
Train E36: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5319, E=0.8262,
KL=0.1229, wKL=0.9000]
Train E36: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4719, E=0.8225,
KL=0.1243, wKL=0.9000]
Train E36: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4719, E=0.8225,
KL=0.1243, wKL=0.9000]
Train E36: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5257, E=0.8176,
KL=0.1259, wKL=0.9000]
Train E36: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5257, E=0.8176,
KL=0.1259, wKL=0.9000]
Train E36: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.3931, E=0.8285,
KL=0.1216, wKL=0.9000]
Train E36: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.3931, E=0.8285,
KL=0.1216, wKL=0.9000]
Train E36: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4468, E=0.8228,
KL=0.1233, wKL=0.9000]
Train E36: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4468, E=0.8228,
KL=0.1233, wKL=0.9000]
Train E36: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5588, E=0.8214,
KL=0.1244, wKL=0.9000]
Train E36: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.5588, E=0.8214,
KL=0.1244, wKL=0.9000]
Train E36: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.6293, E=0.8263,
KL=0.1227, wKL=0.9000]
Train E36: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.6293, E=0.8263,
KL=0.1227, wKL=0.9000]
Train E36: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4875, E=0.8223,
KL=0.1228, wKL=0.9000]
Train E36: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.4875, E=0.8223,
KL=0.1228, wKL=0.9000]
Train E36: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4666, E=0.8268,
KL=0.1216, wKL=0.9000]
Train E36: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4666, E=0.8268,
KL=0.1216, wKL=0.9000]
Train E36: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4493, E=0.8238,
KL=0.1203, wKL=0.9000]
Train E36: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4493, E=0.8238,
KL=0.1203, wKL=0.9000]
Train E36: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4521, E=0.8247,
KL=0.1207, wKL=0.9000]
Train E36: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4521, E=0.8247,
KL=0.1207, wKL=0.9000]
Train E36: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4667, E=0.8258,
KL=0.1210, wKL=0.9000]
Train E36: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4667, E=0.8258,
KL=0.1210, wKL=0.9000]
Train E36: 48%|████▊ | 12/25 [00:17<00:18, 1.40s/batch, N=1.5384, E=0.8258,
KL=0.1205, wKL=0.9000]
Train E36: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.5384, E=0.8258,
KL=0.1205, wKL=0.9000]
Train E36: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4458, E=0.8254,
KL=0.1208, wKL=0.9000]
Train E36: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4458, E=0.8254,
KL=0.1208, wKL=0.9000]
Train E36: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5352, E=0.8223,
KL=0.1206, wKL=0.9000]
Train E36: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.5352, E=0.8223,
KL=0.1206, wKL=0.9000]
Train E36: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4949, E=0.8235,
KL=0.1207, wKL=0.9000]
Train E36: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.4949, E=0.8235,
KL=0.1207, wKL=0.9000]
Train E36: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.5505, E=0.8257,
KL=0.1228, wKL=0.9000]
Train E36: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.5505, E=0.8257,
KL=0.1228, wKL=0.9000]
Train E36: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5315, E=0.8177,
KL=0.1215, wKL=0.9000]
Train E36: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5315, E=0.8177,
KL=0.1215, wKL=0.9000]
Train E36: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.4923, E=0.8238,
KL=0.1214, wKL=0.9000]
Train E36: 76%|███████▌ | 19/25 [00:27<00:10, 1.67s/batch, N=1.4923, E=0.8238,
KL=0.1214, wKL=0.9000]
Train E36: 76%|███████▌ | 19/25 [00:28<00:10, 1.67s/batch, N=1.4555, E=0.8272,
KL=0.1211, wKL=0.9000]
Train E36: 80%|████████ | 20/25 [00:28<00:07, 1.58s/batch, N=1.4555, E=0.8272,
KL=0.1211, wKL=0.9000]
Train E36: 80%|████████ | 20/25 [00:30<00:07, 1.58s/batch, N=1.4590, E=0.8239,
KL=0.1198, wKL=0.9000]
Train E36: 84%|████████▍ | 21/25 [00:30<00:06, 1.52s/batch, N=1.4590, E=0.8239,
KL=0.1198, wKL=0.9000]
Train E36: 84%|████████▍ | 21/25 [00:31<00:06, 1.52s/batch, N=1.4606, E=0.8261,
KL=0.1196, wKL=0.9000]
Train E36: 88%|████████▊ | 22/25 [00:31<00:04, 1.48s/batch, N=1.4606, E=0.8261,
KL=0.1196, wKL=0.9000]
Train E36: 88%|████████▊ | 22/25 [00:32<00:04, 1.48s/batch, N=1.4828, E=0.8198,
KL=0.1187, wKL=0.9000]
Train E36: 92%|█████████▏| 23/25 [00:32<00:02, 1.46s/batch, N=1.4828, E=0.8198,
KL=0.1187, wKL=0.9000]
Train E36: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.4979, E=0.8214,
KL=0.1177, wKL=0.9000]
Train E36: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4979, E=0.8214,
KL=0.1177, wKL=0.9000]
Train E36: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.3230, E=0.8255,
KL=0.1172, wKL=0.9000]
Train E36: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.3230, E=0.8255,
KL=0.1172, wKL=0.9000]
Train E36: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.3230, E=0.8255,
KL=0.1172, wKL=0.9000]
1313.3s 146 [Epoch 036] Total: 2.3683 | N: 1.4898 | E: 0.8238 | KL(0.90×0.5):
0.1214
1348.2s 147 Train E37: 0%| | 0/25 [00:00<?, ?batch/s]
Train E37: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5144, E=0.8198, KL=0.1178,
wKL=0.9250]
Train E37: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5144, E=0.8198,
KL=0.1178, wKL=0.9250]
Train E37: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4818, E=0.8241,
KL=0.1157, wKL=0.9250]
Train E37: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4818, E=0.8241,
KL=0.1157, wKL=0.9250]
Train E37: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4888, E=0.8240,
KL=0.1154, wKL=0.9250]
Train E37: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4888, E=0.8240,
KL=0.1154, wKL=0.9250]
Train E37: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4117, E=0.8245,
KL=0.1166, wKL=0.9250]
Train E37: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4117, E=0.8245,
KL=0.1166, wKL=0.9250]
Train E37: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4826, E=0.8183,
KL=0.1159, wKL=0.9250]
Train E37: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4826, E=0.8183,
KL=0.1159, wKL=0.9250]
Train E37: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.5471, E=0.8263,
KL=0.1181, wKL=0.9250]
Train E37: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5471, E=0.8263,
KL=0.1181, wKL=0.9250]
Train E37: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4718, E=0.8259,
KL=0.1173, wKL=0.9250]
Train E37: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4718, E=0.8259,
KL=0.1173, wKL=0.9250]
Train E37: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4982, E=0.8239,
KL=0.1169, wKL=0.9250]
Train E37: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4982, E=0.8239,
KL=0.1169, wKL=0.9250]
Train E37: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5117, E=0.8258,
KL=0.1210, wKL=0.9250]
Train E37: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5117, E=0.8258,
KL=0.1210, wKL=0.9250]
Train E37: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5031, E=0.8210,
KL=0.1190, wKL=0.9250]
Train E37: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5031, E=0.8210,
KL=0.1190, wKL=0.9250]
Train E37: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5789, E=0.8203,
KL=0.1182, wKL=0.9250]
Train E37: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.5789, E=0.8203,
KL=0.1182, wKL=0.9250]
Train E37: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.4493, E=0.8230,
KL=0.1185, wKL=0.9250]
Train E37: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4493, E=0.8230,
KL=0.1185, wKL=0.9250]
Train E37: 48%|████▊ | 12/25 [00:17<00:17, 1.38s/batch, N=1.4436, E=0.8216,
KL=0.1172, wKL=0.9250]
Train E37: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4436, E=0.8216,
KL=0.1172, wKL=0.9250]
Train E37: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4973, E=0.8286,
KL=0.1157, wKL=0.9250]
Train E37: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4973, E=0.8286,
KL=0.1157, wKL=0.9250]
Train E37: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4384, E=0.8237,
KL=0.1156, wKL=0.9250]
Train E37: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.4384, E=0.8237,
KL=0.1156, wKL=0.9250]
Train E37: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5674, E=0.8234,
KL=0.1145, wKL=0.9250]
Train E37: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5674, E=0.8234,
KL=0.1145, wKL=0.9250]
Train E37: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4754, E=0.8260,
KL=0.1123, wKL=0.9250]
Train E37: 68%|██████▊ | 17/25 [00:23<00:11, 1.48s/batch, N=1.4754, E=0.8260,
KL=0.1123, wKL=0.9250]
Train E37: 68%|██████▊ | 17/25 [00:25<00:11, 1.48s/batch, N=1.4090, E=0.8216,
KL=0.1117, wKL=0.9250]
Train E37: 72%|███████▏ | 18/25 [00:25<00:10, 1.46s/batch, N=1.4090, E=0.8216,
KL=0.1117, wKL=0.9250]
Train E37: 72%|███████▏ | 18/25 [00:26<00:10, 1.46s/batch, N=1.5696, E=0.8261,
KL=0.1121, wKL=0.9250]
Train E37: 76%|███████▌ | 19/25 [00:26<00:08, 1.44s/batch, N=1.5696, E=0.8261,
KL=0.1121, wKL=0.9250]
Train E37: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.4805, E=0.8253,
KL=0.1124, wKL=0.9250]
Train E37: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.4805, E=0.8253,
KL=0.1124, wKL=0.9250]
Train E37: 80%|████████ | 20/25 [00:29<00:07, 1.43s/batch, N=1.4649, E=0.8268,
KL=0.1114, wKL=0.9250]
Train E37: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4649, E=0.8268,
KL=0.1114, wKL=0.9250]
Train E37: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4756, E=0.8249,
KL=0.1117, wKL=0.9250]
Train E37: 88%|████████▊ | 22/25 [00:30<00:04, 1.41s/batch, N=1.4756, E=0.8249,
KL=0.1117, wKL=0.9250]
Train E37: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4898, E=0.8279,
KL=0.1139, wKL=0.9250]
Train E37: 92%|█████████▏| 23/25 [00:32<00:03, 1.60s/batch, N=1.4898, E=0.8279,
KL=0.1139, wKL=0.9250]
Train E37: 92%|█████████▏| 23/25 [00:34<00:03, 1.60s/batch, N=1.4877, E=0.8234,
KL=0.1141, wKL=0.9250]
Train E37: 96%|█████████▌| 24/25 [00:34<00:01, 1.53s/batch, N=1.4877, E=0.8234,
KL=0.1141, wKL=0.9250]
Train E37: 96%|█████████▌| 24/25 [00:34<00:01, 1.53s/batch, N=1.5251, E=0.8217,
KL=0.1137, wKL=0.9250]
Train E37: 100%|██████████| 25/25 [00:34<00:00, 1.25s/batch, N=1.5251, E=0.8217,
KL=0.1137, wKL=0.9250]
Train E37: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5251, E=0.8217,
KL=0.1137, wKL=0.9250]
1348.2s 148 [Epoch 037] Total: 2.3671 | N: 1.4897 | E: 0.8240 | KL(0.93×0.5):
0.1155
1382.9s 149 Train E38: 0%| | 0/25 [00:00<?, ?batch/s]
Train E38: 0%| | 0/25 [00:01<?, ?batch/s, N=1.3975, E=0.8231, KL=0.1160,
wKL=0.9500]
Train E38: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.3975, E=0.8231,
KL=0.1160, wKL=0.9500]
Train E38: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4900, E=0.8273,
KL=0.1127, wKL=0.9500]
Train E38: 8%|▊ | 2/25 [00:02<00:32, 1.40s/batch, N=1.4900, E=0.8273,
KL=0.1127, wKL=0.9500]
Train E38: 8%|▊ | 2/25 [00:04<00:32, 1.40s/batch, N=1.5265, E=0.8256,
KL=0.1116, wKL=0.9500]
Train E38: 12%|█▏ | 3/25 [00:04<00:31, 1.41s/batch, N=1.5265, E=0.8256,
KL=0.1116, wKL=0.9500]
Train E38: 12%|█▏ | 3/25 [00:05<00:31, 1.41s/batch, N=1.4461, E=0.8215,
KL=0.1112, wKL=0.9500]
Train E38: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4461, E=0.8215,
KL=0.1112, wKL=0.9500]
Train E38: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4482, E=0.8210,
KL=0.1090, wKL=0.9500]
Train E38: 20%|██ | 5/25 [00:07<00:28, 1.40s/batch, N=1.4482, E=0.8210,
KL=0.1090, wKL=0.9500]
Train E38: 20%|██ | 5/25 [00:08<00:28, 1.40s/batch, N=1.5409, E=0.8273,
KL=0.1097, wKL=0.9500]
Train E38: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.5409, E=0.8273,
KL=0.1097, wKL=0.9500]
Train E38: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5331, E=0.8227,
KL=0.1090, wKL=0.9500]
Train E38: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5331, E=0.8227,
KL=0.1090, wKL=0.9500]
Train E38: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4959, E=0.8238,
KL=0.1073, wKL=0.9500]
Train E38: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4959, E=0.8238,
KL=0.1073, wKL=0.9500]
Train E38: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4676, E=0.8257,
KL=0.1088, wKL=0.9500]
Train E38: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.4676, E=0.8257,
KL=0.1088, wKL=0.9500]
Train E38: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5022, E=0.8219,
KL=0.1098, wKL=0.9500]
Train E38: 40%|████ | 10/25 [00:14<00:20, 1.40s/batch, N=1.5022, E=0.8219,
KL=0.1098, wKL=0.9500]
Train E38: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4515, E=0.8239,
KL=0.1085, wKL=0.9500]
Train E38: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4515, E=0.8239,
KL=0.1085, wKL=0.9500]
Train E38: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.5876, E=0.8209,
KL=0.1102, wKL=0.9500]
Train E38: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5876, E=0.8209,
KL=0.1102, wKL=0.9500]
Train E38: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4640, E=0.8252,
KL=0.1093, wKL=0.9500]
Train E38: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.4640, E=0.8252,
KL=0.1093, wKL=0.9500]
Train E38: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4884, E=0.8259,
KL=0.1089, wKL=0.9500]
Train E38: 56%|█████▌ | 14/25 [00:19<00:16, 1.51s/batch, N=1.4884, E=0.8259,
KL=0.1089, wKL=0.9500]
Train E38: 56%|█████▌ | 14/25 [00:21<00:16, 1.51s/batch, N=1.4373, E=0.8257,
KL=0.1082, wKL=0.9500]
Train E38: 60%|██████ | 15/25 [00:21<00:15, 1.51s/batch, N=1.4373, E=0.8257,
KL=0.1082, wKL=0.9500]
Train E38: 60%|██████ | 15/25 [00:22<00:15, 1.51s/batch, N=1.5021, E=0.8243,
KL=0.1090, wKL=0.9500]
Train E38: 64%|██████▍ | 16/25 [00:22<00:13, 1.48s/batch, N=1.5021, E=0.8243,
KL=0.1090, wKL=0.9500]
Train E38: 64%|██████▍ | 16/25 [00:24<00:13, 1.48s/batch, N=1.5413, E=0.8266,
KL=0.1100, wKL=0.9500]
Train E38: 68%|██████▊ | 17/25 [00:24<00:11, 1.45s/batch, N=1.5413, E=0.8266,
KL=0.1100, wKL=0.9500]
Train E38: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.5677, E=0.8260,
KL=0.1085, wKL=0.9500]
Train E38: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.5677, E=0.8260,
KL=0.1085, wKL=0.9500]
Train E38: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.3979, E=0.8245,
KL=0.1075, wKL=0.9500]
Train E38: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.3979, E=0.8245,
KL=0.1075, wKL=0.9500]
Train E38: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.4382, E=0.8224,
KL=0.1079, wKL=0.9500]
Train E38: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4382, E=0.8224,
KL=0.1079, wKL=0.9500]
Train E38: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.4632, E=0.8234,
KL=0.1076, wKL=0.9500]
Train E38: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4632, E=0.8234,
KL=0.1076, wKL=0.9500]
Train E38: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4659, E=0.8185,
KL=0.1075, wKL=0.9500]
Train E38: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4659, E=0.8185,
KL=0.1075, wKL=0.9500]
Train E38: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5386, E=0.8241,
KL=0.1066, wKL=0.9500]
Train E38: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.5386, E=0.8241,
KL=0.1066, wKL=0.9500]
Train E38: 92%|█████████▏| 23/25 [00:34<00:02, 1.42s/batch, N=1.5154, E=0.8234,
KL=0.1064, wKL=0.9500]
Train E38: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5154, E=0.8234,
KL=0.1064, wKL=0.9500]
Train E38: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5601, E=0.8308,
KL=0.1049, wKL=0.9500]
Train E38: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.5601, E=0.8308,
KL=0.1049, wKL=0.9500]
Train E38: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5601, E=0.8308,
KL=0.1049, wKL=0.9500]
1382.9s 150 [Epoch 038] Total: 2.3649 | N: 1.4890 | E: 0.8241 | KL(0.95×0.5):
0.1091
1417.8s 151 Train E39: 0%| | 0/25 [00:00<?, ?batch/s]
Train E39: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5663, E=0.8283, KL=0.1038,
wKL=0.9750]
Train E39: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5663, E=0.8283,
KL=0.1038, wKL=0.9750]
Train E39: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4691, E=0.8250,
KL=0.1042, wKL=0.9750]
Train E39: 8%|▊ | 2/25 [00:02<00:30, 1.35s/batch, N=1.4691, E=0.8250,
KL=0.1042, wKL=0.9750]
Train E39: 8%|▊ | 2/25 [00:04<00:30, 1.35s/batch, N=1.5091, E=0.8242,
KL=0.1039, wKL=0.9750]
Train E39: 12%|█▏ | 3/25 [00:04<00:36, 1.64s/batch, N=1.5091, E=0.8242,
KL=0.1039, wKL=0.9750]
Train E39: 12%|█▏ | 3/25 [00:06<00:36, 1.64s/batch, N=1.5274, E=0.8254,
KL=0.1034, wKL=0.9750]
Train E39: 16%|█▌ | 4/25 [00:06<00:32, 1.53s/batch, N=1.5274, E=0.8254,
KL=0.1034, wKL=0.9750]
Train E39: 16%|█▌ | 4/25 [00:07<00:32, 1.53s/batch, N=1.4498, E=0.8186,
KL=0.1039, wKL=0.9750]
Train E39: 20%|██ | 5/25 [00:07<00:29, 1.49s/batch, N=1.4498, E=0.8186,
KL=0.1039, wKL=0.9750]
Train E39: 20%|██ | 5/25 [00:08<00:29, 1.49s/batch, N=1.4019, E=0.8267,
KL=0.1015, wKL=0.9750]
Train E39: 24%|██▍ | 6/25 [00:08<00:27, 1.45s/batch, N=1.4019, E=0.8267,
KL=0.1015, wKL=0.9750]
Train E39: 24%|██▍ | 6/25 [00:10<00:27, 1.45s/batch, N=1.4461, E=0.8224,
KL=0.1029, wKL=0.9750]
Train E39: 28%|██▊ | 7/25 [00:10<00:25, 1.43s/batch, N=1.4461, E=0.8224,
KL=0.1029, wKL=0.9750]
Train E39: 28%|██▊ | 7/25 [00:11<00:25, 1.43s/batch, N=1.5134, E=0.8244,
KL=0.1037, wKL=0.9750]
Train E39: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5134, E=0.8244,
KL=0.1037, wKL=0.9750]
Train E39: 32%|███▏ | 8/25 [00:13<00:24, 1.42s/batch, N=1.4426, E=0.8210,
KL=0.1026, wKL=0.9750]
Train E39: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.4426, E=0.8210,
KL=0.1026, wKL=0.9750]
Train E39: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5256, E=0.8261,
KL=0.1022, wKL=0.9750]
Train E39: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.5256, E=0.8261,
KL=0.1022, wKL=0.9750]
Train E39: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4434, E=0.8220,
KL=0.1027, wKL=0.9750]
Train E39: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4434, E=0.8220,
KL=0.1027, wKL=0.9750]
Train E39: 44%|████▍ | 11/25 [00:17<00:19, 1.41s/batch, N=1.5193, E=0.8200,
KL=0.1035, wKL=0.9750]
Train E39: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.5193, E=0.8200,
KL=0.1035, wKL=0.9750]
Train E39: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.4843, E=0.8254,
KL=0.1043, wKL=0.9750]
Train E39: 52%|█████▏ | 13/25 [00:18<00:17, 1.46s/batch, N=1.4843, E=0.8254,
KL=0.1043, wKL=0.9750]
Train E39: 52%|█████▏ | 13/25 [00:20<00:17, 1.46s/batch, N=1.4627, E=0.8220,
KL=0.1030, wKL=0.9750]
Train E39: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4627, E=0.8220,
KL=0.1030, wKL=0.9750]
Train E39: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.4044, E=0.8224,
KL=0.1012, wKL=0.9750]
Train E39: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4044, E=0.8224,
KL=0.1012, wKL=0.9750]
Train E39: 60%|██████ | 15/25 [00:23<00:14, 1.42s/batch, N=1.5735, E=0.8245,
KL=0.1029, wKL=0.9750]
Train E39: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.5735, E=0.8245,
KL=0.1029, wKL=0.9750]
Train E39: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.4624, E=0.8236,
KL=0.1002, wKL=0.9750]
Train E39: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4624, E=0.8236,
KL=0.1002, wKL=0.9750]
Train E39: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.3989, E=0.8231,
KL=0.1024, wKL=0.9750]
Train E39: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.3989, E=0.8231,
KL=0.1024, wKL=0.9750]
Train E39: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5731, E=0.8240,
KL=0.1013, wKL=0.9750]
Train E39: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5731, E=0.8240,
KL=0.1013, wKL=0.9750]
Train E39: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5301, E=0.8249,
KL=0.0999, wKL=0.9750]
Train E39: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5301, E=0.8249,
KL=0.0999, wKL=0.9750]
Train E39: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.5093, E=0.8247,
KL=0.1007, wKL=0.9750]
Train E39: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5093, E=0.8247,
KL=0.1007, wKL=0.9750]
Train E39: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4519, E=0.8224,
KL=0.1003, wKL=0.9750]
Train E39: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4519, E=0.8224,
KL=0.1003, wKL=0.9750]
Train E39: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.5691, E=0.8293,
KL=0.0998, wKL=0.9750]
Train E39: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5691, E=0.8293,
KL=0.0998, wKL=0.9750]
Train E39: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4963, E=0.8251,
KL=0.1032, wKL=0.9750]
Train E39: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4963, E=0.8251,
KL=0.1032, wKL=0.9750]
Train E39: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4479, E=0.8215,
KL=0.1012, wKL=0.9750]
Train E39: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4479, E=0.8215,
KL=0.1012, wKL=0.9750]
Train E39: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4479, E=0.8215,
KL=0.1012, wKL=0.9750]
1417.8s 152 [Epoch 039] Total: 2.3619 | N: 1.4881 | E: 0.8239 | KL(0.97×0.5):
0.1024
1452.8s 153 Train E40: 0%| | 0/25 [00:00<?, ?batch/s]
Train E40: 0%| | 0/25 [00:01<?, ?batch/s, N=1.3696, E=0.8220, KL=0.0987,
wKL=1.0000]
Train E40: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.3696, E=0.8220,
KL=0.0987, wKL=1.0000]
Train E40: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4088, E=0.8241,
KL=0.1001, wKL=1.0000]
Train E40: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4088, E=0.8241,
KL=0.1001, wKL=1.0000]
Train E40: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.5651, E=0.8238,
KL=0.0999, wKL=1.0000]
Train E40: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5651, E=0.8238,
KL=0.0999, wKL=1.0000]
Train E40: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.6169, E=0.8232,
KL=0.0998, wKL=1.0000]
Train E40: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.6169, E=0.8232,
KL=0.0998, wKL=1.0000]
Train E40: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.5516, E=0.8270,
KL=0.0997, wKL=1.0000]
Train E40: 20%|██ | 5/25 [00:06<00:27, 1.40s/batch, N=1.5516, E=0.8270,
KL=0.0997, wKL=1.0000]
Train E40: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.4170, E=0.8236,
KL=0.0993, wKL=1.0000]
Train E40: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4170, E=0.8236,
KL=0.0993, wKL=1.0000]
Train E40: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.6072, E=0.8236,
KL=0.0992, wKL=1.0000]
Train E40: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.6072, E=0.8236,
KL=0.0992, wKL=1.0000]
Train E40: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5424, E=0.8221,
KL=0.0992, wKL=1.0000]
Train E40: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.5424, E=0.8221,
KL=0.0992, wKL=1.0000]
Train E40: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.5120, E=0.8222,
KL=0.0980, wKL=1.0000]
Train E40: 36%|███▌ | 9/25 [00:12<00:23, 1.47s/batch, N=1.5120, E=0.8222,
KL=0.0980, wKL=1.0000]
Train E40: 36%|███▌ | 9/25 [00:14<00:23, 1.47s/batch, N=1.4295, E=0.8260,
KL=0.0966, wKL=1.0000]
Train E40: 40%|████ | 10/25 [00:14<00:22, 1.48s/batch, N=1.4295, E=0.8260,
KL=0.0966, wKL=1.0000]
Train E40: 40%|████ | 10/25 [00:16<00:22, 1.48s/batch, N=1.5243, E=0.8246,
KL=0.0986, wKL=1.0000]
Train E40: 44%|████▍ | 11/25 [00:16<00:22, 1.64s/batch, N=1.5243, E=0.8246,
KL=0.0986, wKL=1.0000]
Train E40: 44%|████▍ | 11/25 [00:17<00:22, 1.64s/batch, N=1.4758, E=0.8238,
KL=0.0974, wKL=1.0000]
Train E40: 48%|████▊ | 12/25 [00:17<00:20, 1.56s/batch, N=1.4758, E=0.8238,
KL=0.0974, wKL=1.0000]
Train E40: 48%|████▊ | 12/25 [00:19<00:20, 1.56s/batch, N=1.5246, E=0.8238,
KL=0.0983, wKL=1.0000]
Train E40: 52%|█████▏ | 13/25 [00:19<00:18, 1.50s/batch, N=1.5246, E=0.8238,
KL=0.0983, wKL=1.0000]
Train E40: 52%|█████▏ | 13/25 [00:20<00:18, 1.50s/batch, N=1.5209, E=0.8293,
KL=0.0968, wKL=1.0000]
Train E40: 56%|█████▌ | 14/25 [00:20<00:16, 1.46s/batch, N=1.5209, E=0.8293,
KL=0.0968, wKL=1.0000]
Train E40: 56%|█████▌ | 14/25 [00:21<00:16, 1.46s/batch, N=1.4759, E=0.8231,
KL=0.0966, wKL=1.0000]
Train E40: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.4759, E=0.8231,
KL=0.0966, wKL=1.0000]
Train E40: 60%|██████ | 15/25 [00:23<00:14, 1.44s/batch, N=1.4312, E=0.8215,
KL=0.0975, wKL=1.0000]
Train E40: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.4312, E=0.8215,
KL=0.0975, wKL=1.0000]
Train E40: 64%|██████▍ | 16/25 [00:24<00:12, 1.44s/batch, N=1.4289, E=0.8247,
KL=0.0971, wKL=1.0000]
Train E40: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.4289, E=0.8247,
KL=0.0971, wKL=1.0000]
Train E40: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.5366, E=0.8221,
KL=0.0965, wKL=1.0000]
Train E40: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5366, E=0.8221,
KL=0.0965, wKL=1.0000]
Train E40: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5216, E=0.8250,
KL=0.0959, wKL=1.0000]
Train E40: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5216, E=0.8250,
KL=0.0959, wKL=1.0000]
Train E40: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5196, E=0.8227,
KL=0.0954, wKL=1.0000]
Train E40: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5196, E=0.8227,
KL=0.0954, wKL=1.0000]
Train E40: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.4418, E=0.8174,
KL=0.0963, wKL=1.0000]
Train E40: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.4418, E=0.8174,
KL=0.0963, wKL=1.0000]
Train E40: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.4178, E=0.8258,
KL=0.0947, wKL=1.0000]
Train E40: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4178, E=0.8258,
KL=0.0947, wKL=1.0000]
Train E40: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4469, E=0.8241,
KL=0.0962, wKL=1.0000]
Train E40: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4469, E=0.8241,
KL=0.0962, wKL=1.0000]
Train E40: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5050, E=0.8180,
KL=0.0967, wKL=1.0000]
Train E40: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5050, E=0.8180,
KL=0.0967, wKL=1.0000]
Train E40: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.2705, E=0.8167,
KL=0.0937, wKL=1.0000]
Train E40: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.2705, E=0.8167,
KL=0.0937, wKL=1.0000]
Train E40: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.2705, E=0.8167,
KL=0.0937, wKL=1.0000]
1452.8s 154 [Epoch 040] Total: 2.3597 | N: 1.4875 | E: 0.8234 | KL(1.00×0.5):
0.0976
1452.8s 155 Saved checkpoint: /kaggle/working/checkpoints/gvae_40_epoch040.pt
1487.8s 156 Train E41: 0%| | 0/25 [00:00<?, ?batch/s]
Train E41: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4295, E=0.8238, KL=0.0955,
wKL=1.0000]
Train E41: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.4295, E=0.8238,
KL=0.0955, wKL=1.0000]
Train E41: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.5074, E=0.8212,
KL=0.0939, wKL=1.0000]
Train E41: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.5074, E=0.8212,
KL=0.0939, wKL=1.0000]
Train E41: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4313, E=0.8225,
KL=0.0940, wKL=1.0000]
Train E41: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4313, E=0.8225,
KL=0.0940, wKL=1.0000]
Train E41: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.6103, E=0.8265,
KL=0.0960, wKL=1.0000]
Train E41: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.6103, E=0.8265,
KL=0.0960, wKL=1.0000]
Train E41: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.5094, E=0.8247,
KL=0.0928, wKL=1.0000]
Train E41: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5094, E=0.8247,
KL=0.0928, wKL=1.0000]
Train E41: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5119, E=0.8205,
KL=0.0950, wKL=1.0000]
Train E41: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.5119, E=0.8205,
KL=0.0950, wKL=1.0000]
Train E41: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4478, E=0.8201,
KL=0.0936, wKL=1.0000]
Train E41: 28%|██▊ | 7/25 [00:09<00:26, 1.45s/batch, N=1.4478, E=0.8201,
KL=0.0936, wKL=1.0000]
Train E41: 28%|██▊ | 7/25 [00:11<00:26, 1.45s/batch, N=1.4589, E=0.8244,
KL=0.0929, wKL=1.0000]
Train E41: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.4589, E=0.8244,
KL=0.0929, wKL=1.0000]
Train E41: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.5028, E=0.8208,
KL=0.0939, wKL=1.0000]
Train E41: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.5028, E=0.8208,
KL=0.0939, wKL=1.0000]
Train E41: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4986, E=0.8230,
KL=0.0933, wKL=1.0000]
Train E41: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4986, E=0.8230,
KL=0.0933, wKL=1.0000]
Train E41: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4687, E=0.8233,
KL=0.0919, wKL=1.0000]
Train E41: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4687, E=0.8233,
KL=0.0919, wKL=1.0000]
Train E41: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4581, E=0.8253,
KL=0.0944, wKL=1.0000]
Train E41: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4581, E=0.8253,
KL=0.0944, wKL=1.0000]
Train E41: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.5169, E=0.8264,
KL=0.0924, wKL=1.0000]
Train E41: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5169, E=0.8264,
KL=0.0924, wKL=1.0000]
Train E41: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.4955, E=0.8208,
KL=0.0941, wKL=1.0000]
Train E41: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4955, E=0.8208,
KL=0.0941, wKL=1.0000]
Train E41: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4991, E=0.8247,
KL=0.0930, wKL=1.0000]
Train E41: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.4991, E=0.8247,
KL=0.0930, wKL=1.0000]
Train E41: 60%|██████ | 15/25 [00:23<00:13, 1.40s/batch, N=1.5031, E=0.8200,
KL=0.0943, wKL=1.0000]
Train E41: 64%|██████▍ | 16/25 [00:23<00:14, 1.59s/batch, N=1.5031, E=0.8200,
KL=0.0943, wKL=1.0000]
Train E41: 64%|██████▍ | 16/25 [00:24<00:14, 1.59s/batch, N=1.5046, E=0.8189,
KL=0.0959, wKL=1.0000]
Train E41: 68%|██████▊ | 17/25 [00:24<00:12, 1.53s/batch, N=1.5046, E=0.8189,
KL=0.0959, wKL=1.0000]
Train E41: 68%|██████▊ | 17/25 [00:25<00:12, 1.53s/batch, N=1.5181, E=0.8237,
KL=0.0933, wKL=1.0000]
Train E41: 72%|███████▏ | 18/25 [00:25<00:10, 1.48s/batch, N=1.5181, E=0.8237,
KL=0.0933, wKL=1.0000]
Train E41: 72%|███████▏ | 18/25 [00:27<00:10, 1.48s/batch, N=1.4315, E=0.8232,
KL=0.0941, wKL=1.0000]
Train E41: 76%|███████▌ | 19/25 [00:27<00:08, 1.46s/batch, N=1.4315, E=0.8232,
KL=0.0941, wKL=1.0000]
Train E41: 76%|███████▌ | 19/25 [00:28<00:08, 1.46s/batch, N=1.4045, E=0.8238,
KL=0.0940, wKL=1.0000]
Train E41: 80%|████████ | 20/25 [00:28<00:07, 1.45s/batch, N=1.4045, E=0.8238,
KL=0.0940, wKL=1.0000]
Train E41: 80%|████████ | 20/25 [00:30<00:07, 1.45s/batch, N=1.5760, E=0.8215,
KL=0.0949, wKL=1.0000]
Train E41: 84%|████████▍ | 21/25 [00:30<00:05, 1.45s/batch, N=1.5760, E=0.8215,
KL=0.0949, wKL=1.0000]
Train E41: 84%|████████▍ | 21/25 [00:31<00:05, 1.45s/batch, N=1.4635, E=0.8189,
KL=0.0929, wKL=1.0000]
Train E41: 88%|████████▊ | 22/25 [00:31<00:04, 1.43s/batch, N=1.4635, E=0.8189,
KL=0.0929, wKL=1.0000]
Train E41: 88%|████████▊ | 22/25 [00:32<00:04, 1.43s/batch, N=1.4638, E=0.8204,
KL=0.0927, wKL=1.0000]
Train E41: 92%|█████████▏| 23/25 [00:32<00:02, 1.43s/batch, N=1.4638, E=0.8204,
KL=0.0927, wKL=1.0000]
Train E41: 92%|█████████▏| 23/25 [00:34<00:02, 1.43s/batch, N=1.5066, E=0.8214,
KL=0.0907, wKL=1.0000]
Train E41: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5066, E=0.8214,
KL=0.0907, wKL=1.0000]
Train E41: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4054, E=0.8254,
KL=0.0902, wKL=1.0000]
Train E41: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4054, E=0.8254,
KL=0.0902, wKL=1.0000]
Train E41: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4054, E=0.8254,
KL=0.0902, wKL=1.0000]
1487.8s 157 [Epoch 041] Total: 2.3562 | N: 1.4868 | E: 0.8225 | KL(1.00×0.5):
0.0937
1522.8s 158 Train E42: 0%| | 0/25 [00:00<?, ?batch/s]
Train E42: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5679, E=0.8247, KL=0.0900,
wKL=1.0000]
Train E42: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5679, E=0.8247,
KL=0.0900, wKL=1.0000]
Train E42: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4692, E=0.8231,
KL=0.0891, wKL=1.0000]
Train E42: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4692, E=0.8231,
KL=0.0891, wKL=1.0000]
Train E42: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4993, E=0.8243,
KL=0.0893, wKL=1.0000]
Train E42: 12%|█▏ | 3/25 [00:04<00:30, 1.40s/batch, N=1.4993, E=0.8243,
KL=0.0893, wKL=1.0000]
Train E42: 12%|█▏ | 3/25 [00:05<00:30, 1.40s/batch, N=1.5005, E=0.8239,
KL=0.0894, wKL=1.0000]
Train E42: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.5005, E=0.8239,
KL=0.0894, wKL=1.0000]
Train E42: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4276, E=0.8201,
KL=0.0894, wKL=1.0000]
Train E42: 20%|██ | 5/25 [00:07<00:30, 1.51s/batch, N=1.4276, E=0.8201,
KL=0.0894, wKL=1.0000]
Train E42: 20%|██ | 5/25 [00:08<00:30, 1.51s/batch, N=1.5631, E=0.8267,
KL=0.0908, wKL=1.0000]
Train E42: 24%|██▍ | 6/25 [00:08<00:28, 1.48s/batch, N=1.5631, E=0.8267,
KL=0.0908, wKL=1.0000]
Train E42: 24%|██▍ | 6/25 [00:10<00:28, 1.48s/batch, N=1.4921, E=0.8216,
KL=0.0902, wKL=1.0000]
Train E42: 28%|██▊ | 7/25 [00:10<00:26, 1.45s/batch, N=1.4921, E=0.8216,
KL=0.0902, wKL=1.0000]
Train E42: 28%|██▊ | 7/25 [00:11<00:26, 1.45s/batch, N=1.5528, E=0.8221,
KL=0.0926, wKL=1.0000]
Train E42: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5528, E=0.8221,
KL=0.0926, wKL=1.0000]
Train E42: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.4827, E=0.8232,
KL=0.0912, wKL=1.0000]
Train E42: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.4827, E=0.8232,
KL=0.0912, wKL=1.0000]
Train E42: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.5228, E=0.8163,
KL=0.0920, wKL=1.0000]
Train E42: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.5228, E=0.8163,
KL=0.0920, wKL=1.0000]
Train E42: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.3665, E=0.8197,
KL=0.0921, wKL=1.0000]
Train E42: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.3665, E=0.8197,
KL=0.0921, wKL=1.0000]
Train E42: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.4547, E=0.8205,
KL=0.0900, wKL=1.0000]
Train E42: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4547, E=0.8205,
KL=0.0900, wKL=1.0000]
Train E42: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4777, E=0.8198,
KL=0.0913, wKL=1.0000]
Train E42: 52%|█████▏ | 13/25 [00:18<00:16, 1.38s/batch, N=1.4777, E=0.8198,
KL=0.0913, wKL=1.0000]
Train E42: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.4691, E=0.8234,
KL=0.0908, wKL=1.0000]
Train E42: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.4691, E=0.8234,
KL=0.0908, wKL=1.0000]
Train E42: 56%|█████▌ | 14/25 [00:21<00:15, 1.38s/batch, N=1.4833, E=0.8217,
KL=0.0901, wKL=1.0000]
Train E42: 60%|██████ | 15/25 [00:21<00:13, 1.38s/batch, N=1.4833, E=0.8217,
KL=0.0901, wKL=1.0000]
Train E42: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4593, E=0.8242,
KL=0.0894, wKL=1.0000]
Train E42: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4593, E=0.8242,
KL=0.0894, wKL=1.0000]
Train E42: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4372, E=0.8232,
KL=0.0893, wKL=1.0000]
Train E42: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.4372, E=0.8232,
KL=0.0893, wKL=1.0000]
Train E42: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.5919, E=0.8243,
KL=0.0901, wKL=1.0000]
Train E42: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5919, E=0.8243,
KL=0.0901, wKL=1.0000]
Train E42: 72%|███████▏ | 18/25 [00:27<00:09, 1.39s/batch, N=1.5327, E=0.8221,
KL=0.0902, wKL=1.0000]
Train E42: 76%|███████▌ | 19/25 [00:27<00:09, 1.59s/batch, N=1.5327, E=0.8221,
KL=0.0902, wKL=1.0000]
Train E42: 76%|███████▌ | 19/25 [00:28<00:09, 1.59s/batch, N=1.4933, E=0.8172,
KL=0.0898, wKL=1.0000]
Train E42: 80%|████████ | 20/25 [00:28<00:07, 1.53s/batch, N=1.4933, E=0.8172,
KL=0.0898, wKL=1.0000]
Train E42: 80%|████████ | 20/25 [00:30<00:07, 1.53s/batch, N=1.5468, E=0.8256,
KL=0.0910, wKL=1.0000]
Train E42: 84%|████████▍ | 21/25 [00:30<00:05, 1.50s/batch, N=1.5468, E=0.8256,
KL=0.0910, wKL=1.0000]
Train E42: 84%|████████▍ | 21/25 [00:31<00:05, 1.50s/batch, N=1.4116, E=0.8284,
KL=0.0887, wKL=1.0000]
Train E42: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4116, E=0.8284,
KL=0.0887, wKL=1.0000]
Train E42: 88%|████████▊ | 22/25 [00:33<00:04, 1.46s/batch, N=1.4466, E=0.8237,
KL=0.0892, wKL=1.0000]
Train E42: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.4466, E=0.8237,
KL=0.0892, wKL=1.0000]
Train E42: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.4603, E=0.8193,
KL=0.0892, wKL=1.0000]
Train E42: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4603, E=0.8193,
KL=0.0892, wKL=1.0000]
Train E42: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4155, E=0.8167,
KL=0.0890, wKL=1.0000]
Train E42: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4155, E=0.8167,
KL=0.0890, wKL=1.0000]
Train E42: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4155, E=0.8167,
KL=0.0890, wKL=1.0000]
1522.8s 159 [Epoch 042] Total: 2.3541 | N: 1.4866 | E: 0.8224 | KL(1.00×0.5):
0.0902
1558.0s 160 Train E43: 0%| | 0/25 [00:00<?, ?batch/s]
Train E43: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4376, E=0.8203, KL=0.0887,
wKL=1.0000]
Train E43: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.4376, E=0.8203,
KL=0.0887, wKL=1.0000]
Train E43: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4334, E=0.8222,
KL=0.0875, wKL=1.0000]
Train E43: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.4334, E=0.8222,
KL=0.0875, wKL=1.0000]
Train E43: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.4085, E=0.8206,
KL=0.0869, wKL=1.0000]
Train E43: 12%|█▏ | 3/25 [00:04<00:31, 1.43s/batch, N=1.4085, E=0.8206,
KL=0.0869, wKL=1.0000]
Train E43: 12%|█▏ | 3/25 [00:05<00:31, 1.43s/batch, N=1.4554, E=0.8175,
KL=0.0886, wKL=1.0000]
Train E43: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4554, E=0.8175,
KL=0.0886, wKL=1.0000]
Train E43: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4750, E=0.8217,
KL=0.0868, wKL=1.0000]
Train E43: 20%|██ | 5/25 [00:07<00:28, 1.40s/batch, N=1.4750, E=0.8217,
KL=0.0868, wKL=1.0000]
Train E43: 20%|██ | 5/25 [00:08<00:28, 1.40s/batch, N=1.5020, E=0.8252,
KL=0.0877, wKL=1.0000]
Train E43: 24%|██▍ | 6/25 [00:08<00:27, 1.43s/batch, N=1.5020, E=0.8252,
KL=0.0877, wKL=1.0000]
Train E43: 24%|██▍ | 6/25 [00:09<00:27, 1.43s/batch, N=1.5210, E=0.8181,
KL=0.0879, wKL=1.0000]
Train E43: 28%|██▊ | 7/25 [00:09<00:25, 1.44s/batch, N=1.5210, E=0.8181,
KL=0.0879, wKL=1.0000]
Train E43: 28%|██▊ | 7/25 [00:11<00:25, 1.44s/batch, N=1.4709, E=0.8267,
KL=0.0871, wKL=1.0000]
Train E43: 32%|███▏ | 8/25 [00:11<00:24, 1.43s/batch, N=1.4709, E=0.8267,
KL=0.0871, wKL=1.0000]
Train E43: 32%|███▏ | 8/25 [00:12<00:24, 1.43s/batch, N=1.5285, E=0.8248,
KL=0.0880, wKL=1.0000]
Train E43: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.5285, E=0.8248,
KL=0.0880, wKL=1.0000]
Train E43: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.4978, E=0.8221,
KL=0.0889, wKL=1.0000]
Train E43: 40%|████ | 10/25 [00:14<00:20, 1.40s/batch, N=1.4978, E=0.8221,
KL=0.0889, wKL=1.0000]
Train E43: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4292, E=0.8223,
KL=0.0887, wKL=1.0000]
Train E43: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4292, E=0.8223,
KL=0.0887, wKL=1.0000]
Train E43: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4865, E=0.8197,
KL=0.0885, wKL=1.0000]
Train E43: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4865, E=0.8197,
KL=0.0885, wKL=1.0000]
Train E43: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4476, E=0.8265,
KL=0.0875, wKL=1.0000]
Train E43: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4476, E=0.8265,
KL=0.0875, wKL=1.0000]
Train E43: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4860, E=0.8204,
KL=0.0866, wKL=1.0000]
Train E43: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.4860, E=0.8204,
KL=0.0866, wKL=1.0000]
Train E43: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5598, E=0.8239,
KL=0.0890, wKL=1.0000]
Train E43: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5598, E=0.8239,
KL=0.0890, wKL=1.0000]
Train E43: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.4854, E=0.8245,
KL=0.0862, wKL=1.0000]
Train E43: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4854, E=0.8245,
KL=0.0862, wKL=1.0000]
Train E43: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4742, E=0.8242,
KL=0.0865, wKL=1.0000]
Train E43: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.4742, E=0.8242,
KL=0.0865, wKL=1.0000]
Train E43: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5693, E=0.8223,
KL=0.0868, wKL=1.0000]
Train E43: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5693, E=0.8223,
KL=0.0868, wKL=1.0000]
Train E43: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4116, E=0.8206,
KL=0.0861, wKL=1.0000]
Train E43: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4116, E=0.8206,
KL=0.0861, wKL=1.0000]
Train E43: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5046, E=0.8238,
KL=0.0863, wKL=1.0000]
Train E43: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5046, E=0.8238,
KL=0.0863, wKL=1.0000]
Train E43: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.4901, E=0.8210,
KL=0.0879, wKL=1.0000]
Train E43: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4901, E=0.8210,
KL=0.0879, wKL=1.0000]
Train E43: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5365, E=0.8272,
KL=0.0863, wKL=1.0000]
Train E43: 88%|████████▊ | 22/25 [00:30<00:04, 1.40s/batch, N=1.5365, E=0.8272,
KL=0.0863, wKL=1.0000]
Train E43: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4894, E=0.8221,
KL=0.0868, wKL=1.0000]
Train E43: 92%|█████████▏| 23/25 [00:32<00:03, 1.58s/batch, N=1.4894, E=0.8221,
KL=0.0868, wKL=1.0000]
Train E43: 92%|█████████▏| 23/25 [00:34<00:03, 1.58s/batch, N=1.4767, E=0.8214,
KL=0.0859, wKL=1.0000]
Train E43: 96%|█████████▌| 24/25 [00:34<00:01, 1.57s/batch, N=1.4767, E=0.8214,
KL=0.0859, wKL=1.0000]
Train E43: 96%|█████████▌| 24/25 [00:35<00:01, 1.57s/batch, N=1.6989, E=0.8144,
KL=0.0906, wKL=1.0000]
Train E43: 100%|██████████| 25/25 [00:35<00:00, 1.30s/batch, N=1.6989, E=0.8144,
KL=0.0906, wKL=1.0000]
Train E43: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.6989, E=0.8144,
KL=0.0906, wKL=1.0000]
1558.0s 161 [Epoch 043] Total: 2.3521 | N: 1.4860 | E: 0.8223 | KL(1.00×0.5):
0.0874
1592.4s 162 Train E44: 0%| | 0/25 [00:00<?, ?batch/s]
Train E44: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5135, E=0.8233, KL=0.0859,
wKL=1.0000]
Train E44: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.5135, E=0.8233,
KL=0.0859, wKL=1.0000]
Train E44: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.4919, E=0.8259,
KL=0.0862, wKL=1.0000]
Train E44: 8%|▊ | 2/25 [00:02<00:32, 1.41s/batch, N=1.4919, E=0.8259,
KL=0.0862, wKL=1.0000]
Train E44: 8%|▊ | 2/25 [00:04<00:32, 1.41s/batch, N=1.5137, E=0.8189,
KL=0.0862, wKL=1.0000]
Train E44: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5137, E=0.8189,
KL=0.0862, wKL=1.0000]
Train E44: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5374, E=0.8227,
KL=0.0866, wKL=1.0000]
Train E44: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5374, E=0.8227,
KL=0.0866, wKL=1.0000]
Train E44: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5020, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5020, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4688, E=0.8241,
KL=0.0847, wKL=1.0000]
Train E44: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4688, E=0.8241,
KL=0.0847, wKL=1.0000]
Train E44: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4678, E=0.8250,
KL=0.0855, wKL=1.0000]
Train E44: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4678, E=0.8250,
KL=0.0855, wKL=1.0000]
Train E44: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.5351, E=0.8278,
KL=0.0862, wKL=1.0000]
Train E44: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5351, E=0.8278,
KL=0.0862, wKL=1.0000]
Train E44: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4878, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4878, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.4413, E=0.8215,
KL=0.0852, wKL=1.0000]
Train E44: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4413, E=0.8215,
KL=0.0852, wKL=1.0000]
Train E44: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4637, E=0.8221,
KL=0.0855, wKL=1.0000]
Train E44: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4637, E=0.8221,
KL=0.0855, wKL=1.0000]
Train E44: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5963, E=0.8214,
KL=0.0871, wKL=1.0000]
Train E44: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.5963, E=0.8214,
KL=0.0871, wKL=1.0000]
Train E44: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.3942, E=0.8189,
KL=0.0854, wKL=1.0000]
Train E44: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.3942, E=0.8189,
KL=0.0854, wKL=1.0000]
Train E44: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.6081, E=0.8221,
KL=0.0860, wKL=1.0000]
Train E44: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.6081, E=0.8221,
KL=0.0860, wKL=1.0000]
Train E44: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.3839, E=0.8217,
KL=0.0862, wKL=1.0000]
Train E44: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.3839, E=0.8217,
KL=0.0862, wKL=1.0000]
Train E44: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.3449, E=0.8229,
KL=0.0844, wKL=1.0000]
Train E44: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.3449, E=0.8229,
KL=0.0844, wKL=1.0000]
Train E44: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4630, E=0.8242,
KL=0.0863, wKL=1.0000]
Train E44: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4630, E=0.8242,
KL=0.0863, wKL=1.0000]
Train E44: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4858, E=0.8177,
KL=0.0861, wKL=1.0000]
Train E44: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.4858, E=0.8177,
KL=0.0861, wKL=1.0000]
Train E44: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4612, E=0.8204,
KL=0.0858, wKL=1.0000]
Train E44: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4612, E=0.8204,
KL=0.0858, wKL=1.0000]
Train E44: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.4517, E=0.8208,
KL=0.0855, wKL=1.0000]
Train E44: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.4517, E=0.8208,
KL=0.0855, wKL=1.0000]
Train E44: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5385, E=0.8196,
KL=0.0849, wKL=1.0000]
Train E44: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5385, E=0.8196,
KL=0.0849, wKL=1.0000]
Train E44: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.5211, E=0.8272,
KL=0.0836, wKL=1.0000]
Train E44: 88%|████████▊ | 22/25 [00:30<00:04, 1.43s/batch, N=1.5211, E=0.8272,
KL=0.0836, wKL=1.0000]
Train E44: 88%|████████▊ | 22/25 [00:32<00:04, 1.43s/batch, N=1.5204, E=0.8202,
KL=0.0833, wKL=1.0000]
Train E44: 92%|█████████▏| 23/25 [00:32<00:02, 1.47s/batch, N=1.5204, E=0.8202,
KL=0.0833, wKL=1.0000]
Train E44: 92%|█████████▏| 23/25 [00:33<00:02, 1.47s/batch, N=1.4915, E=0.8211,
KL=0.0840, wKL=1.0000]
Train E44: 96%|█████████▌| 24/25 [00:33<00:01, 1.46s/batch, N=1.4915, E=0.8211,
KL=0.0840, wKL=1.0000]
Train E44: 96%|█████████▌| 24/25 [00:34<00:01, 1.46s/batch, N=1.4126, E=0.8153,
KL=0.0828, wKL=1.0000]
Train E44: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.4126, E=0.8153,
KL=0.0828, wKL=1.0000]
Train E44: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4126, E=0.8153,
KL=0.0828, wKL=1.0000]
1592.4s 163 [Epoch 044] Total: 2.3504 | N: 1.4856 | E: 0.8221 | KL(1.00×0.5):
0.0855
1627.6s 164 Train E45: 0%| | 0/25 [00:00<?, ?batch/s]
Train E45: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4566, E=0.8248, KL=0.0829,
wKL=1.0000]
Train E45: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4566, E=0.8248,
KL=0.0829, wKL=1.0000]
Train E45: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5751, E=0.8237,
KL=0.0836, wKL=1.0000]
Train E45: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5751, E=0.8237,
KL=0.0836, wKL=1.0000]
Train E45: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4525, E=0.8171,
KL=0.0842, wKL=1.0000]
Train E45: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4525, E=0.8171,
KL=0.0842, wKL=1.0000]
Train E45: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.3471, E=0.8157,
KL=0.0834, wKL=1.0000]
Train E45: 16%|█▌ | 4/25 [00:05<00:28, 1.35s/batch, N=1.3471, E=0.8157,
KL=0.0834, wKL=1.0000]
Train E45: 16%|█▌ | 4/25 [00:07<00:28, 1.35s/batch, N=1.4793, E=0.8214,
KL=0.0855, wKL=1.0000]
Train E45: 20%|██ | 5/25 [00:07<00:31, 1.58s/batch, N=1.4793, E=0.8214,
KL=0.0855, wKL=1.0000]
Train E45: 20%|██ | 5/25 [00:08<00:31, 1.58s/batch, N=1.5161, E=0.8222,
KL=0.0856, wKL=1.0000]
Train E45: 24%|██▍ | 6/25 [00:08<00:29, 1.54s/batch, N=1.5161, E=0.8222,
KL=0.0856, wKL=1.0000]
Train E45: 24%|██▍ | 6/25 [00:10<00:29, 1.54s/batch, N=1.4788, E=0.8249,
KL=0.0827, wKL=1.0000]
Train E45: 28%|██▊ | 7/25 [00:10<00:27, 1.50s/batch, N=1.4788, E=0.8249,
KL=0.0827, wKL=1.0000]
Train E45: 28%|██▊ | 7/25 [00:11<00:27, 1.50s/batch, N=1.4658, E=0.8233,
KL=0.0836, wKL=1.0000]
Train E45: 32%|███▏ | 8/25 [00:11<00:24, 1.47s/batch, N=1.4658, E=0.8233,
KL=0.0836, wKL=1.0000]
Train E45: 32%|███▏ | 8/25 [00:13<00:24, 1.47s/batch, N=1.4597, E=0.8182,
KL=0.0835, wKL=1.0000]
Train E45: 36%|███▌ | 9/25 [00:13<00:23, 1.46s/batch, N=1.4597, E=0.8182,
KL=0.0835, wKL=1.0000]
Train E45: 36%|███▌ | 9/25 [00:14<00:23, 1.46s/batch, N=1.5104, E=0.8276,
KL=0.0817, wKL=1.0000]
Train E45: 40%|████ | 10/25 [00:14<00:21, 1.45s/batch, N=1.5104, E=0.8276,
KL=0.0817, wKL=1.0000]
Train E45: 40%|████ | 10/25 [00:15<00:21, 1.45s/batch, N=1.4370, E=0.8272,
KL=0.0828, wKL=1.0000]
Train E45: 44%|████▍ | 11/25 [00:15<00:19, 1.43s/batch, N=1.4370, E=0.8272,
KL=0.0828, wKL=1.0000]
Train E45: 44%|████▍ | 11/25 [00:17<00:19, 1.43s/batch, N=1.5280, E=0.8220,
KL=0.0831, wKL=1.0000]
Train E45: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.5280, E=0.8220,
KL=0.0831, wKL=1.0000]
Train E45: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5195, E=0.8234,
KL=0.0818, wKL=1.0000]
Train E45: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5195, E=0.8234,
KL=0.0818, wKL=1.0000]
Train E45: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4473, E=0.8211,
KL=0.0823, wKL=1.0000]
Train E45: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4473, E=0.8211,
KL=0.0823, wKL=1.0000]
Train E45: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.6315, E=0.8238,
KL=0.0830, wKL=1.0000]
Train E45: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.6315, E=0.8238,
KL=0.0830, wKL=1.0000]
Train E45: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.5951, E=0.8163,
KL=0.0837, wKL=1.0000]
Train E45: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5951, E=0.8163,
KL=0.0837, wKL=1.0000]
Train E45: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.4949, E=0.8217,
KL=0.0831, wKL=1.0000]
Train E45: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4949, E=0.8217,
KL=0.0831, wKL=1.0000]
Train E45: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.4731, E=0.8241,
KL=0.0812, wKL=1.0000]
Train E45: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4731, E=0.8241,
KL=0.0812, wKL=1.0000]
Train E45: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4672, E=0.8301,
KL=0.0817, wKL=1.0000]
Train E45: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.4672, E=0.8301,
KL=0.0817, wKL=1.0000]
Train E45: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4109, E=0.8223,
KL=0.0828, wKL=1.0000]
Train E45: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.4109, E=0.8223,
KL=0.0828, wKL=1.0000]
Train E45: 80%|████████ | 20/25 [00:30<00:07, 1.50s/batch, N=1.4622, E=0.8208,
KL=0.0813, wKL=1.0000]
Train E45: 84%|████████▍ | 21/25 [00:30<00:05, 1.47s/batch, N=1.4622, E=0.8208,
KL=0.0813, wKL=1.0000]
Train E45: 84%|████████▍ | 21/25 [00:31<00:05, 1.47s/batch, N=1.4943, E=0.8205,
KL=0.0822, wKL=1.0000]
Train E45: 88%|████████▊ | 22/25 [00:31<00:04, 1.45s/batch, N=1.4943, E=0.8205,
KL=0.0822, wKL=1.0000]
Train E45: 88%|████████▊ | 22/25 [00:33<00:04, 1.45s/batch, N=1.4580, E=0.8217,
KL=0.0820, wKL=1.0000]
Train E45: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.4580, E=0.8217,
KL=0.0820, wKL=1.0000]
Train E45: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.5441, E=0.8238,
KL=0.0818, wKL=1.0000]
Train E45: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.5441, E=0.8238,
KL=0.0818, wKL=1.0000]
Train E45: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.4128, E=0.8160,
KL=0.0819, wKL=1.0000]
Train E45: 100%|██████████| 25/25 [00:35<00:00, 1.18s/batch, N=1.4128, E=0.8160,
KL=0.0819, wKL=1.0000]
Train E45: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4128, E=0.8160,
KL=0.0819, wKL=1.0000]
1627.6s 165 [Epoch 045] Total: 2.3502 | N: 1.4864 | E: 0.8223 | KL(1.00×0.5):
0.0829
1662.7s 166 Train E46: 0%| | 0/25 [00:00<?, ?batch/s]
Train E46: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4966, E=0.8271, KL=0.0809,
wKL=1.0000]
Train E46: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4966, E=0.8271,
KL=0.0809, wKL=1.0000]
Train E46: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.6388, E=0.8261,
KL=0.0831, wKL=1.0000]
Train E46: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.6388, E=0.8261,
KL=0.0831, wKL=1.0000]
Train E46: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4539, E=0.8223,
KL=0.0807, wKL=1.0000]
Train E46: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4539, E=0.8223,
KL=0.0807, wKL=1.0000]
Train E46: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4620, E=0.8223,
KL=0.0815, wKL=1.0000]
Train E46: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4620, E=0.8223,
KL=0.0815, wKL=1.0000]
Train E46: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4527, E=0.8231,
KL=0.0815, wKL=1.0000]
Train E46: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4527, E=0.8231,
KL=0.0815, wKL=1.0000]
Train E46: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4528, E=0.8258,
KL=0.0810, wKL=1.0000]
Train E46: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4528, E=0.8258,
KL=0.0810, wKL=1.0000]
Train E46: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4294, E=0.8198,
KL=0.0808, wKL=1.0000]
Train E46: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4294, E=0.8198,
KL=0.0808, wKL=1.0000]
Train E46: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5097, E=0.8203,
KL=0.0816, wKL=1.0000]
Train E46: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.5097, E=0.8203,
KL=0.0816, wKL=1.0000]
Train E46: 32%|███▏ | 8/25 [00:13<00:23, 1.41s/batch, N=1.4672, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 36%|███▌ | 9/25 [00:13<00:25, 1.62s/batch, N=1.4672, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 36%|███▌ | 9/25 [00:14<00:25, 1.62s/batch, N=1.5093, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 40%|████ | 10/25 [00:14<00:23, 1.54s/batch, N=1.5093, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 40%|████ | 10/25 [00:16<00:23, 1.54s/batch, N=1.5525, E=0.8247,
KL=0.0813, wKL=1.0000]
Train E46: 44%|████▍ | 11/25 [00:16<00:20, 1.50s/batch, N=1.5525, E=0.8247,
KL=0.0813, wKL=1.0000]
Train E46: 44%|████▍ | 11/25 [00:17<00:20, 1.50s/batch, N=1.4712, E=0.8233,
KL=0.0796, wKL=1.0000]
Train E46: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.4712, E=0.8233,
KL=0.0796, wKL=1.0000]
Train E46: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.4193, E=0.8175,
KL=0.0798, wKL=1.0000]
Train E46: 52%|█████▏ | 13/25 [00:18<00:17, 1.44s/batch, N=1.4193, E=0.8175,
KL=0.0798, wKL=1.0000]
Train E46: 52%|█████▏ | 13/25 [00:20<00:17, 1.44s/batch, N=1.4009, E=0.8212,
KL=0.0791, wKL=1.0000]
Train E46: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4009, E=0.8212,
KL=0.0791, wKL=1.0000]
Train E46: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.4093, E=0.8173,
KL=0.0792, wKL=1.0000]
Train E46: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4093, E=0.8173,
KL=0.0792, wKL=1.0000]
Train E46: 60%|██████ | 15/25 [00:23<00:14, 1.42s/batch, N=1.5214, E=0.8253,
KL=0.0805, wKL=1.0000]
Train E46: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.5214, E=0.8253,
KL=0.0805, wKL=1.0000]
Train E46: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.5000, E=0.8233,
KL=0.0791, wKL=1.0000]
Train E46: 68%|██████▊ | 17/25 [00:24<00:11, 1.48s/batch, N=1.5000, E=0.8233,
KL=0.0791, wKL=1.0000]
Train E46: 68%|██████▊ | 17/25 [00:26<00:11, 1.48s/batch, N=1.4953, E=0.8156,
KL=0.0802, wKL=1.0000]
Train E46: 72%|███████▏ | 18/25 [00:26<00:10, 1.48s/batch, N=1.4953, E=0.8156,
KL=0.0802, wKL=1.0000]
Train E46: 72%|███████▏ | 18/25 [00:27<00:10, 1.48s/batch, N=1.4897, E=0.8244,
KL=0.0796, wKL=1.0000]
Train E46: 76%|███████▌ | 19/25 [00:27<00:08, 1.45s/batch, N=1.4897, E=0.8244,
KL=0.0796, wKL=1.0000]
Train E46: 76%|███████▌ | 19/25 [00:28<00:08, 1.45s/batch, N=1.5558, E=0.8178,
KL=0.0799, wKL=1.0000]
Train E46: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5558, E=0.8178,
KL=0.0799, wKL=1.0000]
Train E46: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4294, E=0.8221,
KL=0.0795, wKL=1.0000]
Train E46: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4294, E=0.8221,
KL=0.0795, wKL=1.0000]
Train E46: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5338, E=0.8214,
KL=0.0790, wKL=1.0000]
Train E46: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5338, E=0.8214,
KL=0.0790, wKL=1.0000]
Train E46: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.4331, E=0.8269,
KL=0.0784, wKL=1.0000]
Train E46: 92%|█████████▏| 23/25 [00:33<00:02, 1.43s/batch, N=1.4331, E=0.8269,
KL=0.0784, wKL=1.0000]
Train E46: 92%|█████████▏| 23/25 [00:34<00:02, 1.43s/batch, N=1.5634, E=0.8249,
KL=0.0794, wKL=1.0000]
Train E46: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5634, E=0.8249,
KL=0.0794, wKL=1.0000]
Train E46: 96%|█████████▌| 24/25 [00:35<00:01, 1.41s/batch, N=1.5010, E=0.8203,
KL=0.0796, wKL=1.0000]
Train E46: 100%|██████████| 25/25 [00:35<00:00, 1.18s/batch, N=1.5010, E=0.8203,
KL=0.0796, wKL=1.0000]
Train E46: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5010, E=0.8203,
KL=0.0796, wKL=1.0000]
1662.7s 167 [Epoch 046] Total: 2.3482 | N: 1.4856 | E: 0.8224 | KL(1.00×0.5):
0.0803
1697.4s 168 Train E47: 0%| | 0/25 [00:00<?, ?batch/s]
Train E47: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5041, E=0.8229, KL=0.0790,
wKL=1.0000]
Train E47: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5041, E=0.8229,
KL=0.0790, wKL=1.0000]
Train E47: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4062, E=0.8228,
KL=0.0783, wKL=1.0000]
Train E47: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4062, E=0.8228,
KL=0.0783, wKL=1.0000]
Train E47: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.5236, E=0.8230,
KL=0.0788, wKL=1.0000]
Train E47: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5236, E=0.8230,
KL=0.0788, wKL=1.0000]
Train E47: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.3876, E=0.8250,
KL=0.0786, wKL=1.0000]
Train E47: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.3876, E=0.8250,
KL=0.0786, wKL=1.0000]
Train E47: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4904, E=0.8264,
KL=0.0785, wKL=1.0000]
Train E47: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4904, E=0.8264,
KL=0.0785, wKL=1.0000]
Train E47: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4356, E=0.8172,
KL=0.0777, wKL=1.0000]
Train E47: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4356, E=0.8172,
KL=0.0777, wKL=1.0000]
Train E47: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5207, E=0.8126,
KL=0.0784, wKL=1.0000]
Train E47: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.5207, E=0.8126,
KL=0.0784, wKL=1.0000]
Train E47: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4146, E=0.8214,
KL=0.0768, wKL=1.0000]
Train E47: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4146, E=0.8214,
KL=0.0768, wKL=1.0000]
Train E47: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5255, E=0.8271,
KL=0.0762, wKL=1.0000]
Train E47: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.5255, E=0.8271,
KL=0.0762, wKL=1.0000]
Train E47: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4402, E=0.8255,
KL=0.0775, wKL=1.0000]
Train E47: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4402, E=0.8255,
KL=0.0775, wKL=1.0000]
Train E47: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5578, E=0.8218,
KL=0.0780, wKL=1.0000]
Train E47: 44%|████▍ | 11/25 [00:15<00:19, 1.37s/batch, N=1.5578, E=0.8218,
KL=0.0780, wKL=1.0000]
Train E47: 44%|████▍ | 11/25 [00:16<00:19, 1.37s/batch, N=1.4389, E=0.8236,
KL=0.0778, wKL=1.0000]
Train E47: 48%|████▊ | 12/25 [00:16<00:17, 1.37s/batch, N=1.4389, E=0.8236,
KL=0.0778, wKL=1.0000]
Train E47: 48%|████▊ | 12/25 [00:17<00:17, 1.37s/batch, N=1.4299, E=0.8229,
KL=0.0773, wKL=1.0000]
Train E47: 52%|█████▏ | 13/25 [00:17<00:16, 1.40s/batch, N=1.4299, E=0.8229,
KL=0.0773, wKL=1.0000]
Train E47: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5222, E=0.8264,
KL=0.0771, wKL=1.0000]
Train E47: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5222, E=0.8264,
KL=0.0771, wKL=1.0000]
Train E47: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4223, E=0.8198,
KL=0.0779, wKL=1.0000]
Train E47: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.4223, E=0.8198,
KL=0.0779, wKL=1.0000]
Train E47: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5693, E=0.8243,
KL=0.0800, wKL=1.0000]
Train E47: 64%|██████▍ | 16/25 [00:22<00:13, 1.47s/batch, N=1.5693, E=0.8243,
KL=0.0800, wKL=1.0000]
Train E47: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5303, E=0.8221,
KL=0.0770, wKL=1.0000]
Train E47: 68%|██████▊ | 17/25 [00:24<00:13, 1.63s/batch, N=1.5303, E=0.8221,
KL=0.0770, wKL=1.0000]
Train E47: 68%|██████▊ | 17/25 [00:25<00:13, 1.63s/batch, N=1.4786, E=0.8264,
KL=0.0772, wKL=1.0000]
Train E47: 72%|███████▏ | 18/25 [00:25<00:10, 1.56s/batch, N=1.4786, E=0.8264,
KL=0.0772, wKL=1.0000]
Train E47: 72%|███████▏ | 18/25 [00:27<00:10, 1.56s/batch, N=1.5342, E=0.8165,
KL=0.0779, wKL=1.0000]
Train E47: 76%|███████▌ | 19/25 [00:27<00:09, 1.50s/batch, N=1.5342, E=0.8165,
KL=0.0779, wKL=1.0000]
Train E47: 76%|███████▌ | 19/25 [00:28<00:09, 1.50s/batch, N=1.5173, E=0.8201,
KL=0.0778, wKL=1.0000]
Train E47: 80%|████████ | 20/25 [00:28<00:07, 1.47s/batch, N=1.5173, E=0.8201,
KL=0.0778, wKL=1.0000]
Train E47: 80%|████████ | 20/25 [00:29<00:07, 1.47s/batch, N=1.4978, E=0.8211,
KL=0.0773, wKL=1.0000]
Train E47: 84%|████████▍ | 21/25 [00:29<00:05, 1.44s/batch, N=1.4978, E=0.8211,
KL=0.0773, wKL=1.0000]
Train E47: 84%|████████▍ | 21/25 [00:31<00:05, 1.44s/batch, N=1.4767, E=0.8249,
KL=0.0755, wKL=1.0000]
Train E47: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.4767, E=0.8249,
KL=0.0755, wKL=1.0000]
Train E47: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5094, E=0.8205,
KL=0.0748, wKL=1.0000]
Train E47: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5094, E=0.8205,
KL=0.0748, wKL=1.0000]
Train E47: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5368, E=0.8225,
KL=0.0760, wKL=1.0000]
Train E47: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5368, E=0.8225,
KL=0.0760, wKL=1.0000]
Train E47: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4317, E=0.8261,
KL=0.0752, wKL=1.0000]
Train E47: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4317, E=0.8261,
KL=0.0752, wKL=1.0000]
Train E47: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4317, E=0.8261,
KL=0.0752, wKL=1.0000]
1697.4s 169 [Epoch 047] Total: 2.3465 | N: 1.4853 | E: 0.8224 | KL(1.00×0.5):
0.0775
1732.1s 170 Train E48: 0%| | 0/25 [00:00<?, ?batch/s]
Train E48: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4546, E=0.8230, KL=0.0749,
wKL=1.0000]
Train E48: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4546, E=0.8230,
KL=0.0749, wKL=1.0000]
Train E48: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5069, E=0.8200,
KL=0.0758, wKL=1.0000]
Train E48: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5069, E=0.8200,
KL=0.0758, wKL=1.0000]
Train E48: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4519, E=0.8235,
KL=0.0762, wKL=1.0000]
Train E48: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4519, E=0.8235,
KL=0.0762, wKL=1.0000]
Train E48: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5137, E=0.8239,
KL=0.0764, wKL=1.0000]
Train E48: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5137, E=0.8239,
KL=0.0764, wKL=1.0000]
Train E48: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5730, E=0.8238,
KL=0.0769, wKL=1.0000]
Train E48: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5730, E=0.8238,
KL=0.0769, wKL=1.0000]
Train E48: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4092, E=0.8225,
KL=0.0763, wKL=1.0000]
Train E48: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.4092, E=0.8225,
KL=0.0763, wKL=1.0000]
Train E48: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.4801, E=0.8253,
KL=0.0755, wKL=1.0000]
Train E48: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4801, E=0.8253,
KL=0.0755, wKL=1.0000]
Train E48: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4735, E=0.8235,
KL=0.0765, wKL=1.0000]
Train E48: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.4735, E=0.8235,
KL=0.0765, wKL=1.0000]
Train E48: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.5277, E=0.8214,
KL=0.0778, wKL=1.0000]
Train E48: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.5277, E=0.8214,
KL=0.0778, wKL=1.0000]
Train E48: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4368, E=0.8221,
KL=0.0733, wKL=1.0000]
Train E48: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4368, E=0.8221,
KL=0.0733, wKL=1.0000]
Train E48: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4703, E=0.8245,
KL=0.0745, wKL=1.0000]
Train E48: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4703, E=0.8245,
KL=0.0745, wKL=1.0000]
Train E48: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4519, E=0.8240,
KL=0.0728, wKL=1.0000]
Train E48: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4519, E=0.8240,
KL=0.0728, wKL=1.0000]
Train E48: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5084, E=0.8214,
KL=0.0736, wKL=1.0000]
Train E48: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5084, E=0.8214,
KL=0.0736, wKL=1.0000]
Train E48: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4798, E=0.8219,
KL=0.0738, wKL=1.0000]
Train E48: 56%|█████▌ | 14/25 [00:19<00:15, 1.43s/batch, N=1.4798, E=0.8219,
KL=0.0738, wKL=1.0000]
Train E48: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4692, E=0.8182,
KL=0.0735, wKL=1.0000]
Train E48: 60%|██████ | 15/25 [00:20<00:14, 1.40s/batch, N=1.4692, E=0.8182,
KL=0.0735, wKL=1.0000]
Train E48: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.5161, E=0.8219,
KL=0.0728, wKL=1.0000]
Train E48: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5161, E=0.8219,
KL=0.0728, wKL=1.0000]
Train E48: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4922, E=0.8257,
KL=0.0728, wKL=1.0000]
Train E48: 68%|██████▊ | 17/25 [00:23<00:11, 1.42s/batch, N=1.4922, E=0.8257,
KL=0.0728, wKL=1.0000]
Train E48: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4482, E=0.8237,
KL=0.0725, wKL=1.0000]
Train E48: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4482, E=0.8237,
KL=0.0725, wKL=1.0000]
Train E48: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.5042, E=0.8226,
KL=0.0728, wKL=1.0000]
Train E48: 76%|███████▌ | 19/25 [00:27<00:09, 1.60s/batch, N=1.5042, E=0.8226,
KL=0.0728, wKL=1.0000]
Train E48: 76%|███████▌ | 19/25 [00:28<00:09, 1.60s/batch, N=1.4764, E=0.8249,
KL=0.0731, wKL=1.0000]
Train E48: 80%|████████ | 20/25 [00:28<00:07, 1.53s/batch, N=1.4764, E=0.8249,
KL=0.0731, wKL=1.0000]
Train E48: 80%|████████ | 20/25 [00:29<00:07, 1.53s/batch, N=1.4676, E=0.8211,
KL=0.0728, wKL=1.0000]
Train E48: 84%|████████▍ | 21/25 [00:29<00:05, 1.50s/batch, N=1.4676, E=0.8211,
KL=0.0728, wKL=1.0000]
Train E48: 84%|████████▍ | 21/25 [00:31<00:05, 1.50s/batch, N=1.4947, E=0.8231,
KL=0.0716, wKL=1.0000]
Train E48: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4947, E=0.8231,
KL=0.0716, wKL=1.0000]
Train E48: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.5155, E=0.8196,
KL=0.0756, wKL=1.0000]
Train E48: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.5155, E=0.8196,
KL=0.0756, wKL=1.0000]
Train E48: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.5301, E=0.8187,
KL=0.0733, wKL=1.0000]
Train E48: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5301, E=0.8187,
KL=0.0733, wKL=1.0000]
Train E48: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4765, E=0.8208,
KL=0.0715, wKL=1.0000]
Train E48: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.4765, E=0.8208,
KL=0.0715, wKL=1.0000]
Train E48: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4765, E=0.8208,
KL=0.0715, wKL=1.0000]
1732.1s 171 [Epoch 048] Total: 2.3450 | N: 1.4854 | E: 0.8225 | KL(1.00×0.5):
0.0743
1767.0s 172 Train E49: 0%| | 0/25 [00:00<?, ?batch/s]
Train E49: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4446, E=0.8252, KL=0.0722,
wKL=1.0000]
Train E49: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4446, E=0.8252,
KL=0.0722, wKL=1.0000]
Train E49: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5524, E=0.8241,
KL=0.0718, wKL=1.0000]
Train E49: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5524, E=0.8241,
KL=0.0718, wKL=1.0000]
Train E49: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.5319, E=0.8233,
KL=0.0727, wKL=1.0000]
Train E49: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5319, E=0.8233,
KL=0.0727, wKL=1.0000]
Train E49: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5242, E=0.8232,
KL=0.0719, wKL=1.0000]
Train E49: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.5242, E=0.8232,
KL=0.0719, wKL=1.0000]
Train E49: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4718, E=0.8235,
KL=0.0703, wKL=1.0000]
Train E49: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4718, E=0.8235,
KL=0.0703, wKL=1.0000]
Train E49: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5581, E=0.8252,
KL=0.0707, wKL=1.0000]
Train E49: 24%|██▍ | 6/25 [00:08<00:25, 1.36s/batch, N=1.5581, E=0.8252,
KL=0.0707, wKL=1.0000]
Train E49: 24%|██▍ | 6/25 [00:09<00:25, 1.36s/batch, N=1.4666, E=0.8218,
KL=0.0705, wKL=1.0000]
Train E49: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4666, E=0.8218,
KL=0.0705, wKL=1.0000]
Train E49: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4446, E=0.8248,
KL=0.0686, wKL=1.0000]
Train E49: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4446, E=0.8248,
KL=0.0686, wKL=1.0000]
Train E49: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5240, E=0.8243,
KL=0.0712, wKL=1.0000]
Train E49: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5240, E=0.8243,
KL=0.0712, wKL=1.0000]
Train E49: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5354, E=0.8227,
KL=0.0694, wKL=1.0000]
Train E49: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.5354, E=0.8227,
KL=0.0694, wKL=1.0000]
Train E49: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.3782, E=0.8197,
KL=0.0694, wKL=1.0000]
Train E49: 44%|████▍ | 11/25 [00:15<00:20, 1.45s/batch, N=1.3782, E=0.8197,
KL=0.0694, wKL=1.0000]
Train E49: 44%|████▍ | 11/25 [00:16<00:20, 1.45s/batch, N=1.5673, E=0.8279,
KL=0.0697, wKL=1.0000]
Train E49: 48%|████▊ | 12/25 [00:16<00:18, 1.44s/batch, N=1.5673, E=0.8279,
KL=0.0697, wKL=1.0000]
Train E49: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.3759, E=0.8246,
KL=0.0693, wKL=1.0000]
Train E49: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.3759, E=0.8246,
KL=0.0693, wKL=1.0000]
Train E49: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.5125, E=0.8227,
KL=0.0704, wKL=1.0000]
Train E49: 56%|█████▌ | 14/25 [00:19<00:15, 1.44s/batch, N=1.5125, E=0.8227,
KL=0.0704, wKL=1.0000]
Train E49: 56%|█████▌ | 14/25 [00:21<00:15, 1.44s/batch, N=1.4641, E=0.8244,
KL=0.0685, wKL=1.0000]
Train E49: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4641, E=0.8244,
KL=0.0685, wKL=1.0000]
Train E49: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.5103, E=0.8148,
KL=0.0710, wKL=1.0000]
Train E49: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5103, E=0.8148,
KL=0.0710, wKL=1.0000]
Train E49: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4939, E=0.8181,
KL=0.0697, wKL=1.0000]
Train E49: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4939, E=0.8181,
KL=0.0697, wKL=1.0000]
Train E49: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4437, E=0.8232,
KL=0.0683, wKL=1.0000]
Train E49: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4437, E=0.8232,
KL=0.0683, wKL=1.0000]
Train E49: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4717, E=0.8259,
KL=0.0691, wKL=1.0000]
Train E49: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4717, E=0.8259,
KL=0.0691, wKL=1.0000]
Train E49: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.5957, E=0.8254,
KL=0.0716, wKL=1.0000]
Train E49: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.5957, E=0.8254,
KL=0.0716, wKL=1.0000]
Train E49: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.3224, E=0.8180,
KL=0.0671, wKL=1.0000]
Train E49: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.3224, E=0.8180,
KL=0.0671, wKL=1.0000]
Train E49: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5174, E=0.8230,
KL=0.0697, wKL=1.0000]
Train E49: 88%|████████▊ | 22/25 [00:30<00:04, 1.39s/batch, N=1.5174, E=0.8230,
KL=0.0697, wKL=1.0000]
Train E49: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4841, E=0.8243,
KL=0.0677, wKL=1.0000]
Train E49: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4841, E=0.8243,
KL=0.0677, wKL=1.0000]
Train E49: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4632, E=0.8186,
KL=0.0672, wKL=1.0000]
Train E49: 96%|█████████▌| 24/25 [00:34<00:01, 1.58s/batch, N=1.4632, E=0.8186,
KL=0.0672, wKL=1.0000]
Train E49: 96%|█████████▌| 24/25 [00:34<00:01, 1.58s/batch, N=1.4540, E=0.8214,
KL=0.0676, wKL=1.0000]
Train E49: 100%|██████████| 25/25 [00:34<00:00, 1.29s/batch, N=1.4540, E=0.8214,
KL=0.0676, wKL=1.0000]
Train E49: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4540, E=0.8214,
KL=0.0676, wKL=1.0000]
1767.0s 173 [Epoch 049] Total: 2.3428 | N: 1.4850 | E: 0.8228 | KL(1.00×0.5):
0.0699
1801.1s 174 Train E50: 0%| | 0/25 [00:00<?, ?batch/s]
Train E50: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5561, E=0.8204, KL=0.0672,
wKL=1.0000]
Train E50: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5561, E=0.8204,
KL=0.0672, wKL=1.0000]
Train E50: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4961, E=0.8182,
KL=0.0662, wKL=1.0000]
Train E50: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4961, E=0.8182,
KL=0.0662, wKL=1.0000]
Train E50: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4259, E=0.8215,
KL=0.0659, wKL=1.0000]
Train E50: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4259, E=0.8215,
KL=0.0659, wKL=1.0000]
Train E50: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4951, E=0.8241,
KL=0.0660, wKL=1.0000]
Train E50: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4951, E=0.8241,
KL=0.0660, wKL=1.0000]
Train E50: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4179, E=0.8273,
KL=0.0643, wKL=1.0000]
Train E50: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4179, E=0.8273,
KL=0.0643, wKL=1.0000]
Train E50: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4940, E=0.8202,
KL=0.0660, wKL=1.0000]
Train E50: 24%|██▍ | 6/25 [00:08<00:26, 1.37s/batch, N=1.4940, E=0.8202,
KL=0.0660, wKL=1.0000]
Train E50: 24%|██▍ | 6/25 [00:09<00:26, 1.37s/batch, N=1.4698, E=0.8197,
KL=0.0655, wKL=1.0000]
Train E50: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4698, E=0.8197,
KL=0.0655, wKL=1.0000]
Train E50: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4582, E=0.8228,
KL=0.0664, wKL=1.0000]
Train E50: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4582, E=0.8228,
KL=0.0664, wKL=1.0000]
Train E50: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5089, E=0.8228,
KL=0.0667, wKL=1.0000]
Train E50: 36%|███▌ | 9/25 [00:12<00:23, 1.44s/batch, N=1.5089, E=0.8228,
KL=0.0667, wKL=1.0000]
Train E50: 36%|███▌ | 9/25 [00:14<00:23, 1.44s/batch, N=1.4875, E=0.8236,
KL=0.0669, wKL=1.0000]
Train E50: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.4875, E=0.8236,
KL=0.0669, wKL=1.0000]
Train E50: 40%|████ | 10/25 [00:15<00:21, 1.44s/batch, N=1.5300, E=0.8237,
KL=0.0678, wKL=1.0000]
Train E50: 44%|████▍ | 11/25 [00:15<00:20, 1.43s/batch, N=1.5300, E=0.8237,
KL=0.0678, wKL=1.0000]
Train E50: 44%|████▍ | 11/25 [00:16<00:20, 1.43s/batch, N=1.4916, E=0.8245,
KL=0.0674, wKL=1.0000]
Train E50: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4916, E=0.8245,
KL=0.0674, wKL=1.0000]
Train E50: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4580, E=0.8196,
KL=0.0668, wKL=1.0000]
Train E50: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.4580, E=0.8196,
KL=0.0668, wKL=1.0000]
Train E50: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4585, E=0.8247,
KL=0.0662, wKL=1.0000]
Train E50: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4585, E=0.8247,
KL=0.0662, wKL=1.0000]
Train E50: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4893, E=0.8228,
KL=0.0665, wKL=1.0000]
Train E50: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.4893, E=0.8228,
KL=0.0665, wKL=1.0000]
Train E50: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.5102, E=0.8212,
KL=0.0670, wKL=1.0000]
Train E50: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5102, E=0.8212,
KL=0.0670, wKL=1.0000]
Train E50: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5306, E=0.8231,
KL=0.0663, wKL=1.0000]
Train E50: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5306, E=0.8231,
KL=0.0663, wKL=1.0000]
Train E50: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4957, E=0.8194,
KL=0.0654, wKL=1.0000]
Train E50: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.4957, E=0.8194,
KL=0.0654, wKL=1.0000]
Train E50: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4519, E=0.8215,
KL=0.0645, wKL=1.0000]
Train E50: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4519, E=0.8215,
KL=0.0645, wKL=1.0000]
Train E50: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.5363, E=0.8255,
KL=0.0645, wKL=1.0000]
Train E50: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.5363, E=0.8255,
KL=0.0645, wKL=1.0000]
Train E50: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5143, E=0.8225,
KL=0.0664, wKL=1.0000]
Train E50: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5143, E=0.8225,
KL=0.0664, wKL=1.0000]
Train E50: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.3579, E=0.8151,
KL=0.0641, wKL=1.0000]
Train E50: 88%|████████▊ | 22/25 [00:30<00:04, 1.38s/batch, N=1.3579, E=0.8151,
KL=0.0641, wKL=1.0000]
Train E50: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.4420, E=0.8223,
KL=0.0641, wKL=1.0000]
Train E50: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.4420, E=0.8223,
KL=0.0641, wKL=1.0000]
Train E50: 92%|█████████▏| 23/25 [00:33<00:02, 1.38s/batch, N=1.5206, E=0.8216,
KL=0.0642, wKL=1.0000]
Train E50: 96%|█████████▌| 24/25 [00:33<00:01, 1.39s/batch, N=1.5206, E=0.8216,
KL=0.0642, wKL=1.0000]
Train E50: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5017, E=0.8153,
KL=0.0661, wKL=1.0000]
Train E50: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5017, E=0.8153,
KL=0.0661, wKL=1.0000]
Train E50: 100%|██████████| 25/25 [00:34<00:00, 1.36s/batch, N=1.5017, E=0.8153,
KL=0.0661, wKL=1.0000]
1801.1s 175 [Epoch 050] Total: 2.3384 | N: 1.4835 | E: 0.8219 | KL(1.00×0.5):
0.0659
1801.1s 176 Saved checkpoint: /kaggle/working/checkpoints/gvae_50_epoch050.pt
1835.6s 177 Train E51: 0%| | 0/25 [00:00<?, ?batch/s]
Train E51: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4596, E=0.8239, KL=0.0659,
wKL=1.0000]
Train E51: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4596, E=0.8239,
KL=0.0659, wKL=1.0000]
Train E51: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5342, E=0.8252,
KL=0.0650, wKL=1.0000]
Train E51: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.5342, E=0.8252,
KL=0.0650, wKL=1.0000]
Train E51: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.4610, E=0.8193,
KL=0.0647, wKL=1.0000]
Train E51: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4610, E=0.8193,
KL=0.0647, wKL=1.0000]
Train E51: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4688, E=0.8237,
KL=0.0647, wKL=1.0000]
Train E51: 16%|█▌ | 4/25 [00:05<00:28, 1.34s/batch, N=1.4688, E=0.8237,
KL=0.0647, wKL=1.0000]
Train E51: 16%|█▌ | 4/25 [00:07<00:28, 1.34s/batch, N=1.5779, E=0.8233,
KL=0.0662, wKL=1.0000]
Train E51: 20%|██ | 5/25 [00:07<00:31, 1.57s/batch, N=1.5779, E=0.8233,
KL=0.0662, wKL=1.0000]
Train E51: 20%|██ | 5/25 [00:08<00:31, 1.57s/batch, N=1.4651, E=0.8290,
KL=0.0662, wKL=1.0000]
Train E51: 24%|██▍ | 6/25 [00:08<00:28, 1.50s/batch, N=1.4651, E=0.8290,
KL=0.0662, wKL=1.0000]
Train E51: 24%|██▍ | 6/25 [00:10<00:28, 1.50s/batch, N=1.4969, E=0.8176,
KL=0.0658, wKL=1.0000]
Train E51: 28%|██▊ | 7/25 [00:10<00:27, 1.53s/batch, N=1.4969, E=0.8176,
KL=0.0658, wKL=1.0000]
Train E51: 28%|██▊ | 7/25 [00:11<00:27, 1.53s/batch, N=1.4891, E=0.8194,
KL=0.0653, wKL=1.0000]
Train E51: 32%|███▏ | 8/25 [00:11<00:25, 1.49s/batch, N=1.4891, E=0.8194,
KL=0.0653, wKL=1.0000]
Train E51: 32%|███▏ | 8/25 [00:13<00:25, 1.49s/batch, N=1.4327, E=0.8201,
KL=0.0661, wKL=1.0000]
Train E51: 36%|███▌ | 9/25 [00:13<00:23, 1.45s/batch, N=1.4327, E=0.8201,
KL=0.0661, wKL=1.0000]
Train E51: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.5221, E=0.8225,
KL=0.0653, wKL=1.0000]
Train E51: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.5221, E=0.8225,
KL=0.0653, wKL=1.0000]
Train E51: 40%|████ | 10/25 [00:15<00:21, 1.44s/batch, N=1.4690, E=0.8167,
KL=0.0650, wKL=1.0000]
Train E51: 44%|████▍ | 11/25 [00:15<00:19, 1.43s/batch, N=1.4690, E=0.8167,
KL=0.0650, wKL=1.0000]
Train E51: 44%|████▍ | 11/25 [00:17<00:19, 1.43s/batch, N=1.4986, E=0.8261,
KL=0.0640, wKL=1.0000]
Train E51: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4986, E=0.8261,
KL=0.0640, wKL=1.0000]
Train E51: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5334, E=0.8180,
KL=0.0643, wKL=1.0000]
Train E51: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5334, E=0.8180,
KL=0.0643, wKL=1.0000]
Train E51: 52%|█████▏ | 13/25 [00:20<00:16, 1.40s/batch, N=1.4541, E=0.8207,
KL=0.0642, wKL=1.0000]
Train E51: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4541, E=0.8207,
KL=0.0642, wKL=1.0000]
Train E51: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4852, E=0.8202,
KL=0.0640, wKL=1.0000]
Train E51: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4852, E=0.8202,
KL=0.0640, wKL=1.0000]
Train E51: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4519, E=0.8181,
KL=0.0644, wKL=1.0000]
Train E51: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4519, E=0.8181,
KL=0.0644, wKL=1.0000]
Train E51: 64%|██████▍ | 16/25 [00:24<00:12, 1.39s/batch, N=1.4425, E=0.8193,
KL=0.0641, wKL=1.0000]
Train E51: 68%|██████▊ | 17/25 [00:24<00:11, 1.38s/batch, N=1.4425, E=0.8193,
KL=0.0641, wKL=1.0000]
Train E51: 68%|██████▊ | 17/25 [00:25<00:11, 1.38s/batch, N=1.5141, E=0.8253,
KL=0.0633, wKL=1.0000]
Train E51: 72%|███████▏ | 18/25 [00:25<00:09, 1.38s/batch, N=1.5141, E=0.8253,
KL=0.0633, wKL=1.0000]
Train E51: 72%|███████▏ | 18/25 [00:26<00:09, 1.38s/batch, N=1.5034, E=0.8206,
KL=0.0640, wKL=1.0000]
Train E51: 76%|███████▌ | 19/25 [00:26<00:08, 1.38s/batch, N=1.5034, E=0.8206,
KL=0.0640, wKL=1.0000]
Train E51: 76%|███████▌ | 19/25 [00:28<00:08, 1.38s/batch, N=1.5178, E=0.8169,
KL=0.0640, wKL=1.0000]
Train E51: 80%|████████ | 20/25 [00:28<00:06, 1.37s/batch, N=1.5178, E=0.8169,
KL=0.0640, wKL=1.0000]
Train E51: 80%|████████ | 20/25 [00:29<00:06, 1.37s/batch, N=1.4794, E=0.8214,
KL=0.0637, wKL=1.0000]
Train E51: 84%|████████▍ | 21/25 [00:29<00:05, 1.38s/batch, N=1.4794, E=0.8214,
KL=0.0637, wKL=1.0000]
Train E51: 84%|████████▍ | 21/25 [00:31<00:05, 1.38s/batch, N=1.4109, E=0.8204,
KL=0.0632, wKL=1.0000]
Train E51: 88%|████████▊ | 22/25 [00:31<00:04, 1.38s/batch, N=1.4109, E=0.8204,
KL=0.0632, wKL=1.0000]
Train E51: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.3979, E=0.8160,
KL=0.0630, wKL=1.0000]
Train E51: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.3979, E=0.8160,
KL=0.0630, wKL=1.0000]
Train E51: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.5633, E=0.8216,
KL=0.0644, wKL=1.0000]
Train E51: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.5633, E=0.8216,
KL=0.0644, wKL=1.0000]
Train E51: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4130, E=0.8244,
KL=0.0629, wKL=1.0000]
Train E51: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.4130, E=0.8244,
KL=0.0629, wKL=1.0000]
Train E51: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4130, E=0.8244,
KL=0.0629, wKL=1.0000]
1835.6s 178 [Epoch 051] Total: 2.3367 | N: 1.4833 | E: 0.8211 | KL(1.00×0.5):
0.0646
1870.5s 179 Train E52: 0%| | 0/25 [00:00<?, ?batch/s]
Train E52: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4659, E=0.8226, KL=0.0637,
wKL=1.0000]
Train E52: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.4659, E=0.8226,
KL=0.0637, wKL=1.0000]
Train E52: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.5334, E=0.8172,
KL=0.0635, wKL=1.0000]
Train E52: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5334, E=0.8172,
KL=0.0635, wKL=1.0000]
Train E52: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.4983, E=0.8172,
KL=0.0642, wKL=1.0000]
Train E52: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4983, E=0.8172,
KL=0.0642, wKL=1.0000]
Train E52: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4641, E=0.8219,
KL=0.0634, wKL=1.0000]
Train E52: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4641, E=0.8219,
KL=0.0634, wKL=1.0000]
Train E52: 16%|█▌ | 4/25 [00:07<00:29, 1.39s/batch, N=1.4626, E=0.8222,
KL=0.0639, wKL=1.0000]
Train E52: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.4626, E=0.8222,
KL=0.0639, wKL=1.0000]
Train E52: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.4692, E=0.8196,
KL=0.0632, wKL=1.0000]
Train E52: 24%|██▍ | 6/25 [00:08<00:27, 1.45s/batch, N=1.4692, E=0.8196,
KL=0.0632, wKL=1.0000]
Train E52: 24%|██▍ | 6/25 [00:10<00:27, 1.45s/batch, N=1.5219, E=0.8206,
KL=0.0635, wKL=1.0000]
Train E52: 28%|██▊ | 7/25 [00:10<00:29, 1.64s/batch, N=1.5219, E=0.8206,
KL=0.0635, wKL=1.0000]
Train E52: 28%|██▊ | 7/25 [00:11<00:29, 1.64s/batch, N=1.4448, E=0.8178,
KL=0.0643, wKL=1.0000]
Train E52: 32%|███▏ | 8/25 [00:11<00:26, 1.57s/batch, N=1.4448, E=0.8178,
KL=0.0643, wKL=1.0000]
Train E52: 32%|███▏ | 8/25 [00:13<00:26, 1.57s/batch, N=1.4479, E=0.8258,
KL=0.0625, wKL=1.0000]
Train E52: 36%|███▌ | 9/25 [00:13<00:24, 1.50s/batch, N=1.4479, E=0.8258,
KL=0.0625, wKL=1.0000]
Train E52: 36%|███▌ | 9/25 [00:14<00:24, 1.50s/batch, N=1.5461, E=0.8251,
KL=0.0629, wKL=1.0000]
Train E52: 40%|████ | 10/25 [00:14<00:22, 1.47s/batch, N=1.5461, E=0.8251,
KL=0.0629, wKL=1.0000]
Train E52: 40%|████ | 10/25 [00:16<00:22, 1.47s/batch, N=1.5123, E=0.8203,
KL=0.0633, wKL=1.0000]
Train E52: 44%|████▍ | 11/25 [00:16<00:20, 1.47s/batch, N=1.5123, E=0.8203,
KL=0.0633, wKL=1.0000]
Train E52: 44%|████▍ | 11/25 [00:17<00:20, 1.47s/batch, N=1.4959, E=0.8209,
KL=0.0630, wKL=1.0000]
Train E52: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.4959, E=0.8209,
KL=0.0630, wKL=1.0000]
Train E52: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.4674, E=0.8174,
KL=0.0633, wKL=1.0000]
Train E52: 52%|█████▏ | 13/25 [00:18<00:17, 1.43s/batch, N=1.4674, E=0.8174,
KL=0.0633, wKL=1.0000]
Train E52: 52%|█████▏ | 13/25 [00:20<00:17, 1.43s/batch, N=1.4535, E=0.8191,
KL=0.0638, wKL=1.0000]
Train E52: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.4535, E=0.8191,
KL=0.0638, wKL=1.0000]
Train E52: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.4727, E=0.8206,
KL=0.0647, wKL=1.0000]
Train E52: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4727, E=0.8206,
KL=0.0647, wKL=1.0000]
Train E52: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4882, E=0.8141,
KL=0.0642, wKL=1.0000]
Train E52: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4882, E=0.8141,
KL=0.0642, wKL=1.0000]
Train E52: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4582, E=0.8223,
KL=0.0633, wKL=1.0000]
Train E52: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4582, E=0.8223,
KL=0.0633, wKL=1.0000]
Train E52: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4778, E=0.8210,
KL=0.0637, wKL=1.0000]
Train E52: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4778, E=0.8210,
KL=0.0637, wKL=1.0000]
Train E52: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.5649, E=0.8214,
KL=0.0632, wKL=1.0000]
Train E52: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5649, E=0.8214,
KL=0.0632, wKL=1.0000]
Train E52: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5156, E=0.8246,
KL=0.0638, wKL=1.0000]
Train E52: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5156, E=0.8246,
KL=0.0638, wKL=1.0000]
Train E52: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.4987, E=0.8233,
KL=0.0633, wKL=1.0000]
Train E52: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.4987, E=0.8233,
KL=0.0633, wKL=1.0000]
Train E52: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.4377, E=0.8216,
KL=0.0640, wKL=1.0000]
Train E52: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4377, E=0.8216,
KL=0.0640, wKL=1.0000]
Train E52: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4873, E=0.8256,
KL=0.0634, wKL=1.0000]
Train E52: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.4873, E=0.8256,
KL=0.0634, wKL=1.0000]
Train E52: 92%|█████████▏| 23/25 [00:34<00:02, 1.38s/batch, N=1.3629, E=0.8216,
KL=0.0623, wKL=1.0000]
Train E52: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.3629, E=0.8216,
KL=0.0623, wKL=1.0000]
Train E52: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5957, E=0.8207,
KL=0.0636, wKL=1.0000]
Train E52: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5957, E=0.8207,
KL=0.0636, wKL=1.0000]
Train E52: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5957, E=0.8207,
KL=0.0636, wKL=1.0000]
1870.5s 180 [Epoch 052] Total: 2.3358 | N: 1.4831 | E: 0.8210 | KL(1.00×0.5):
0.0635
1905.6s 181 Train E53: 0%| | 0/25 [00:00<?, ?batch/s]
Train E53: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4278, E=0.8251, KL=0.0628,
wKL=1.0000]
Train E53: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.4278, E=0.8251,
KL=0.0628, wKL=1.0000]
Train E53: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.4902, E=0.8207,
KL=0.0623, wKL=1.0000]
Train E53: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.4902, E=0.8207,
KL=0.0623, wKL=1.0000]
Train E53: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.5219, E=0.8184,
KL=0.0624, wKL=1.0000]
Train E53: 12%|█▏ | 3/25 [00:04<00:32, 1.49s/batch, N=1.5219, E=0.8184,
KL=0.0624, wKL=1.0000]
Train E53: 12%|█▏ | 3/25 [00:05<00:32, 1.49s/batch, N=1.5057, E=0.8212,
KL=0.0616, wKL=1.0000]
Train E53: 16%|█▌ | 4/25 [00:05<00:30, 1.44s/batch, N=1.5057, E=0.8212,
KL=0.0616, wKL=1.0000]
Train E53: 16%|█▌ | 4/25 [00:07<00:30, 1.44s/batch, N=1.4231, E=0.8216,
KL=0.0610, wKL=1.0000]
Train E53: 20%|██ | 5/25 [00:07<00:28, 1.41s/batch, N=1.4231, E=0.8216,
KL=0.0610, wKL=1.0000]
Train E53: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.4501, E=0.8189,
KL=0.0625, wKL=1.0000]
Train E53: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4501, E=0.8189,
KL=0.0625, wKL=1.0000]
Train E53: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.4824, E=0.8223,
KL=0.0606, wKL=1.0000]
Train E53: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4824, E=0.8223,
KL=0.0606, wKL=1.0000]
Train E53: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4502, E=0.8260,
KL=0.0622, wKL=1.0000]
Train E53: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.4502, E=0.8260,
KL=0.0622, wKL=1.0000]
Train E53: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.4644, E=0.8243,
KL=0.0616, wKL=1.0000]
Train E53: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4644, E=0.8243,
KL=0.0616, wKL=1.0000]
Train E53: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.5173, E=0.8211,
KL=0.0618, wKL=1.0000]
Train E53: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.5173, E=0.8211,
KL=0.0618, wKL=1.0000]
Train E53: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4500, E=0.8231,
KL=0.0623, wKL=1.0000]
Train E53: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4500, E=0.8231,
KL=0.0623, wKL=1.0000]
Train E53: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.5075, E=0.8144,
KL=0.0623, wKL=1.0000]
Train E53: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5075, E=0.8144,
KL=0.0623, wKL=1.0000]
Train E53: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4229, E=0.8179,
KL=0.0625, wKL=1.0000]
Train E53: 52%|█████▏ | 13/25 [00:18<00:19, 1.59s/batch, N=1.4229, E=0.8179,
KL=0.0625, wKL=1.0000]
Train E53: 52%|█████▏ | 13/25 [00:20<00:19, 1.59s/batch, N=1.5100, E=0.8202,
KL=0.0621, wKL=1.0000]
Train E53: 56%|█████▌ | 14/25 [00:20<00:16, 1.54s/batch, N=1.5100, E=0.8202,
KL=0.0621, wKL=1.0000]
Train E53: 56%|█████▌ | 14/25 [00:21<00:16, 1.54s/batch, N=1.5266, E=0.8181,
KL=0.0621, wKL=1.0000]
Train E53: 60%|██████ | 15/25 [00:21<00:14, 1.49s/batch, N=1.5266, E=0.8181,
KL=0.0621, wKL=1.0000]
Train E53: 60%|██████ | 15/25 [00:23<00:14, 1.49s/batch, N=1.4419, E=0.8183,
KL=0.0629, wKL=1.0000]
Train E53: 64%|██████▍ | 16/25 [00:23<00:13, 1.47s/batch, N=1.4419, E=0.8183,
KL=0.0629, wKL=1.0000]
Train E53: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5207, E=0.8210,
KL=0.0617, wKL=1.0000]
Train E53: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5207, E=0.8210,
KL=0.0617, wKL=1.0000]
Train E53: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4742, E=0.8171,
KL=0.0628, wKL=1.0000]
Train E53: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.4742, E=0.8171,
KL=0.0628, wKL=1.0000]
Train E53: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.4699, E=0.8203,
KL=0.0625, wKL=1.0000]
Train E53: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.4699, E=0.8203,
KL=0.0625, wKL=1.0000]
Train E53: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5714, E=0.8255,
KL=0.0630, wKL=1.0000]
Train E53: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5714, E=0.8255,
KL=0.0630, wKL=1.0000]
Train E53: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.4449, E=0.8193,
KL=0.0631, wKL=1.0000]
Train E53: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4449, E=0.8193,
KL=0.0631, wKL=1.0000]
Train E53: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.4569, E=0.8236,
KL=0.0623, wKL=1.0000]
Train E53: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4569, E=0.8236,
KL=0.0623, wKL=1.0000]
Train E53: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4723, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E53: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4723, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E53: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5997, E=0.8220,
KL=0.0630, wKL=1.0000]
Train E53: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5997, E=0.8220,
KL=0.0630, wKL=1.0000]
Train E53: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.4308, E=0.8287,
KL=0.0619, wKL=1.0000]
Train E53: 100%|██████████| 25/25 [00:35<00:00, 1.20s/batch, N=1.4308, E=0.8287,
KL=0.0619, wKL=1.0000]
Train E53: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4308, E=0.8287,
KL=0.0619, wKL=1.0000]
1905.6s 182 [Epoch 053] Total: 2.3348 | N: 1.4825 | E: 0.8211 | KL(1.00×0.5):
0.0623
1940.8s 183 Train E54: 0%| | 0/25 [00:00<?, ?batch/s]
Train E54: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4421, E=0.8257, KL=0.0602,
wKL=1.0000]
Train E54: 4%|▍ | 1/25 [00:01<00:35, 1.47s/batch, N=1.4421, E=0.8257,
KL=0.0602, wKL=1.0000]
Train E54: 4%|▍ | 1/25 [00:02<00:35, 1.47s/batch, N=1.3872, E=0.8171,
KL=0.0609, wKL=1.0000]
Train E54: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.3872, E=0.8171,
KL=0.0609, wKL=1.0000]
Train E54: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.5473, E=0.8245,
KL=0.0618, wKL=1.0000]
Train E54: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5473, E=0.8245,
KL=0.0618, wKL=1.0000]
Train E54: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.5784, E=0.8198,
KL=0.0613, wKL=1.0000]
Train E54: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.5784, E=0.8198,
KL=0.0613, wKL=1.0000]
Train E54: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4545, E=0.8201,
KL=0.0611, wKL=1.0000]
Train E54: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4545, E=0.8201,
KL=0.0611, wKL=1.0000]
Train E54: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5236, E=0.8230,
KL=0.0627, wKL=1.0000]
Train E54: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.5236, E=0.8230,
KL=0.0627, wKL=1.0000]
Train E54: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.4935, E=0.8242,
KL=0.0612, wKL=1.0000]
Train E54: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4935, E=0.8242,
KL=0.0612, wKL=1.0000]
Train E54: 28%|██▊ | 7/25 [00:11<00:24, 1.37s/batch, N=1.4967, E=0.8218,
KL=0.0632, wKL=1.0000]
Train E54: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4967, E=0.8218,
KL=0.0632, wKL=1.0000]
Train E54: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4116, E=0.8214,
KL=0.0614, wKL=1.0000]
Train E54: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4116, E=0.8214,
KL=0.0614, wKL=1.0000]
Train E54: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4566, E=0.8238,
KL=0.0613, wKL=1.0000]
Train E54: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4566, E=0.8238,
KL=0.0613, wKL=1.0000]
Train E54: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4585, E=0.8250,
KL=0.0618, wKL=1.0000]
Train E54: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4585, E=0.8250,
KL=0.0618, wKL=1.0000]
Train E54: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4688, E=0.8234,
KL=0.0623, wKL=1.0000]
Train E54: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.4688, E=0.8234,
KL=0.0623, wKL=1.0000]
Train E54: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.4281, E=0.8225,
KL=0.0605, wKL=1.0000]
Train E54: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4281, E=0.8225,
KL=0.0605, wKL=1.0000]
Train E54: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4751, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E54: 56%|█████▌ | 14/25 [00:19<00:15, 1.44s/batch, N=1.4751, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E54: 56%|█████▌ | 14/25 [00:21<00:15, 1.44s/batch, N=1.4879, E=0.8194,
KL=0.0605, wKL=1.0000]
Train E54: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.4879, E=0.8194,
KL=0.0605, wKL=1.0000]
Train E54: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.5082, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 64%|██████▍ | 16/25 [00:23<00:14, 1.60s/batch, N=1.5082, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 64%|██████▍ | 16/25 [00:24<00:14, 1.60s/batch, N=1.4793, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 68%|██████▊ | 17/25 [00:24<00:12, 1.54s/batch, N=1.4793, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 68%|██████▊ | 17/25 [00:25<00:12, 1.54s/batch, N=1.5570, E=0.8159,
KL=0.0620, wKL=1.0000]
Train E54: 72%|███████▏ | 18/25 [00:25<00:10, 1.52s/batch, N=1.5570, E=0.8159,
KL=0.0620, wKL=1.0000]
Train E54: 72%|███████▏ | 18/25 [00:27<00:10, 1.52s/batch, N=1.4193, E=0.8215,
KL=0.0597, wKL=1.0000]
Train E54: 76%|███████▌ | 19/25 [00:27<00:08, 1.49s/batch, N=1.4193, E=0.8215,
KL=0.0597, wKL=1.0000]
Train E54: 76%|███████▌ | 19/25 [00:28<00:08, 1.49s/batch, N=1.4836, E=0.8162,
KL=0.0612, wKL=1.0000]
Train E54: 80%|████████ | 20/25 [00:28<00:07, 1.46s/batch, N=1.4836, E=0.8162,
KL=0.0612, wKL=1.0000]
Train E54: 80%|████████ | 20/25 [00:30<00:07, 1.46s/batch, N=1.5155, E=0.8202,
KL=0.0619, wKL=1.0000]
Train E54: 84%|████████▍ | 21/25 [00:30<00:05, 1.45s/batch, N=1.5155, E=0.8202,
KL=0.0619, wKL=1.0000]
Train E54: 84%|████████▍ | 21/25 [00:31<00:05, 1.45s/batch, N=1.5202, E=0.8242,
KL=0.0614, wKL=1.0000]
Train E54: 88%|████████▊ | 22/25 [00:31<00:04, 1.48s/batch, N=1.5202, E=0.8242,
KL=0.0614, wKL=1.0000]
Train E54: 88%|████████▊ | 22/25 [00:33<00:04, 1.48s/batch, N=1.4819, E=0.8165,
KL=0.0624, wKL=1.0000]
Train E54: 92%|█████████▏| 23/25 [00:33<00:03, 1.50s/batch, N=1.4819, E=0.8165,
KL=0.0624, wKL=1.0000]
Train E54: 92%|█████████▏| 23/25 [00:34<00:03, 1.50s/batch, N=1.5146, E=0.8249,
KL=0.0610, wKL=1.0000]
Train E54: 96%|█████████▌| 24/25 [00:34<00:01, 1.46s/batch, N=1.5146, E=0.8249,
KL=0.0610, wKL=1.0000]
Train E54: 96%|█████████▌| 24/25 [00:35<00:01, 1.46s/batch, N=1.4522, E=0.8171,
KL=0.0613, wKL=1.0000]
Train E54: 100%|██████████| 25/25 [00:35<00:00, 1.22s/batch, N=1.4522, E=0.8171,
KL=0.0613, wKL=1.0000]
Train E54: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4522, E=0.8171,
KL=0.0613, wKL=1.0000]
1940.9s 184 [Epoch 054] Total: 2.3345 | N: 1.4824 | E: 0.8214 | KL(1.00×0.5):
0.0614
1975.7s 185 Train E55: 0%| | 0/25 [00:00<?, ?batch/s]
Train E55: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4193, E=0.8191, KL=0.0614,
wKL=1.0000]
Train E55: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4193, E=0.8191,
KL=0.0614, wKL=1.0000]
Train E55: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4716, E=0.8212,
KL=0.0607, wKL=1.0000]
Train E55: 8%|▊ | 2/25 [00:02<00:30, 1.35s/batch, N=1.4716, E=0.8212,
KL=0.0607, wKL=1.0000]
Train E55: 8%|▊ | 2/25 [00:04<00:30, 1.35s/batch, N=1.5290, E=0.8176,
KL=0.0618, wKL=1.0000]
Train E55: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5290, E=0.8176,
KL=0.0618, wKL=1.0000]
Train E55: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4785, E=0.8136,
KL=0.0613, wKL=1.0000]
Train E55: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.4785, E=0.8136,
KL=0.0613, wKL=1.0000]
Train E55: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.4825, E=0.8174,
KL=0.0608, wKL=1.0000]
Train E55: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4825, E=0.8174,
KL=0.0608, wKL=1.0000]
Train E55: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.3969, E=0.8212,
KL=0.0596, wKL=1.0000]
Train E55: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.3969, E=0.8212,
KL=0.0596, wKL=1.0000]
Train E55: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4467, E=0.8203,
KL=0.0601, wKL=1.0000]
Train E55: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4467, E=0.8203,
KL=0.0601, wKL=1.0000]
Train E55: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5206, E=0.8188,
KL=0.0620, wKL=1.0000]
Train E55: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5206, E=0.8188,
KL=0.0620, wKL=1.0000]
Train E55: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5845, E=0.8203,
KL=0.0606, wKL=1.0000]
Train E55: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5845, E=0.8203,
KL=0.0606, wKL=1.0000]
Train E55: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4354, E=0.8240,
KL=0.0603, wKL=1.0000]
Train E55: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4354, E=0.8240,
KL=0.0603, wKL=1.0000]
Train E55: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4740, E=0.8239,
KL=0.0610, wKL=1.0000]
Train E55: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4740, E=0.8239,
KL=0.0610, wKL=1.0000]
Train E55: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.0597, wKL=1.0000]
Train E55: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.0597, wKL=1.0000]
Train E55: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.5621, E=0.8259,
KL=0.0602, wKL=1.0000]
Train E55: 52%|█████▏ | 13/25 [00:17<00:16, 1.38s/batch, N=1.5621, E=0.8259,
KL=0.0602, wKL=1.0000]
Train E55: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.4759, E=0.8193,
KL=0.0604, wKL=1.0000]
Train E55: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4759, E=0.8193,
KL=0.0604, wKL=1.0000]
Train E55: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.4622, E=0.8227,
KL=0.0605, wKL=1.0000]
Train E55: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.4622, E=0.8227,
KL=0.0605, wKL=1.0000]
Train E55: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.4558, E=0.8225,
KL=0.0592, wKL=1.0000]
Train E55: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4558, E=0.8225,
KL=0.0592, wKL=1.0000]
Train E55: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4286, E=0.8178,
KL=0.0605, wKL=1.0000]
Train E55: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4286, E=0.8178,
KL=0.0605, wKL=1.0000]
Train E55: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5827, E=0.8210,
KL=0.0612, wKL=1.0000]
Train E55: 72%|███████▏ | 18/25 [00:24<00:09, 1.39s/batch, N=1.5827, E=0.8210,
KL=0.0612, wKL=1.0000]
Train E55: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4752, E=0.8240,
KL=0.0606, wKL=1.0000]
Train E55: 76%|███████▌ | 19/25 [00:26<00:08, 1.38s/batch, N=1.4752, E=0.8240,
KL=0.0606, wKL=1.0000]
Train E55: 76%|███████▌ | 19/25 [00:28<00:08, 1.38s/batch, N=1.5546, E=0.8258,
KL=0.0615, wKL=1.0000]
Train E55: 80%|████████ | 20/25 [00:28<00:08, 1.61s/batch, N=1.5546, E=0.8258,
KL=0.0615, wKL=1.0000]
Train E55: 80%|████████ | 20/25 [00:29<00:08, 1.61s/batch, N=1.4775, E=0.8217,
KL=0.0603, wKL=1.0000]
Train E55: 84%|████████▍ | 21/25 [00:29<00:06, 1.58s/batch, N=1.4775, E=0.8217,
KL=0.0603, wKL=1.0000]
Train E55: 84%|████████▍ | 21/25 [00:31<00:06, 1.58s/batch, N=1.3895, E=0.8185,
KL=0.0607, wKL=1.0000]
Train E55: 88%|████████▊ | 22/25 [00:31<00:04, 1.51s/batch, N=1.3895, E=0.8185,
KL=0.0607, wKL=1.0000]
Train E55: 88%|████████▊ | 22/25 [00:32<00:04, 1.51s/batch, N=1.5480, E=0.8249,
KL=0.0624, wKL=1.0000]
Train E55: 92%|█████████▏| 23/25 [00:32<00:02, 1.48s/batch, N=1.5480, E=0.8249,
KL=0.0624, wKL=1.0000]
Train E55: 92%|█████████▏| 23/25 [00:34<00:02, 1.48s/batch, N=1.4393, E=0.8189,
KL=0.0590, wKL=1.0000]
Train E55: 96%|█████████▌| 24/25 [00:34<00:01, 1.47s/batch, N=1.4393, E=0.8189,
KL=0.0590, wKL=1.0000]
Train E55: 96%|█████████▌| 24/25 [00:34<00:01, 1.47s/batch, N=1.4660, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E55: 100%|██████████| 25/25 [00:34<00:00, 1.22s/batch, N=1.4660, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E55: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4660, E=0.8250,
KL=0.0587, wKL=1.0000]
1975.7s 186 [Epoch 055] Total: 2.3340 | N: 1.4827 | E: 0.8211 | KL(1.00×0.5):
0.0606
2009.9s 187 Train E56: 0%| | 0/25 [00:00<?, ?batch/s]
Train E56: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4897, E=0.8172, KL=0.0608,
wKL=1.0000]
Train E56: 4%|▍ | 1/25 [00:01<00:33, 1.39s/batch, N=1.4897, E=0.8172,
KL=0.0608, wKL=1.0000]
Train E56: 4%|▍ | 1/25 [00:02<00:33, 1.39s/batch, N=1.5538, E=0.8217,
KL=0.0593, wKL=1.0000]
Train E56: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.5538, E=0.8217,
KL=0.0593, wKL=1.0000]
Train E56: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4686, E=0.8256,
KL=0.0599, wKL=1.0000]
Train E56: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4686, E=0.8256,
KL=0.0599, wKL=1.0000]
Train E56: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.5151, E=0.8162,
KL=0.0611, wKL=1.0000]
Train E56: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5151, E=0.8162,
KL=0.0611, wKL=1.0000]
Train E56: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4412, E=0.8174,
KL=0.0599, wKL=1.0000]
Train E56: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4412, E=0.8174,
KL=0.0599, wKL=1.0000]
Train E56: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4282, E=0.8252,
KL=0.0594, wKL=1.0000]
Train E56: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.4282, E=0.8252,
KL=0.0594, wKL=1.0000]
Train E56: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4506, E=0.8197,
KL=0.0601, wKL=1.0000]
Train E56: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4506, E=0.8197,
KL=0.0601, wKL=1.0000]
Train E56: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4377, E=0.8204,
KL=0.0600, wKL=1.0000]
Train E56: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4377, E=0.8204,
KL=0.0600, wKL=1.0000]
Train E56: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4778, E=0.8185,
KL=0.0595, wKL=1.0000]
Train E56: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4778, E=0.8185,
KL=0.0595, wKL=1.0000]
Train E56: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5146, E=0.8198,
KL=0.0592, wKL=1.0000]
Train E56: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5146, E=0.8198,
KL=0.0592, wKL=1.0000]
Train E56: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.6206, E=0.8271,
KL=0.0609, wKL=1.0000]
Train E56: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.6206, E=0.8271,
KL=0.0609, wKL=1.0000]
Train E56: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4893, E=0.8230,
KL=0.0592, wKL=1.0000]
Train E56: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4893, E=0.8230,
KL=0.0592, wKL=1.0000]
Train E56: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4349, E=0.8215,
KL=0.0590, wKL=1.0000]
Train E56: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4349, E=0.8215,
KL=0.0590, wKL=1.0000]
Train E56: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4728, E=0.8194,
KL=0.0601, wKL=1.0000]
Train E56: 56%|█████▌ | 14/25 [00:19<00:15, 1.37s/batch, N=1.4728, E=0.8194,
KL=0.0601, wKL=1.0000]
Train E56: 56%|█████▌ | 14/25 [00:20<00:15, 1.37s/batch, N=1.4217, E=0.8214,
KL=0.0590, wKL=1.0000]
Train E56: 60%|██████ | 15/25 [00:20<00:13, 1.38s/batch, N=1.4217, E=0.8214,
KL=0.0590, wKL=1.0000]
Train E56: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4738, E=0.8265,
KL=0.0592, wKL=1.0000]
Train E56: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4738, E=0.8265,
KL=0.0592, wKL=1.0000]
Train E56: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4171, E=0.8223,
KL=0.0588, wKL=1.0000]
Train E56: 68%|██████▊ | 17/25 [00:23<00:11, 1.38s/batch, N=1.4171, E=0.8223,
KL=0.0588, wKL=1.0000]
Train E56: 68%|██████▊ | 17/25 [00:25<00:11, 1.38s/batch, N=1.4776, E=0.8200,
KL=0.0592, wKL=1.0000]
Train E56: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.4776, E=0.8200,
KL=0.0592, wKL=1.0000]
Train E56: 72%|███████▏ | 18/25 [00:26<00:10, 1.44s/batch, N=1.5783, E=0.8205,
KL=0.0593, wKL=1.0000]
Train E56: 76%|███████▌ | 19/25 [00:26<00:08, 1.44s/batch, N=1.5783, E=0.8205,
KL=0.0593, wKL=1.0000]
Train E56: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.5008, E=0.8202,
KL=0.0601, wKL=1.0000]
Train E56: 80%|████████ | 20/25 [00:27<00:07, 1.43s/batch, N=1.5008, E=0.8202,
KL=0.0601, wKL=1.0000]
Train E56: 80%|████████ | 20/25 [00:29<00:07, 1.43s/batch, N=1.5469, E=0.8231,
KL=0.0598, wKL=1.0000]
Train E56: 84%|████████▍ | 21/25 [00:29<00:05, 1.43s/batch, N=1.5469, E=0.8231,
KL=0.0598, wKL=1.0000]
Train E56: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.3813, E=0.8227,
KL=0.0578, wKL=1.0000]
Train E56: 88%|████████▊ | 22/25 [00:30<00:04, 1.42s/batch, N=1.3813, E=0.8227,
KL=0.0578, wKL=1.0000]
Train E56: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.4835, E=0.8200,
KL=0.0595, wKL=1.0000]
Train E56: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4835, E=0.8200,
KL=0.0595, wKL=1.0000]
Train E56: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4326, E=0.8203,
KL=0.0588, wKL=1.0000]
Train E56: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.4326, E=0.8203,
KL=0.0588, wKL=1.0000]
Train E56: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.6360, E=0.8177,
KL=0.0595, wKL=1.0000]
Train E56: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.6360, E=0.8177,
KL=0.0595, wKL=1.0000]
Train E56: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.6360, E=0.8177,
KL=0.0595, wKL=1.0000]
2009.9s 188 [Epoch 056] Total: 2.3331 | N: 1.4822 | E: 0.8212 | KL(1.00×0.5):
0.0596
2044.8s 189 Train E57: 0%| | 0/25 [00:00<?, ?batch/s]
Train E57: 0%| | 0/25 [00:02<?, ?batch/s, N=1.4930, E=0.8226, KL=0.0590,
wKL=1.0000]
Train E57: 4%|▍ | 1/25 [00:02<00:48, 2.02s/batch, N=1.4930, E=0.8226,
KL=0.0590, wKL=1.0000]
Train E57: 4%|▍ | 1/25 [00:03<00:48, 2.02s/batch, N=1.4115, E=0.8207,
KL=0.0583, wKL=1.0000]
Train E57: 8%|▊ | 2/25 [00:03<00:38, 1.66s/batch, N=1.4115, E=0.8207,
KL=0.0583, wKL=1.0000]
Train E57: 8%|▊ | 2/25 [00:04<00:38, 1.66s/batch, N=1.4536, E=0.8215,
KL=0.0598, wKL=1.0000]
Train E57: 12%|█▏ | 3/25 [00:04<00:33, 1.52s/batch, N=1.4536, E=0.8215,
KL=0.0598, wKL=1.0000]
Train E57: 12%|█▏ | 3/25 [00:06<00:33, 1.52s/batch, N=1.5131, E=0.8186,
KL=0.0597, wKL=1.0000]
Train E57: 16%|█▌ | 4/25 [00:06<00:30, 1.46s/batch, N=1.5131, E=0.8186,
KL=0.0597, wKL=1.0000]
Train E57: 16%|█▌ | 4/25 [00:07<00:30, 1.46s/batch, N=1.5687, E=0.8208,
KL=0.0592, wKL=1.0000]
Train E57: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.5687, E=0.8208,
KL=0.0592, wKL=1.0000]
Train E57: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.4961, E=0.8180,
KL=0.0589, wKL=1.0000]
Train E57: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.4961, E=0.8180,
KL=0.0589, wKL=1.0000]
Train E57: 24%|██▍ | 6/25 [00:10<00:26, 1.40s/batch, N=1.4652, E=0.8226,
KL=0.0596, wKL=1.0000]
Train E57: 28%|██▊ | 7/25 [00:10<00:25, 1.40s/batch, N=1.4652, E=0.8226,
KL=0.0596, wKL=1.0000]
Train E57: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4354, E=0.8162,
KL=0.0586, wKL=1.0000]
Train E57: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.4354, E=0.8162,
KL=0.0586, wKL=1.0000]
Train E57: 32%|███▏ | 8/25 [00:13<00:24, 1.42s/batch, N=1.4943, E=0.8167,
KL=0.0592, wKL=1.0000]
Train E57: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.4943, E=0.8167,
KL=0.0592, wKL=1.0000]
Train E57: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5461, E=0.8270,
KL=0.0593, wKL=1.0000]
Train E57: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.5461, E=0.8270,
KL=0.0593, wKL=1.0000]
Train E57: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4839, E=0.8195,
KL=0.0601, wKL=1.0000]
Train E57: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4839, E=0.8195,
KL=0.0601, wKL=1.0000]
Train E57: 44%|████▍ | 11/25 [00:17<00:19, 1.39s/batch, N=1.4294, E=0.8251,
KL=0.0602, wKL=1.0000]
Train E57: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4294, E=0.8251,
KL=0.0602, wKL=1.0000]
Train E57: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4766, E=0.8220,
KL=0.0609, wKL=1.0000]
Train E57: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4766, E=0.8220,
KL=0.0609, wKL=1.0000]
Train E57: 52%|█████▏ | 13/25 [00:20<00:16, 1.39s/batch, N=1.5450, E=0.8208,
KL=0.0598, wKL=1.0000]
Train E57: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5450, E=0.8208,
KL=0.0598, wKL=1.0000]
Train E57: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4963, E=0.8199,
KL=0.0607, wKL=1.0000]
Train E57: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4963, E=0.8199,
KL=0.0607, wKL=1.0000]
Train E57: 60%|██████ | 15/25 [00:23<00:14, 1.40s/batch, N=1.5216, E=0.8200,
KL=0.0586, wKL=1.0000]
Train E57: 64%|██████▍ | 16/25 [00:23<00:13, 1.48s/batch, N=1.5216, E=0.8200,
KL=0.0586, wKL=1.0000]
Train E57: 64%|██████▍ | 16/25 [00:24<00:13, 1.48s/batch, N=1.4825, E=0.8172,
KL=0.0588, wKL=1.0000]
Train E57: 68%|██████▊ | 17/25 [00:24<00:11, 1.45s/batch, N=1.4825, E=0.8172,
KL=0.0588, wKL=1.0000]
Train E57: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.5928, E=0.8191,
KL=0.0582, wKL=1.0000]
Train E57: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.5928, E=0.8191,
KL=0.0582, wKL=1.0000]
Train E57: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.4310, E=0.8216,
KL=0.0576, wKL=1.0000]
Train E57: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.4310, E=0.8216,
KL=0.0576, wKL=1.0000]
Train E57: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.4585, E=0.8227,
KL=0.0576, wKL=1.0000]
Train E57: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4585, E=0.8227,
KL=0.0576, wKL=1.0000]
Train E57: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4809, E=0.8252,
KL=0.0576, wKL=1.0000]
Train E57: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4809, E=0.8252,
KL=0.0576, wKL=1.0000]
Train E57: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.3490, E=0.8262,
KL=0.0572, wKL=1.0000]
Train E57: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.3490, E=0.8262,
KL=0.0572, wKL=1.0000]
Train E57: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.5422, E=0.8256,
KL=0.0582, wKL=1.0000]
Train E57: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5422, E=0.8256,
KL=0.0582, wKL=1.0000]
Train E57: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4474, E=0.8198,
KL=0.0576, wKL=1.0000]
Train E57: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4474, E=0.8198,
KL=0.0576, wKL=1.0000]
Train E57: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4220, E=0.8269,
KL=0.0570, wKL=1.0000]
Train E57: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4220, E=0.8269,
KL=0.0570, wKL=1.0000]
Train E57: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4220, E=0.8269,
KL=0.0570, wKL=1.0000]
2044.8s 190 [Epoch 057] Total: 2.3336 | N: 1.4829 | E: 0.8213 | KL(1.00×0.5):
0.0589
2079.9s 191 Train E58: 0%| | 0/25 [00:00<?, ?batch/s]
Train E58: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5280, E=0.8213, KL=0.0587,
wKL=1.0000]
Train E58: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5280, E=0.8213,
KL=0.0587, wKL=1.0000]
Train E58: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4515, E=0.8206,
KL=0.0579, wKL=1.0000]
Train E58: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4515, E=0.8206,
KL=0.0579, wKL=1.0000]
Train E58: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4281, E=0.8224,
KL=0.0595, wKL=1.0000]
Train E58: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4281, E=0.8224,
KL=0.0595, wKL=1.0000]
Train E58: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.3850, E=0.8196,
KL=0.0585, wKL=1.0000]
Train E58: 16%|█▌ | 4/25 [00:05<00:28, 1.35s/batch, N=1.3850, E=0.8196,
KL=0.0585, wKL=1.0000]
Train E58: 16%|█▌ | 4/25 [00:06<00:28, 1.35s/batch, N=1.4562, E=0.8218,
KL=0.0587, wKL=1.0000]
Train E58: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4562, E=0.8218,
KL=0.0587, wKL=1.0000]
Train E58: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5751, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E58: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5751, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E58: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5012, E=0.8229,
KL=0.0596, wKL=1.0000]
Train E58: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5012, E=0.8229,
KL=0.0596, wKL=1.0000]
Train E58: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4887, E=0.8213,
KL=0.0595, wKL=1.0000]
Train E58: 32%|███▏ | 8/25 [00:11<00:27, 1.60s/batch, N=1.4887, E=0.8213,
KL=0.0595, wKL=1.0000]
Train E58: 32%|███▏ | 8/25 [00:13<00:27, 1.60s/batch, N=1.5004, E=0.8208,
KL=0.0599, wKL=1.0000]
Train E58: 36%|███▌ | 9/25 [00:13<00:24, 1.53s/batch, N=1.5004, E=0.8208,
KL=0.0599, wKL=1.0000]
Train E58: 36%|███▌ | 9/25 [00:14<00:24, 1.53s/batch, N=1.5834, E=0.8202,
KL=0.0599, wKL=1.0000]
Train E58: 40%|████ | 10/25 [00:14<00:22, 1.49s/batch, N=1.5834, E=0.8202,
KL=0.0599, wKL=1.0000]
Train E58: 40%|████ | 10/25 [00:15<00:22, 1.49s/batch, N=1.5269, E=0.8251,
KL=0.0597, wKL=1.0000]
Train E58: 44%|████▍ | 11/25 [00:15<00:20, 1.48s/batch, N=1.5269, E=0.8251,
KL=0.0597, wKL=1.0000]
Train E58: 44%|████▍ | 11/25 [00:17<00:20, 1.48s/batch, N=1.5177, E=0.8193,
KL=0.0610, wKL=1.0000]
Train E58: 48%|████▊ | 12/25 [00:17<00:18, 1.46s/batch, N=1.5177, E=0.8193,
KL=0.0610, wKL=1.0000]
Train E58: 48%|████▊ | 12/25 [00:18<00:18, 1.46s/batch, N=1.4307, E=0.8164,
KL=0.0589, wKL=1.0000]
Train E58: 52%|█████▏ | 13/25 [00:18<00:17, 1.43s/batch, N=1.4307, E=0.8164,
KL=0.0589, wKL=1.0000]
Train E58: 52%|█████▏ | 13/25 [00:20<00:17, 1.43s/batch, N=1.4053, E=0.8197,
KL=0.0581, wKL=1.0000]
Train E58: 56%|█████▌ | 14/25 [00:20<00:16, 1.51s/batch, N=1.4053, E=0.8197,
KL=0.0581, wKL=1.0000]
Train E58: 56%|█████▌ | 14/25 [00:21<00:16, 1.51s/batch, N=1.5196, E=0.8211,
KL=0.0577, wKL=1.0000]
Train E58: 60%|██████ | 15/25 [00:21<00:14, 1.48s/batch, N=1.5196, E=0.8211,
KL=0.0577, wKL=1.0000]
Train E58: 60%|██████ | 15/25 [00:23<00:14, 1.48s/batch, N=1.4431, E=0.8264,
KL=0.0567, wKL=1.0000]
Train E58: 64%|██████▍ | 16/25 [00:23<00:13, 1.45s/batch, N=1.4431, E=0.8264,
KL=0.0567, wKL=1.0000]
Train E58: 64%|██████▍ | 16/25 [00:24<00:13, 1.45s/batch, N=1.5717, E=0.8247,
KL=0.0570, wKL=1.0000]
Train E58: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5717, E=0.8247,
KL=0.0570, wKL=1.0000]
Train E58: 68%|██████▊ | 17/25 [00:26<00:11, 1.44s/batch, N=1.4408, E=0.8238,
KL=0.0566, wKL=1.0000]
Train E58: 72%|███████▏ | 18/25 [00:26<00:10, 1.46s/batch, N=1.4408, E=0.8238,
KL=0.0566, wKL=1.0000]
Train E58: 72%|███████▏ | 18/25 [00:27<00:10, 1.46s/batch, N=1.4716, E=0.8234,
KL=0.0567, wKL=1.0000]
Train E58: 76%|███████▌ | 19/25 [00:27<00:08, 1.43s/batch, N=1.4716, E=0.8234,
KL=0.0567, wKL=1.0000]
Train E58: 76%|███████▌ | 19/25 [00:28<00:08, 1.43s/batch, N=1.4371, E=0.8178,
KL=0.0573, wKL=1.0000]
Train E58: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4371, E=0.8178,
KL=0.0573, wKL=1.0000]
Train E58: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4514, E=0.8161,
KL=0.0574, wKL=1.0000]
Train E58: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4514, E=0.8161,
KL=0.0574, wKL=1.0000]
Train E58: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5225, E=0.8175,
KL=0.0582, wKL=1.0000]
Train E58: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5225, E=0.8175,
KL=0.0582, wKL=1.0000]
Train E58: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.4665, E=0.8210,
KL=0.0580, wKL=1.0000]
Train E58: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4665, E=0.8210,
KL=0.0580, wKL=1.0000]
Train E58: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4892, E=0.8182,
KL=0.0583, wKL=1.0000]
Train E58: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4892, E=0.8182,
KL=0.0583, wKL=1.0000]
Train E58: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.4484, E=0.8257,
KL=0.0577, wKL=1.0000]
Train E58: 100%|██████████| 25/25 [00:35<00:00, 1.16s/batch, N=1.4484, E=0.8257,
KL=0.0577, wKL=1.0000]
Train E58: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4484, E=0.8257,
KL=0.0577, wKL=1.0000]
2079.9s 192 [Epoch 058] Total: 2.3328 | N: 1.4824 | E: 0.8212 | KL(1.00×0.5):
0.0584
2114.6s 193 Train E59: 0%| | 0/25 [00:00<?, ?batch/s]
Train E59: 0%| | 0/25 [00:01<?, ?batch/s, N=1.3896, E=0.8201, KL=0.0578,
wKL=1.0000]
Train E59: 4%|▍ | 1/25 [00:01<00:33, 1.39s/batch, N=1.3896, E=0.8201,
KL=0.0578, wKL=1.0000]
Train E59: 4%|▍ | 1/25 [00:02<00:33, 1.39s/batch, N=1.4594, E=0.8187,
KL=0.0567, wKL=1.0000]
Train E59: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4594, E=0.8187,
KL=0.0567, wKL=1.0000]
Train E59: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.3810, E=0.8206,
KL=0.0576, wKL=1.0000]
Train E59: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.3810, E=0.8206,
KL=0.0576, wKL=1.0000]
Train E59: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4191, E=0.8234,
KL=0.0569, wKL=1.0000]
Train E59: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4191, E=0.8234,
KL=0.0569, wKL=1.0000]
Train E59: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.5472, E=0.8217,
KL=0.0569, wKL=1.0000]
Train E59: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5472, E=0.8217,
KL=0.0569, wKL=1.0000]
Train E59: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4681, E=0.8221,
KL=0.0564, wKL=1.0000]
Train E59: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4681, E=0.8221,
KL=0.0564, wKL=1.0000]
Train E59: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4830, E=0.8199,
KL=0.0567, wKL=1.0000]
Train E59: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4830, E=0.8199,
KL=0.0567, wKL=1.0000]
Train E59: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.3956, E=0.8221,
KL=0.0567, wKL=1.0000]
Train E59: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.3956, E=0.8221,
KL=0.0567, wKL=1.0000]
Train E59: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5022, E=0.8210,
KL=0.0589, wKL=1.0000]
Train E59: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5022, E=0.8210,
KL=0.0589, wKL=1.0000]
Train E59: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4354, E=0.8206,
KL=0.0573, wKL=1.0000]
Train E59: 40%|████ | 10/25 [00:13<00:21, 1.40s/batch, N=1.4354, E=0.8206,
KL=0.0573, wKL=1.0000]
Train E59: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.5134, E=0.8223,
KL=0.0580, wKL=1.0000]
Train E59: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.5134, E=0.8223,
KL=0.0580, wKL=1.0000]
Train E59: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4888, E=0.8170,
KL=0.0576, wKL=1.0000]
Train E59: 48%|████▊ | 12/25 [00:16<00:18, 1.44s/batch, N=1.4888, E=0.8170,
KL=0.0576, wKL=1.0000]
Train E59: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.4964, E=0.8205,
KL=0.0575, wKL=1.0000]
Train E59: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4964, E=0.8205,
KL=0.0575, wKL=1.0000]
Train E59: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4565, E=0.8209,
KL=0.0567, wKL=1.0000]
Train E59: 56%|█████▌ | 14/25 [00:20<00:17, 1.61s/batch, N=1.4565, E=0.8209,
KL=0.0567, wKL=1.0000]
Train E59: 56%|█████▌ | 14/25 [00:21<00:17, 1.61s/batch, N=1.5354, E=0.8231,
KL=0.0576, wKL=1.0000]
Train E59: 60%|██████ | 15/25 [00:21<00:15, 1.55s/batch, N=1.5354, E=0.8231,
KL=0.0576, wKL=1.0000]
Train E59: 60%|██████ | 15/25 [00:23<00:15, 1.55s/batch, N=1.5142, E=0.8156,
KL=0.0579, wKL=1.0000]
Train E59: 64%|██████▍ | 16/25 [00:23<00:13, 1.50s/batch, N=1.5142, E=0.8156,
KL=0.0579, wKL=1.0000]
Train E59: 64%|██████▍ | 16/25 [00:24<00:13, 1.50s/batch, N=1.5506, E=0.8194,
KL=0.0585, wKL=1.0000]
Train E59: 68%|██████▊ | 17/25 [00:24<00:11, 1.47s/batch, N=1.5506, E=0.8194,
KL=0.0585, wKL=1.0000]
Train E59: 68%|██████▊ | 17/25 [00:25<00:11, 1.47s/batch, N=1.5020, E=0.8224,
KL=0.0567, wKL=1.0000]
Train E59: 72%|███████▏ | 18/25 [00:25<00:10, 1.45s/batch, N=1.5020, E=0.8224,
KL=0.0567, wKL=1.0000]
Train E59: 72%|███████▏ | 18/25 [00:27<00:10, 1.45s/batch, N=1.4980, E=0.8222,
KL=0.0575, wKL=1.0000]
Train E59: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.4980, E=0.8222,
KL=0.0575, wKL=1.0000]
Train E59: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5125, E=0.8258,
KL=0.0579, wKL=1.0000]
Train E59: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5125, E=0.8258,
KL=0.0579, wKL=1.0000]
Train E59: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.5610, E=0.8221,
KL=0.0561, wKL=1.0000]
Train E59: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.5610, E=0.8221,
KL=0.0561, wKL=1.0000]
Train E59: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.4307, E=0.8185,
KL=0.0560, wKL=1.0000]
Train E59: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.4307, E=0.8185,
KL=0.0560, wKL=1.0000]
Train E59: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5730, E=0.8241,
KL=0.0581, wKL=1.0000]
Train E59: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5730, E=0.8241,
KL=0.0581, wKL=1.0000]
Train E59: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4375, E=0.8229,
KL=0.0565, wKL=1.0000]
Train E59: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4375, E=0.8229,
KL=0.0565, wKL=1.0000]
Train E59: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5060, E=0.8232,
KL=0.0591, wKL=1.0000]
Train E59: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.5060, E=0.8232,
KL=0.0591, wKL=1.0000]
Train E59: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5060, E=0.8232,
KL=0.0591, wKL=1.0000]
2114.6s 194 [Epoch 059] Total: 2.3315 | N: 1.4817 | E: 0.8212 | KL(1.00×0.5):
0.0573
2149.4s 195 Train E60: 0%| | 0/25 [00:00<?, ?batch/s]
Train E60: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5427, E=0.8289, KL=0.0561,
wKL=1.0000]
Train E60: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5427, E=0.8289,
KL=0.0561, wKL=1.0000]
Train E60: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5053, E=0.8214,
KL=0.0567, wKL=1.0000]
Train E60: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5053, E=0.8214,
KL=0.0567, wKL=1.0000]
Train E60: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.4755, E=0.8227,
KL=0.0568, wKL=1.0000]
Train E60: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4755, E=0.8227,
KL=0.0568, wKL=1.0000]
Train E60: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4752, E=0.8237,
KL=0.0571, wKL=1.0000]
Train E60: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4752, E=0.8237,
KL=0.0571, wKL=1.0000]
Train E60: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5050, E=0.8212,
KL=0.0565, wKL=1.0000]
Train E60: 20%|██ | 5/25 [00:06<00:27, 1.40s/batch, N=1.5050, E=0.8212,
KL=0.0565, wKL=1.0000]
Train E60: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.5242, E=0.8195,
KL=0.0579, wKL=1.0000]
Train E60: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5242, E=0.8195,
KL=0.0579, wKL=1.0000]
Train E60: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5417, E=0.8197,
KL=0.0568, wKL=1.0000]
Train E60: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.5417, E=0.8197,
KL=0.0568, wKL=1.0000]
Train E60: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.5324, E=0.8200,
KL=0.0568, wKL=1.0000]
Train E60: 32%|███▏ | 8/25 [00:11<00:23, 1.37s/batch, N=1.5324, E=0.8200,
KL=0.0568, wKL=1.0000]
Train E60: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.5417, E=0.8272,
KL=0.0563, wKL=1.0000]
Train E60: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.5417, E=0.8272,
KL=0.0563, wKL=1.0000]
Train E60: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.4623, E=0.8212,
KL=0.0561, wKL=1.0000]
Train E60: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.4623, E=0.8212,
KL=0.0561, wKL=1.0000]
Train E60: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4679, E=0.8211,
KL=0.0568, wKL=1.0000]
Train E60: 44%|████▍ | 11/25 [00:15<00:20, 1.44s/batch, N=1.4679, E=0.8211,
KL=0.0568, wKL=1.0000]
Train E60: 44%|████▍ | 11/25 [00:16<00:20, 1.44s/batch, N=1.5331, E=0.8231,
KL=0.0570, wKL=1.0000]
Train E60: 48%|████▊ | 12/25 [00:16<00:18, 1.43s/batch, N=1.5331, E=0.8231,
KL=0.0570, wKL=1.0000]
Train E60: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.4464, E=0.8192,
KL=0.0560, wKL=1.0000]
Train E60: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4464, E=0.8192,
KL=0.0560, wKL=1.0000]
Train E60: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5154, E=0.8192,
KL=0.0562, wKL=1.0000]
Train E60: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5154, E=0.8192,
KL=0.0562, wKL=1.0000]
Train E60: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5149, E=0.8218,
KL=0.0563, wKL=1.0000]
Train E60: 60%|██████ | 15/25 [00:21<00:13, 1.39s/batch, N=1.5149, E=0.8218,
KL=0.0563, wKL=1.0000]
Train E60: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4454, E=0.8180,
KL=0.0558, wKL=1.0000]
Train E60: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4454, E=0.8180,
KL=0.0558, wKL=1.0000]
Train E60: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4025, E=0.8182,
KL=0.0555, wKL=1.0000]
Train E60: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.4025, E=0.8182,
KL=0.0555, wKL=1.0000]
Train E60: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4223, E=0.8136,
KL=0.0567, wKL=1.0000]
Train E60: 72%|███████▏ | 18/25 [00:25<00:09, 1.38s/batch, N=1.4223, E=0.8136,
KL=0.0567, wKL=1.0000]
Train E60: 72%|███████▏ | 18/25 [00:27<00:09, 1.38s/batch, N=1.4164, E=0.8181,
KL=0.0570, wKL=1.0000]
Train E60: 76%|███████▌ | 19/25 [00:27<00:09, 1.59s/batch, N=1.4164, E=0.8181,
KL=0.0570, wKL=1.0000]
Train E60: 76%|███████▌ | 19/25 [00:28<00:09, 1.59s/batch, N=1.5217, E=0.8194,
KL=0.0584, wKL=1.0000]
Train E60: 80%|████████ | 20/25 [00:28<00:07, 1.52s/batch, N=1.5217, E=0.8194,
KL=0.0584, wKL=1.0000]
Train E60: 80%|████████ | 20/25 [00:29<00:07, 1.52s/batch, N=1.3921, E=0.8245,
KL=0.0562, wKL=1.0000]
Train E60: 84%|████████▍ | 21/25 [00:29<00:05, 1.47s/batch, N=1.3921, E=0.8245,
KL=0.0562, wKL=1.0000]
Train E60: 84%|████████▍ | 21/25 [00:31<00:05, 1.47s/batch, N=1.4009, E=0.8258,
KL=0.0587, wKL=1.0000]
Train E60: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.4009, E=0.8258,
KL=0.0587, wKL=1.0000]
Train E60: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.5431, E=0.8212,
KL=0.0569, wKL=1.0000]
Train E60: 92%|█████████▏| 23/25 [00:32<00:02, 1.43s/batch, N=1.5431, E=0.8212,
KL=0.0569, wKL=1.0000]
Train E60: 92%|█████████▏| 23/25 [00:34<00:02, 1.43s/batch, N=1.4008, E=0.8196,
KL=0.0565, wKL=1.0000]
Train E60: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4008, E=0.8196,
KL=0.0565, wKL=1.0000]
Train E60: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5456, E=0.8192,
KL=0.0580, wKL=1.0000]
Train E60: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5456, E=0.8192,
KL=0.0580, wKL=1.0000]
Train E60: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5456, E=0.8192,
KL=0.0580, wKL=1.0000]
2149.4s 196 [Epoch 060] Total: 2.3310 | N: 1.4815 | E: 0.8212 | KL(1.00×0.5):
0.0567
2149.4s 197 Saved checkpoint: /kaggle/working/checkpoints/gvae_60_epoch060.pt
2183.8s 198 Train E61: 0%| | 0/25 [00:00<?, ?batch/s]
Train E61: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4862, E=0.8194, KL=0.0564,
wKL=1.0000]
Train E61: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.4862, E=0.8194,
KL=0.0564, wKL=1.0000]
Train E61: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.5512, E=0.8199,
KL=0.0557, wKL=1.0000]
Train E61: 8%|▊ | 2/25 [00:02<00:32, 1.40s/batch, N=1.5512, E=0.8199,
KL=0.0557, wKL=1.0000]
Train E61: 8%|▊ | 2/25 [00:04<00:32, 1.40s/batch, N=1.5592, E=0.8242,
KL=0.0570, wKL=1.0000]
Train E61: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5592, E=0.8242,
KL=0.0570, wKL=1.0000]
Train E61: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.4455, E=0.8231,
KL=0.0552, wKL=1.0000]
Train E61: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4455, E=0.8231,
KL=0.0552, wKL=1.0000]
Train E61: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5418, E=0.8181,
KL=0.0560, wKL=1.0000]
Train E61: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5418, E=0.8181,
KL=0.0560, wKL=1.0000]
Train E61: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.5273, E=0.8180,
KL=0.0562, wKL=1.0000]
Train E61: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5273, E=0.8180,
KL=0.0562, wKL=1.0000]
Train E61: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5232, E=0.8243,
KL=0.0561, wKL=1.0000]
Train E61: 28%|██▊ | 7/25 [00:09<00:26, 1.45s/batch, N=1.5232, E=0.8243,
KL=0.0561, wKL=1.0000]
Train E61: 28%|██▊ | 7/25 [00:11<00:26, 1.45s/batch, N=1.5159, E=0.8253,
KL=0.0560, wKL=1.0000]
Train E61: 32%|███▏ | 8/25 [00:11<00:25, 1.48s/batch, N=1.5159, E=0.8253,
KL=0.0560, wKL=1.0000]
Train E61: 32%|███▏ | 8/25 [00:12<00:25, 1.48s/batch, N=1.4236, E=0.8191,
KL=0.0556, wKL=1.0000]
Train E61: 36%|███▌ | 9/25 [00:12<00:23, 1.45s/batch, N=1.4236, E=0.8191,
KL=0.0556, wKL=1.0000]
Train E61: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.4462, E=0.8208,
KL=0.0564, wKL=1.0000]
Train E61: 40%|████ | 10/25 [00:14<00:21, 1.43s/batch, N=1.4462, E=0.8208,
KL=0.0564, wKL=1.0000]
Train E61: 40%|████ | 10/25 [00:15<00:21, 1.43s/batch, N=1.4677, E=0.8263,
KL=0.0557, wKL=1.0000]
Train E61: 44%|████▍ | 11/25 [00:15<00:19, 1.43s/batch, N=1.4677, E=0.8263,
KL=0.0557, wKL=1.0000]
Train E61: 44%|████▍ | 11/25 [00:17<00:19, 1.43s/batch, N=1.4512, E=0.8240,
KL=0.0555, wKL=1.0000]
Train E61: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4512, E=0.8240,
KL=0.0555, wKL=1.0000]
Train E61: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5160, E=0.8220,
KL=0.0566, wKL=1.0000]
Train E61: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5160, E=0.8220,
KL=0.0566, wKL=1.0000]
Train E61: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5105, E=0.8250,
KL=0.0580, wKL=1.0000]
Train E61: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.5105, E=0.8250,
KL=0.0580, wKL=1.0000]
Train E61: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4069, E=0.8171,
KL=0.0553, wKL=1.0000]
Train E61: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.4069, E=0.8171,
KL=0.0553, wKL=1.0000]
Train E61: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5228, E=0.8184,
KL=0.0563, wKL=1.0000]
Train E61: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.5228, E=0.8184,
KL=0.0563, wKL=1.0000]
Train E61: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4608, E=0.8207,
KL=0.0554, wKL=1.0000]
Train E61: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4608, E=0.8207,
KL=0.0554, wKL=1.0000]
Train E61: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5074, E=0.8205,
KL=0.0553, wKL=1.0000]
Train E61: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5074, E=0.8205,
KL=0.0553, wKL=1.0000]
Train E61: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4420, E=0.8201,
KL=0.0554, wKL=1.0000]
Train E61: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4420, E=0.8201,
KL=0.0554, wKL=1.0000]
Train E61: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5041, E=0.8161,
KL=0.0564, wKL=1.0000]
Train E61: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5041, E=0.8161,
KL=0.0564, wKL=1.0000]
Train E61: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.3591, E=0.8210,
KL=0.0553, wKL=1.0000]
Train E61: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.3591, E=0.8210,
KL=0.0553, wKL=1.0000]
Train E61: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4759, E=0.8187,
KL=0.0556, wKL=1.0000]
Train E61: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.4759, E=0.8187,
KL=0.0556, wKL=1.0000]
Train E61: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4933, E=0.8246,
KL=0.0566, wKL=1.0000]
Train E61: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.4933, E=0.8246,
KL=0.0566, wKL=1.0000]
Train E61: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4116, E=0.8201,
KL=0.0560, wKL=1.0000]
Train E61: 96%|█████████▌| 24/25 [00:33<00:01, 1.41s/batch, N=1.4116, E=0.8201,
KL=0.0560, wKL=1.0000]
Train E61: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4995, E=0.8214,
KL=0.0559, wKL=1.0000]
Train E61: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.4995, E=0.8214,
KL=0.0559, wKL=1.0000]
Train E61: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4995, E=0.8214,
KL=0.0559, wKL=1.0000]
2183.8s 199 [Epoch 061] Total: 2.3306 | N: 1.4815 | E: 0.8211 | KL(1.00×0.5):
0.0560
2218.9s 200 Train E62: 0%| | 0/25 [00:00<?, ?batch/s]
Train E62: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5869, E=0.8184, KL=0.0562,
wKL=1.0000]
Train E62: 4%|▍ | 1/25 [00:01<00:46, 1.94s/batch, N=1.5869, E=0.8184,
KL=0.0562, wKL=1.0000]
Train E62: 4%|▍ | 1/25 [00:03<00:46, 1.94s/batch, N=1.5100, E=0.8260,
KL=0.0556, wKL=1.0000]
Train E62: 8%|▊ | 2/25 [00:03<00:36, 1.60s/batch, N=1.5100, E=0.8260,
KL=0.0556, wKL=1.0000]
Train E62: 8%|▊ | 2/25 [00:04<00:36, 1.60s/batch, N=1.4725, E=0.8156,
KL=0.0555, wKL=1.0000]
Train E62: 12%|█▏ | 3/25 [00:04<00:33, 1.50s/batch, N=1.4725, E=0.8156,
KL=0.0555, wKL=1.0000]
Train E62: 12%|█▏ | 3/25 [00:06<00:33, 1.50s/batch, N=1.4697, E=0.8195,
KL=0.0550, wKL=1.0000]
Train E62: 16%|█▌ | 4/25 [00:06<00:30, 1.45s/batch, N=1.4697, E=0.8195,
KL=0.0550, wKL=1.0000]
Train E62: 16%|█▌ | 4/25 [00:07<00:30, 1.45s/batch, N=1.3685, E=0.8214,
KL=0.0550, wKL=1.0000]
Train E62: 20%|██ | 5/25 [00:07<00:31, 1.57s/batch, N=1.3685, E=0.8214,
KL=0.0550, wKL=1.0000]
Train E62: 20%|██ | 5/25 [00:09<00:31, 1.57s/batch, N=1.4554, E=0.8246,
KL=0.0550, wKL=1.0000]
Train E62: 24%|██▍ | 6/25 [00:09<00:28, 1.51s/batch, N=1.4554, E=0.8246,
KL=0.0550, wKL=1.0000]
Train E62: 24%|██▍ | 6/25 [00:10<00:28, 1.51s/batch, N=1.5100, E=0.8221,
KL=0.0560, wKL=1.0000]
Train E62: 28%|██▊ | 7/25 [00:10<00:26, 1.47s/batch, N=1.5100, E=0.8221,
KL=0.0560, wKL=1.0000]
Train E62: 28%|██▊ | 7/25 [00:12<00:26, 1.47s/batch, N=1.4020, E=0.8235,
KL=0.0549, wKL=1.0000]
Train E62: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.4020, E=0.8235,
KL=0.0549, wKL=1.0000]
Train E62: 32%|███▏ | 8/25 [00:13<00:24, 1.45s/batch, N=1.5667, E=0.8249,
KL=0.0552, wKL=1.0000]
Train E62: 36%|███▌ | 9/25 [00:13<00:22, 1.42s/batch, N=1.5667, E=0.8249,
KL=0.0552, wKL=1.0000]
Train E62: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.3666, E=0.8176,
KL=0.0546, wKL=1.0000]
Train E62: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.3666, E=0.8176,
KL=0.0546, wKL=1.0000]
Train E62: 40%|████ | 10/25 [00:16<00:21, 1.42s/batch, N=1.5454, E=0.8205,
KL=0.0557, wKL=1.0000]
Train E62: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5454, E=0.8205,
KL=0.0557, wKL=1.0000]
Train E62: 44%|████▍ | 11/25 [00:17<00:19, 1.41s/batch, N=1.3799, E=0.8154,
KL=0.0552, wKL=1.0000]
Train E62: 48%|████▊ | 12/25 [00:17<00:18, 1.42s/batch, N=1.3799, E=0.8154,
KL=0.0552, wKL=1.0000]
Train E62: 48%|████▊ | 12/25 [00:19<00:18, 1.42s/batch, N=1.4801, E=0.8241,
KL=0.0553, wKL=1.0000]
Train E62: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4801, E=0.8241,
KL=0.0553, wKL=1.0000]
Train E62: 52%|█████▏ | 13/25 [00:20<00:16, 1.40s/batch, N=1.4903, E=0.8225,
KL=0.0550, wKL=1.0000]
Train E62: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.4903, E=0.8225,
KL=0.0550, wKL=1.0000]
Train E62: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5751, E=0.8206,
KL=0.0559, wKL=1.0000]
Train E62: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5751, E=0.8206,
KL=0.0559, wKL=1.0000]
Train E62: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4334, E=0.8185,
KL=0.0555, wKL=1.0000]
Train E62: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4334, E=0.8185,
KL=0.0555, wKL=1.0000]
Train E62: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.4481, E=0.8206,
KL=0.0542, wKL=1.0000]
Train E62: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4481, E=0.8206,
KL=0.0542, wKL=1.0000]
Train E62: 68%|██████▊ | 17/25 [00:26<00:11, 1.41s/batch, N=1.5558, E=0.8232,
KL=0.0543, wKL=1.0000]
Train E62: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.5558, E=0.8232,
KL=0.0543, wKL=1.0000]
Train E62: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4670, E=0.8234,
KL=0.0563, wKL=1.0000]
Train E62: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.4670, E=0.8234,
KL=0.0563, wKL=1.0000]
Train E62: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4810, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E62: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4810, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E62: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.4700, E=0.8230,
KL=0.0560, wKL=1.0000]
Train E62: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4700, E=0.8230,
KL=0.0560, wKL=1.0000]
Train E62: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5159, E=0.8183,
KL=0.0559, wKL=1.0000]
Train E62: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5159, E=0.8183,
KL=0.0559, wKL=1.0000]
Train E62: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.5283, E=0.8245,
KL=0.0553, wKL=1.0000]
Train E62: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.5283, E=0.8245,
KL=0.0553, wKL=1.0000]
Train E62: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4722, E=0.8216,
KL=0.0559, wKL=1.0000]
Train E62: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4722, E=0.8216,
KL=0.0559, wKL=1.0000]
Train E62: 96%|█████████▌| 24/25 [00:35<00:01, 1.39s/batch, N=1.4968, E=0.8209,
KL=0.0600, wKL=1.0000]
Train E62: 100%|██████████| 25/25 [00:35<00:00, 1.15s/batch, N=1.4968, E=0.8209,
KL=0.0600, wKL=1.0000]
Train E62: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4968, E=0.8209,
KL=0.0600, wKL=1.0000]
2218.9s 201 [Epoch 062] Total: 2.3305 | N: 1.4815 | E: 0.8212 | KL(1.00×0.5):
0.0554
2254.0s 202 Train E63: 0%| | 0/25 [00:00<?, ?batch/s]
Train E63: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4612, E=0.8169, KL=0.0559,
wKL=1.0000]
Train E63: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4612, E=0.8169,
KL=0.0559, wKL=1.0000]
Train E63: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5522, E=0.8183,
KL=0.0558, wKL=1.0000]
Train E63: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.5522, E=0.8183,
KL=0.0558, wKL=1.0000]
Train E63: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4666, E=0.8217,
KL=0.0548, wKL=1.0000]
Train E63: 12%|█▏ | 3/25 [00:04<00:31, 1.44s/batch, N=1.4666, E=0.8217,
KL=0.0548, wKL=1.0000]
Train E63: 12%|█▏ | 3/25 [00:05<00:31, 1.44s/batch, N=1.3776, E=0.8235,
KL=0.0553, wKL=1.0000]
Train E63: 16%|█▌ | 4/25 [00:05<00:30, 1.43s/batch, N=1.3776, E=0.8235,
KL=0.0553, wKL=1.0000]
Train E63: 16%|█▌ | 4/25 [00:07<00:30, 1.43s/batch, N=1.5305, E=0.8228,
KL=0.0562, wKL=1.0000]
Train E63: 20%|██ | 5/25 [00:07<00:28, 1.41s/batch, N=1.5305, E=0.8228,
KL=0.0562, wKL=1.0000]
Train E63: 20%|██ | 5/25 [00:09<00:28, 1.41s/batch, N=1.3883, E=0.8177,
KL=0.0548, wKL=1.0000]
Train E63: 24%|██▍ | 6/25 [00:09<00:30, 1.62s/batch, N=1.3883, E=0.8177,
KL=0.0548, wKL=1.0000]
Train E63: 24%|██▍ | 6/25 [00:10<00:30, 1.62s/batch, N=1.4968, E=0.8264,
KL=0.0542, wKL=1.0000]
Train E63: 28%|██▊ | 7/25 [00:10<00:27, 1.54s/batch, N=1.4968, E=0.8264,
KL=0.0542, wKL=1.0000]
Train E63: 28%|██▊ | 7/25 [00:11<00:27, 1.54s/batch, N=1.4666, E=0.8199,
KL=0.0562, wKL=1.0000]
Train E63: 32%|███▏ | 8/25 [00:11<00:25, 1.51s/batch, N=1.4666, E=0.8199,
KL=0.0562, wKL=1.0000]
Train E63: 32%|███▏ | 8/25 [00:13<00:25, 1.51s/batch, N=1.5110, E=0.8223,
KL=0.0540, wKL=1.0000]
Train E63: 36%|███▌ | 9/25 [00:13<00:23, 1.47s/batch, N=1.5110, E=0.8223,
KL=0.0540, wKL=1.0000]
Train E63: 36%|███▌ | 9/25 [00:14<00:23, 1.47s/batch, N=1.4464, E=0.8208,
KL=0.0551, wKL=1.0000]
Train E63: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.4464, E=0.8208,
KL=0.0551, wKL=1.0000]
Train E63: 40%|████ | 10/25 [00:16<00:21, 1.44s/batch, N=1.4042, E=0.8183,
KL=0.0533, wKL=1.0000]
Train E63: 44%|████▍ | 11/25 [00:16<00:20, 1.44s/batch, N=1.4042, E=0.8183,
KL=0.0533, wKL=1.0000]
Train E63: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.5291, E=0.8190,
KL=0.0546, wKL=1.0000]
Train E63: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.5291, E=0.8190,
KL=0.0546, wKL=1.0000]
Train E63: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.4994, E=0.8217,
KL=0.0532, wKL=1.0000]
Train E63: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4994, E=0.8217,
KL=0.0532, wKL=1.0000]
Train E63: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4757, E=0.8232,
KL=0.0554, wKL=1.0000]
Train E63: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.4757, E=0.8232,
KL=0.0554, wKL=1.0000]
Train E63: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.5515, E=0.8221,
KL=0.0554, wKL=1.0000]
Train E63: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.5515, E=0.8221,
KL=0.0554, wKL=1.0000]
Train E63: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.4774, E=0.8212,
KL=0.0545, wKL=1.0000]
Train E63: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.4774, E=0.8212,
KL=0.0545, wKL=1.0000]
Train E63: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.4834, E=0.8208,
KL=0.0553, wKL=1.0000]
Train E63: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.4834, E=0.8208,
KL=0.0553, wKL=1.0000]
Train E63: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4705, E=0.8189,
KL=0.0550, wKL=1.0000]
Train E63: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4705, E=0.8189,
KL=0.0550, wKL=1.0000]
Train E63: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5043, E=0.8228,
KL=0.0543, wKL=1.0000]
Train E63: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5043, E=0.8228,
KL=0.0543, wKL=1.0000]
Train E63: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4977, E=0.8218,
KL=0.0552, wKL=1.0000]
Train E63: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4977, E=0.8218,
KL=0.0552, wKL=1.0000]
Train E63: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.5200, E=0.8197,
KL=0.0553, wKL=1.0000]
Train E63: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5200, E=0.8197,
KL=0.0553, wKL=1.0000]
Train E63: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4464, E=0.8244,
KL=0.0538, wKL=1.0000]
Train E63: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4464, E=0.8244,
KL=0.0538, wKL=1.0000]
Train E63: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.4687, E=0.8249,
KL=0.0544, wKL=1.0000]
Train E63: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4687, E=0.8249,
KL=0.0544, wKL=1.0000]
Train E63: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.5023, E=0.8226,
KL=0.0538, wKL=1.0000]
Train E63: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5023, E=0.8226,
KL=0.0538, wKL=1.0000]
Train E63: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.5394, E=0.8284,
KL=0.0542, wKL=1.0000]
Train E63: 100%|██████████| 25/25 [00:35<00:00, 1.19s/batch, N=1.5394, E=0.8284,
KL=0.0542, wKL=1.0000]
Train E63: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5394, E=0.8284,
KL=0.0542, wKL=1.0000]
2254.0s 203 [Epoch 063] Total: 2.3302 | N: 1.4813 | E: 0.8214 | KL(1.00×0.5):
0.0548
2288.8s 204 Train E64: 0%| | 0/25 [00:00<?, ?batch/s]
Train E64: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4348, E=0.8238, KL=0.0542,
wKL=1.0000]
Train E64: 4%|▍ | 1/25 [00:01<00:34, 1.42s/batch, N=1.4348, E=0.8238,
KL=0.0542, wKL=1.0000]
Train E64: 4%|▍ | 1/25 [00:02<00:34, 1.42s/batch, N=1.5075, E=0.8223,
KL=0.0544, wKL=1.0000]
Train E64: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.5075, E=0.8223,
KL=0.0544, wKL=1.0000]
Train E64: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4151, E=0.8184,
KL=0.0559, wKL=1.0000]
Train E64: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4151, E=0.8184,
KL=0.0559, wKL=1.0000]
Train E64: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5013, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E64: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.5013, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E64: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.5299, E=0.8214,
KL=0.0540, wKL=1.0000]
Train E64: 20%|██ | 5/25 [00:06<00:27, 1.40s/batch, N=1.5299, E=0.8214,
KL=0.0540, wKL=1.0000]
Train E64: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.4526, E=0.8203,
KL=0.0538, wKL=1.0000]
Train E64: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4526, E=0.8203,
KL=0.0538, wKL=1.0000]
Train E64: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4864, E=0.8193,
KL=0.0550, wKL=1.0000]
Train E64: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4864, E=0.8193,
KL=0.0550, wKL=1.0000]
Train E64: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4620, E=0.8191,
KL=0.0538, wKL=1.0000]
Train E64: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4620, E=0.8191,
KL=0.0538, wKL=1.0000]
Train E64: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4288, E=0.8215,
KL=0.0540, wKL=1.0000]
Train E64: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4288, E=0.8215,
KL=0.0540, wKL=1.0000]
Train E64: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4979, E=0.8187,
KL=0.0544, wKL=1.0000]
Train E64: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4979, E=0.8187,
KL=0.0544, wKL=1.0000]
Train E64: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5414, E=0.8181,
KL=0.0550, wKL=1.0000]
Train E64: 44%|████▍ | 11/25 [00:15<00:22, 1.58s/batch, N=1.5414, E=0.8181,
KL=0.0550, wKL=1.0000]
Train E64: 44%|████▍ | 11/25 [00:17<00:22, 1.58s/batch, N=1.5897, E=0.8265,
KL=0.0552, wKL=1.0000]
Train E64: 48%|████▊ | 12/25 [00:17<00:19, 1.53s/batch, N=1.5897, E=0.8265,
KL=0.0552, wKL=1.0000]
Train E64: 48%|████▊ | 12/25 [00:18<00:19, 1.53s/batch, N=1.4349, E=0.8228,
KL=0.0552, wKL=1.0000]
Train E64: 52%|█████▏ | 13/25 [00:18<00:17, 1.48s/batch, N=1.4349, E=0.8228,
KL=0.0552, wKL=1.0000]
Train E64: 52%|█████▏ | 13/25 [00:20<00:17, 1.48s/batch, N=1.6031, E=0.8219,
KL=0.0543, wKL=1.0000]
Train E64: 56%|█████▌ | 14/25 [00:20<00:15, 1.45s/batch, N=1.6031, E=0.8219,
KL=0.0543, wKL=1.0000]
Train E64: 56%|█████▌ | 14/25 [00:21<00:15, 1.45s/batch, N=1.4090, E=0.8217,
KL=0.0534, wKL=1.0000]
Train E64: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.4090, E=0.8217,
KL=0.0534, wKL=1.0000]
Train E64: 60%|██████ | 15/25 [00:22<00:14, 1.43s/batch, N=1.4542, E=0.8209,
KL=0.0536, wKL=1.0000]
Train E64: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.4542, E=0.8209,
KL=0.0536, wKL=1.0000]
Train E64: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.4342, E=0.8238,
KL=0.0534, wKL=1.0000]
Train E64: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4342, E=0.8238,
KL=0.0534, wKL=1.0000]
Train E64: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5048, E=0.8256,
KL=0.0539, wKL=1.0000]
Train E64: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5048, E=0.8256,
KL=0.0539, wKL=1.0000]
Train E64: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.4602, E=0.8164,
KL=0.0547, wKL=1.0000]
Train E64: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.4602, E=0.8164,
KL=0.0547, wKL=1.0000]
Train E64: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4523, E=0.8222,
KL=0.0540, wKL=1.0000]
Train E64: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4523, E=0.8222,
KL=0.0540, wKL=1.0000]
Train E64: 80%|████████ | 20/25 [00:29<00:06, 1.40s/batch, N=1.4608, E=0.8221,
KL=0.0541, wKL=1.0000]
Train E64: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.4608, E=0.8221,
KL=0.0541, wKL=1.0000]
Train E64: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.5291, E=0.8252,
KL=0.0542, wKL=1.0000]
Train E64: 88%|████████▊ | 22/25 [00:31<00:04, 1.38s/batch, N=1.5291, E=0.8252,
KL=0.0542, wKL=1.0000]
Train E64: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.5219, E=0.8194,
KL=0.0555, wKL=1.0000]
Train E64: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.5219, E=0.8194,
KL=0.0555, wKL=1.0000]
Train E64: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.3813, E=0.8200,
KL=0.0531, wKL=1.0000]
Train E64: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.3813, E=0.8200,
KL=0.0531, wKL=1.0000]
Train E64: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.6153, E=0.8154,
KL=0.0562, wKL=1.0000]
Train E64: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.6153, E=0.8154,
KL=0.0562, wKL=1.0000]
Train E64: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.6153, E=0.8154,
KL=0.0562, wKL=1.0000]
2288.8s 205 [Epoch 064] Total: 2.3297 | N: 1.4812 | E: 0.8213 | KL(1.00×0.5):
0.0544
2323.7s 206 Train E65: 0%| | 0/25 [00:00<?, ?batch/s]
Train E65: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5643, E=0.8207, KL=0.0539,
wKL=1.0000]
Train E65: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5643, E=0.8207,
KL=0.0539, wKL=1.0000]
Train E65: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4792, E=0.8196,
KL=0.0540, wKL=1.0000]
Train E65: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4792, E=0.8196,
KL=0.0540, wKL=1.0000]
Train E65: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.5189, E=0.8176,
KL=0.0541, wKL=1.0000]
Train E65: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.5189, E=0.8176,
KL=0.0541, wKL=1.0000]
Train E65: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4931, E=0.8186,
KL=0.0535, wKL=1.0000]
Train E65: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4931, E=0.8186,
KL=0.0535, wKL=1.0000]
Train E65: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4659, E=0.8245,
KL=0.0528, wKL=1.0000]
Train E65: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4659, E=0.8245,
KL=0.0528, wKL=1.0000]
Train E65: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4153, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E65: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4153, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E65: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4647, E=0.8252,
KL=0.0531, wKL=1.0000]
Train E65: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4647, E=0.8252,
KL=0.0531, wKL=1.0000]
Train E65: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4272, E=0.8183,
KL=0.0534, wKL=1.0000]
Train E65: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4272, E=0.8183,
KL=0.0534, wKL=1.0000]
Train E65: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.3457, E=0.8225,
KL=0.0525, wKL=1.0000]
Train E65: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.3457, E=0.8225,
KL=0.0525, wKL=1.0000]
Train E65: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.4709, E=0.8218,
KL=0.0532, wKL=1.0000]
Train E65: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4709, E=0.8218,
KL=0.0532, wKL=1.0000]
Train E65: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5343, E=0.8225,
KL=0.0534, wKL=1.0000]
Train E65: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5343, E=0.8225,
KL=0.0534, wKL=1.0000]
Train E65: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5450, E=0.8258,
KL=0.0532, wKL=1.0000]
Train E65: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5450, E=0.8258,
KL=0.0532, wKL=1.0000]
Train E65: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5394, E=0.8201,
KL=0.0558, wKL=1.0000]
Train E65: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5394, E=0.8201,
KL=0.0558, wKL=1.0000]
Train E65: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4526, E=0.8231,
KL=0.0533, wKL=1.0000]
Train E65: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.4526, E=0.8231,
KL=0.0533, wKL=1.0000]
Train E65: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4885, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E65: 60%|██████ | 15/25 [00:20<00:13, 1.38s/batch, N=1.4885, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E65: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4469, E=0.8196,
KL=0.0533, wKL=1.0000]
Train E65: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4469, E=0.8196,
KL=0.0533, wKL=1.0000]
Train E65: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4468, E=0.8239,
KL=0.0540, wKL=1.0000]
Train E65: 68%|██████▊ | 17/25 [00:24<00:12, 1.59s/batch, N=1.4468, E=0.8239,
KL=0.0540, wKL=1.0000]
Train E65: 68%|██████▊ | 17/25 [00:25<00:12, 1.59s/batch, N=1.4485, E=0.8205,
KL=0.0534, wKL=1.0000]
Train E65: 72%|███████▏ | 18/25 [00:25<00:10, 1.54s/batch, N=1.4485, E=0.8205,
KL=0.0534, wKL=1.0000]
Train E65: 72%|███████▏ | 18/25 [00:27<00:10, 1.54s/batch, N=1.4383, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E65: 76%|███████▌ | 19/25 [00:27<00:08, 1.49s/batch, N=1.4383, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E65: 76%|███████▌ | 19/25 [00:28<00:08, 1.49s/batch, N=1.5229, E=0.8171,
KL=0.0545, wKL=1.0000]
Train E65: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.5229, E=0.8171,
KL=0.0545, wKL=1.0000]
Train E65: 80%|████████ | 20/25 [00:30<00:07, 1.50s/batch, N=1.4957, E=0.8244,
KL=0.0530, wKL=1.0000]
Train E65: 84%|████████▍ | 21/25 [00:30<00:05, 1.50s/batch, N=1.4957, E=0.8244,
KL=0.0530, wKL=1.0000]
Train E65: 84%|████████▍ | 21/25 [00:31<00:05, 1.50s/batch, N=1.4536, E=0.8238,
KL=0.0544, wKL=1.0000]
Train E65: 88%|████████▊ | 22/25 [00:31<00:04, 1.47s/batch, N=1.4536, E=0.8238,
KL=0.0544, wKL=1.0000]
Train E65: 88%|████████▊ | 22/25 [00:32<00:04, 1.47s/batch, N=1.5599, E=0.8233,
KL=0.0543, wKL=1.0000]
Train E65: 92%|█████████▏| 23/25 [00:32<00:02, 1.46s/batch, N=1.5599, E=0.8233,
KL=0.0543, wKL=1.0000]
Train E65: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5398, E=0.8217,
KL=0.0535, wKL=1.0000]
Train E65: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5398, E=0.8217,
KL=0.0535, wKL=1.0000]
Train E65: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4616, E=0.8173,
KL=0.0525, wKL=1.0000]
Train E65: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.4616, E=0.8173,
KL=0.0525, wKL=1.0000]
Train E65: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4616, E=0.8173,
KL=0.0525, wKL=1.0000]
2323.7s 207 [Epoch 065] Total: 2.3294 | N: 1.4812 | E: 0.8213 | KL(1.00×0.5):
0.0537
2358.3s 208 Train E66: 0%| | 0/25 [00:00<?, ?batch/s]
Train E66: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4298, E=0.8203, KL=0.0520,
wKL=1.0000]
Train E66: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4298, E=0.8203,
KL=0.0520, wKL=1.0000]
Train E66: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4013, E=0.8181,
KL=0.0530, wKL=1.0000]
Train E66: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4013, E=0.8181,
KL=0.0530, wKL=1.0000]
Train E66: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.5090, E=0.8231,
KL=0.0523, wKL=1.0000]
Train E66: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5090, E=0.8231,
KL=0.0523, wKL=1.0000]
Train E66: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4873, E=0.8234,
KL=0.0527, wKL=1.0000]
Train E66: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4873, E=0.8234,
KL=0.0527, wKL=1.0000]
Train E66: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4589, E=0.8169,
KL=0.0539, wKL=1.0000]
Train E66: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4589, E=0.8169,
KL=0.0539, wKL=1.0000]
Train E66: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4133, E=0.8201,
KL=0.0526, wKL=1.0000]
Train E66: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4133, E=0.8201,
KL=0.0526, wKL=1.0000]
Train E66: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.3960, E=0.8222,
KL=0.0519, wKL=1.0000]
Train E66: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.3960, E=0.8222,
KL=0.0519, wKL=1.0000]
Train E66: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4687, E=0.8214,
KL=0.0539, wKL=1.0000]
Train E66: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.4687, E=0.8214,
KL=0.0539, wKL=1.0000]
Train E66: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4718, E=0.8255,
KL=0.0521, wKL=1.0000]
Train E66: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4718, E=0.8255,
KL=0.0521, wKL=1.0000]
Train E66: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.5631, E=0.8215,
KL=0.0534, wKL=1.0000]
Train E66: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5631, E=0.8215,
KL=0.0534, wKL=1.0000]
Train E66: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5146, E=0.8166,
KL=0.0576, wKL=1.0000]
Train E66: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.5146, E=0.8166,
KL=0.0576, wKL=1.0000]
Train E66: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.4594, E=0.8241,
KL=0.0531, wKL=1.0000]
Train E66: 48%|████▊ | 12/25 [00:16<00:17, 1.37s/batch, N=1.4594, E=0.8241,
KL=0.0531, wKL=1.0000]
Train E66: 48%|████▊ | 12/25 [00:17<00:17, 1.37s/batch, N=1.4708, E=0.8167,
KL=0.0530, wKL=1.0000]
Train E66: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4708, E=0.8167,
KL=0.0530, wKL=1.0000]
Train E66: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5087, E=0.8222,
KL=0.0531, wKL=1.0000]
Train E66: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.5087, E=0.8222,
KL=0.0531, wKL=1.0000]
Train E66: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4599, E=0.8217,
KL=0.0525, wKL=1.0000]
Train E66: 60%|██████ | 15/25 [00:20<00:14, 1.40s/batch, N=1.4599, E=0.8217,
KL=0.0525, wKL=1.0000]
Train E66: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.5173, E=0.8200,
KL=0.0530, wKL=1.0000]
Train E66: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5173, E=0.8200,
KL=0.0530, wKL=1.0000]
Train E66: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5032, E=0.8208,
KL=0.0524, wKL=1.0000]
Train E66: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5032, E=0.8208,
KL=0.0524, wKL=1.0000]
Train E66: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4609, E=0.8257,
KL=0.0524, wKL=1.0000]
Train E66: 72%|███████▏ | 18/25 [00:24<00:09, 1.38s/batch, N=1.4609, E=0.8257,
KL=0.0524, wKL=1.0000]
Train E66: 72%|███████▏ | 18/25 [00:27<00:09, 1.38s/batch, N=1.4616, E=0.8238,
KL=0.0531, wKL=1.0000]
Train E66: 76%|███████▌ | 19/25 [00:27<00:09, 1.62s/batch, N=1.4616, E=0.8238,
KL=0.0531, wKL=1.0000]
Train E66: 76%|███████▌ | 19/25 [00:28<00:09, 1.62s/batch, N=1.4918, E=0.8186,
KL=0.0525, wKL=1.0000]
Train E66: 80%|████████ | 20/25 [00:28<00:07, 1.55s/batch, N=1.4918, E=0.8186,
KL=0.0525, wKL=1.0000]
Train E66: 80%|████████ | 20/25 [00:29<00:07, 1.55s/batch, N=1.6526, E=0.8289,
KL=0.0540, wKL=1.0000]
Train E66: 84%|████████▍ | 21/25 [00:29<00:06, 1.50s/batch, N=1.6526, E=0.8289,
KL=0.0540, wKL=1.0000]
Train E66: 84%|████████▍ | 21/25 [00:31<00:06, 1.50s/batch, N=1.4237, E=0.8210,
KL=0.0523, wKL=1.0000]
Train E66: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4237, E=0.8210,
KL=0.0523, wKL=1.0000]
Train E66: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.4858, E=0.8241,
KL=0.0533, wKL=1.0000]
Train E66: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.4858, E=0.8241,
KL=0.0533, wKL=1.0000]
Train E66: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.4710, E=0.8188,
KL=0.0547, wKL=1.0000]
Train E66: 96%|█████████▌| 24/25 [00:33<00:01, 1.42s/batch, N=1.4710, E=0.8188,
KL=0.0547, wKL=1.0000]
Train E66: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.6378, E=0.8216,
KL=0.0533, wKL=1.0000]
Train E66: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.6378, E=0.8216,
KL=0.0533, wKL=1.0000]
Train E66: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.6378, E=0.8216,
KL=0.0533, wKL=1.0000]
2358.3s 209 [Epoch 066] Total: 2.3291 | N: 1.4811 | E: 0.8215 | KL(1.00×0.5):
0.0531
2392.3s 210 Train E67: 0%| | 0/25 [00:00<?, ?batch/s]
Train E67: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4827, E=0.8258, KL=0.0535,
wKL=1.0000]
Train E67: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4827, E=0.8258,
KL=0.0535, wKL=1.0000]
Train E67: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4244, E=0.8231,
KL=0.0531, wKL=1.0000]
Train E67: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4244, E=0.8231,
KL=0.0531, wKL=1.0000]
Train E67: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4834, E=0.8201,
KL=0.0536, wKL=1.0000]
Train E67: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4834, E=0.8201,
KL=0.0536, wKL=1.0000]
Train E67: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5212, E=0.8220,
KL=0.0529, wKL=1.0000]
Train E67: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5212, E=0.8220,
KL=0.0529, wKL=1.0000]
Train E67: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4226, E=0.8193,
KL=0.0518, wKL=1.0000]
Train E67: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4226, E=0.8193,
KL=0.0518, wKL=1.0000]
Train E67: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4276, E=0.8184,
KL=0.0518, wKL=1.0000]
Train E67: 24%|██▍ | 6/25 [00:08<00:26, 1.37s/batch, N=1.4276, E=0.8184,
KL=0.0518, wKL=1.0000]
Train E67: 24%|██▍ | 6/25 [00:09<00:26, 1.37s/batch, N=1.4292, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E67: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4292, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E67: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.5343, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E67: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.5343, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E67: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.4281, E=0.8209,
KL=0.0529, wKL=1.0000]
Train E67: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4281, E=0.8209,
KL=0.0529, wKL=1.0000]
Train E67: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5147, E=0.8228,
KL=0.0534, wKL=1.0000]
Train E67: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.5147, E=0.8228,
KL=0.0534, wKL=1.0000]
Train E67: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4481, E=0.8244,
KL=0.0527, wKL=1.0000]
Train E67: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.4481, E=0.8244,
KL=0.0527, wKL=1.0000]
Train E67: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.5053, E=0.8253,
KL=0.0526, wKL=1.0000]
Train E67: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.5053, E=0.8253,
KL=0.0526, wKL=1.0000]
Train E67: 48%|████▊ | 12/25 [00:17<00:17, 1.38s/batch, N=1.4830, E=0.8228,
KL=0.0526, wKL=1.0000]
Train E67: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4830, E=0.8228,
KL=0.0526, wKL=1.0000]
Train E67: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4451, E=0.8246,
KL=0.0520, wKL=1.0000]
Train E67: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4451, E=0.8246,
KL=0.0520, wKL=1.0000]
Train E67: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5060, E=0.8179,
KL=0.0523, wKL=1.0000]
Train E67: 60%|██████ | 15/25 [00:20<00:13, 1.38s/batch, N=1.5060, E=0.8179,
KL=0.0523, wKL=1.0000]
Train E67: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4106, E=0.8179,
KL=0.0532, wKL=1.0000]
Train E67: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4106, E=0.8179,
KL=0.0532, wKL=1.0000]
Train E67: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4755, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E67: 68%|██████▊ | 17/25 [00:23<00:11, 1.45s/batch, N=1.4755, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E67: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.4910, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E67: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.4910, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E67: 72%|███████▏ | 18/25 [00:26<00:10, 1.43s/batch, N=1.4568, E=0.8254,
KL=0.0536, wKL=1.0000]
Train E67: 76%|███████▌ | 19/25 [00:26<00:08, 1.42s/batch, N=1.4568, E=0.8254,
KL=0.0536, wKL=1.0000]
Train E67: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.6119, E=0.8218,
KL=0.0530, wKL=1.0000]
Train E67: 80%|████████ | 20/25 [00:27<00:07, 1.40s/batch, N=1.6119, E=0.8218,
KL=0.0530, wKL=1.0000]
Train E67: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.4645, E=0.8199,
KL=0.0546, wKL=1.0000]
Train E67: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.4645, E=0.8199,
KL=0.0546, wKL=1.0000]
Train E67: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.5156, E=0.8183,
KL=0.0530, wKL=1.0000]
Train E67: 88%|████████▊ | 22/25 [00:30<00:04, 1.41s/batch, N=1.5156, E=0.8183,
KL=0.0530, wKL=1.0000]
Train E67: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5222, E=0.8220,
KL=0.0531, wKL=1.0000]
Train E67: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5222, E=0.8220,
KL=0.0531, wKL=1.0000]
Train E67: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4979, E=0.8236,
KL=0.0528, wKL=1.0000]
Train E67: 96%|█████████▌| 24/25 [00:33<00:01, 1.39s/batch, N=1.4979, E=0.8236,
KL=0.0528, wKL=1.0000]
Train E67: 96%|█████████▌| 24/25 [00:33<00:01, 1.39s/batch, N=1.5812, E=0.8201,
KL=0.0568, wKL=1.0000]
Train E67: 100%|██████████| 25/25 [00:33<00:00, 1.16s/batch, N=1.5812, E=0.8201,
KL=0.0568, wKL=1.0000]
Train E67: 100%|██████████| 25/25 [00:33<00:00, 1.36s/batch, N=1.5812, E=0.8201,
KL=0.0568, wKL=1.0000]
2392.3s 211 [Epoch 067] Total: 2.3288 | N: 1.4810 | E: 0.8214 | KL(1.00×0.5):
0.0529
2427.1s 212 Train E68: 0%| | 0/25 [00:00<?, ?batch/s]
Train E68: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4609, E=0.8258, KL=0.0529,
wKL=1.0000]
Train E68: 4%|▍ | 1/25 [00:01<00:32, 1.33s/batch, N=1.4609, E=0.8258,
KL=0.0529, wKL=1.0000]
Train E68: 4%|▍ | 1/25 [00:03<00:32, 1.33s/batch, N=1.5097, E=0.8179,
KL=0.0525, wKL=1.0000]
Train E68: 8%|▊ | 2/25 [00:03<00:39, 1.73s/batch, N=1.5097, E=0.8179,
KL=0.0525, wKL=1.0000]
Train E68: 8%|▊ | 2/25 [00:04<00:39, 1.73s/batch, N=1.4122, E=0.8243,
KL=0.0521, wKL=1.0000]
Train E68: 12%|█▏ | 3/25 [00:04<00:34, 1.58s/batch, N=1.4122, E=0.8243,
KL=0.0521, wKL=1.0000]
Train E68: 12%|█▏ | 3/25 [00:06<00:34, 1.58s/batch, N=1.5590, E=0.8228,
KL=0.0527, wKL=1.0000]
Train E68: 16%|█▌ | 4/25 [00:06<00:31, 1.49s/batch, N=1.5590, E=0.8228,
KL=0.0527, wKL=1.0000]
Train E68: 16%|█▌ | 4/25 [00:07<00:31, 1.49s/batch, N=1.4516, E=0.8212,
KL=0.0514, wKL=1.0000]
Train E68: 20%|██ | 5/25 [00:07<00:29, 1.45s/batch, N=1.4516, E=0.8212,
KL=0.0514, wKL=1.0000]
Train E68: 20%|██ | 5/25 [00:08<00:29, 1.45s/batch, N=1.4022, E=0.8233,
KL=0.0535, wKL=1.0000]
Train E68: 24%|██▍ | 6/25 [00:08<00:27, 1.44s/batch, N=1.4022, E=0.8233,
KL=0.0535, wKL=1.0000]
Train E68: 24%|██▍ | 6/25 [00:10<00:27, 1.44s/batch, N=1.5058, E=0.8189,
KL=0.0532, wKL=1.0000]
Train E68: 28%|██▊ | 7/25 [00:10<00:25, 1.42s/batch, N=1.5058, E=0.8189,
KL=0.0532, wKL=1.0000]
Train E68: 28%|██▊ | 7/25 [00:11<00:25, 1.42s/batch, N=1.5012, E=0.8259,
KL=0.0517, wKL=1.0000]
Train E68: 32%|███▏ | 8/25 [00:11<00:24, 1.41s/batch, N=1.5012, E=0.8259,
KL=0.0517, wKL=1.0000]
Train E68: 32%|███▏ | 8/25 [00:13<00:24, 1.41s/batch, N=1.4622, E=0.8236,
KL=0.0515, wKL=1.0000]
Train E68: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4622, E=0.8236,
KL=0.0515, wKL=1.0000]
Train E68: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.4909, E=0.8215,
KL=0.0522, wKL=1.0000]
Train E68: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.4909, E=0.8215,
KL=0.0522, wKL=1.0000]
Train E68: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4530, E=0.8207,
KL=0.0523, wKL=1.0000]
Train E68: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.4530, E=0.8207,
KL=0.0523, wKL=1.0000]
Train E68: 44%|████▍ | 11/25 [00:17<00:19, 1.38s/batch, N=1.4958, E=0.8159,
KL=0.0523, wKL=1.0000]
Train E68: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4958, E=0.8159,
KL=0.0523, wKL=1.0000]
Train E68: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4753, E=0.8223,
KL=0.0512, wKL=1.0000]
Train E68: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4753, E=0.8223,
KL=0.0512, wKL=1.0000]
Train E68: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4815, E=0.8208,
KL=0.0514, wKL=1.0000]
Train E68: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.4815, E=0.8208,
KL=0.0514, wKL=1.0000]
Train E68: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.4878, E=0.8212,
KL=0.0519, wKL=1.0000]
Train E68: 60%|██████ | 15/25 [00:21<00:14, 1.47s/batch, N=1.4878, E=0.8212,
KL=0.0519, wKL=1.0000]
Train E68: 60%|██████ | 15/25 [00:23<00:14, 1.47s/batch, N=1.4496, E=0.8189,
KL=0.0519, wKL=1.0000]
Train E68: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.4496, E=0.8189,
KL=0.0519, wKL=1.0000]
Train E68: 64%|██████▍ | 16/25 [00:24<00:12, 1.44s/batch, N=1.5328, E=0.8224,
KL=0.0528, wKL=1.0000]
Train E68: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.5328, E=0.8224,
KL=0.0528, wKL=1.0000]
Train E68: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4857, E=0.8193,
KL=0.0532, wKL=1.0000]
Train E68: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4857, E=0.8193,
KL=0.0532, wKL=1.0000]
Train E68: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.3985, E=0.8177,
KL=0.0545, wKL=1.0000]
Train E68: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.3985, E=0.8177,
KL=0.0545, wKL=1.0000]
Train E68: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5506, E=0.8257,
KL=0.0534, wKL=1.0000]
Train E68: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.5506, E=0.8257,
KL=0.0534, wKL=1.0000]
Train E68: 80%|████████ | 20/25 [00:30<00:07, 1.43s/batch, N=1.4341, E=0.8177,
KL=0.0529, wKL=1.0000]
Train E68: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4341, E=0.8177,
KL=0.0529, wKL=1.0000]
Train E68: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5287, E=0.8223,
KL=0.0522, wKL=1.0000]
Train E68: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5287, E=0.8223,
KL=0.0522, wKL=1.0000]
Train E68: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5551, E=0.8214,
KL=0.0541, wKL=1.0000]
Train E68: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5551, E=0.8214,
KL=0.0541, wKL=1.0000]
Train E68: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4560, E=0.8220,
KL=0.0520, wKL=1.0000]
Train E68: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4560, E=0.8220,
KL=0.0520, wKL=1.0000]
Train E68: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4847, E=0.8239,
KL=0.0530, wKL=1.0000]
Train E68: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.4847, E=0.8239,
KL=0.0530, wKL=1.0000]
Train E68: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4847, E=0.8239,
KL=0.0530, wKL=1.0000]
2427.1s 213 [Epoch 068] Total: 2.3286 | N: 1.4809 | E: 0.8214 | KL(1.00×0.5):
0.0525
2462.0s 214 Train E69: 0%| | 0/25 [00:00<?, ?batch/s]
Train E69: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4382, E=0.8202, KL=0.0522,
wKL=1.0000]
Train E69: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4382, E=0.8202,
KL=0.0522, wKL=1.0000]
Train E69: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4556, E=0.8193,
KL=0.0529, wKL=1.0000]
Train E69: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.4556, E=0.8193,
KL=0.0529, wKL=1.0000]
Train E69: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.5935, E=0.8257,
KL=0.0529, wKL=1.0000]
Train E69: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5935, E=0.8257,
KL=0.0529, wKL=1.0000]
Train E69: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4248, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E69: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4248, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E69: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.5457, E=0.8222,
KL=0.0524, wKL=1.0000]
Train E69: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5457, E=0.8222,
KL=0.0524, wKL=1.0000]
Train E69: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4750, E=0.8258,
KL=0.0534, wKL=1.0000]
Train E69: 24%|██▍ | 6/25 [00:08<00:26, 1.37s/batch, N=1.4750, E=0.8258,
KL=0.0534, wKL=1.0000]
Train E69: 24%|██▍ | 6/25 [00:10<00:26, 1.37s/batch, N=1.4004, E=0.8263,
KL=0.0506, wKL=1.0000]
Train E69: 28%|██▊ | 7/25 [00:10<00:28, 1.58s/batch, N=1.4004, E=0.8263,
KL=0.0506, wKL=1.0000]
Train E69: 28%|██▊ | 7/25 [00:11<00:28, 1.58s/batch, N=1.4377, E=0.8245,
KL=0.0510, wKL=1.0000]
Train E69: 32%|███▏ | 8/25 [00:11<00:25, 1.52s/batch, N=1.4377, E=0.8245,
KL=0.0510, wKL=1.0000]
Train E69: 32%|███▏ | 8/25 [00:12<00:25, 1.52s/batch, N=1.5388, E=0.8237,
KL=0.0526, wKL=1.0000]
Train E69: 36%|███▌ | 9/25 [00:12<00:23, 1.48s/batch, N=1.5388, E=0.8237,
KL=0.0526, wKL=1.0000]
Train E69: 36%|███▌ | 9/25 [00:14<00:23, 1.48s/batch, N=1.5081, E=0.8257,
KL=0.0527, wKL=1.0000]
Train E69: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.5081, E=0.8257,
KL=0.0527, wKL=1.0000]
Train E69: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4934, E=0.8181,
KL=0.0510, wKL=1.0000]
Train E69: 44%|████▍ | 11/25 [00:15<00:20, 1.44s/batch, N=1.4934, E=0.8181,
KL=0.0510, wKL=1.0000]
Train E69: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.4111, E=0.8219,
KL=0.0517, wKL=1.0000]
Train E69: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.4111, E=0.8219,
KL=0.0517, wKL=1.0000]
Train E69: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.5216, E=0.8197,
KL=0.0519, wKL=1.0000]
Train E69: 52%|█████▏ | 13/25 [00:18<00:17, 1.50s/batch, N=1.5216, E=0.8197,
KL=0.0519, wKL=1.0000]
Train E69: 52%|█████▏ | 13/25 [00:20<00:17, 1.50s/batch, N=1.4524, E=0.8176,
KL=0.0519, wKL=1.0000]
Train E69: 56%|█████▌ | 14/25 [00:20<00:16, 1.46s/batch, N=1.4524, E=0.8176,
KL=0.0519, wKL=1.0000]
Train E69: 56%|█████▌ | 14/25 [00:21<00:16, 1.46s/batch, N=1.4907, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E69: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.4907, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E69: 60%|██████ | 15/25 [00:23<00:14, 1.44s/batch, N=1.4716, E=0.8187,
KL=0.0517, wKL=1.0000]
Train E69: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.4716, E=0.8187,
KL=0.0517, wKL=1.0000]
Train E69: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.3921, E=0.8180,
KL=0.0516, wKL=1.0000]
Train E69: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.3921, E=0.8180,
KL=0.0516, wKL=1.0000]
Train E69: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.5576, E=0.8210,
KL=0.0520, wKL=1.0000]
Train E69: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.5576, E=0.8210,
KL=0.0520, wKL=1.0000]
Train E69: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5280, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E69: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5280, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E69: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5645, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E69: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5645, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E69: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.4365, E=0.8221,
KL=0.0517, wKL=1.0000]
Train E69: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4365, E=0.8221,
KL=0.0517, wKL=1.0000]
Train E69: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.3795, E=0.8195,
KL=0.0504, wKL=1.0000]
Train E69: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.3795, E=0.8195,
KL=0.0504, wKL=1.0000]
Train E69: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5752, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E69: 92%|█████████▏| 23/25 [00:32<00:02, 1.39s/batch, N=1.5752, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E69: 92%|█████████▏| 23/25 [00:34<00:02, 1.39s/batch, N=1.5235, E=0.8222,
KL=0.0520, wKL=1.0000]
Train E69: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5235, E=0.8222,
KL=0.0520, wKL=1.0000]
Train E69: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.3367, E=0.8218,
KL=0.0505, wKL=1.0000]
Train E69: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.3367, E=0.8218,
KL=0.0505, wKL=1.0000]
Train E69: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.3367, E=0.8218,
KL=0.0505, wKL=1.0000]
2462.0s 215 [Epoch 069] Total: 2.3290 | N: 1.4815 | E: 0.8216 | KL(1.00×0.5):
0.0520
2497.0s 216 Train E70: 0%| | 0/25 [00:00<?, ?batch/s]
Train E70: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4894, E=0.8202, KL=0.0532,
wKL=1.0000]
Train E70: 4%|▍ | 1/25 [00:01<00:31, 1.32s/batch, N=1.4894, E=0.8202,
KL=0.0532, wKL=1.0000]
Train E70: 4%|▍ | 1/25 [00:02<00:31, 1.32s/batch, N=1.5078, E=0.8236,
KL=0.0535, wKL=1.0000]
Train E70: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.5078, E=0.8236,
KL=0.0535, wKL=1.0000]
Train E70: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.4210, E=0.8260,
KL=0.0510, wKL=1.0000]
Train E70: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4210, E=0.8260,
KL=0.0510, wKL=1.0000]
Train E70: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4580, E=0.8241,
KL=0.0518, wKL=1.0000]
Train E70: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4580, E=0.8241,
KL=0.0518, wKL=1.0000]
Train E70: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.5560, E=0.8226,
KL=0.0517, wKL=1.0000]
Train E70: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5560, E=0.8226,
KL=0.0517, wKL=1.0000]
Train E70: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4107, E=0.8181,
KL=0.0509, wKL=1.0000]
Train E70: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4107, E=0.8181,
KL=0.0509, wKL=1.0000]
Train E70: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4111, E=0.8203,
KL=0.0513, wKL=1.0000]
Train E70: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4111, E=0.8203,
KL=0.0513, wKL=1.0000]
Train E70: 28%|██▊ | 7/25 [00:10<00:24, 1.39s/batch, N=1.5586, E=0.8208,
KL=0.0528, wKL=1.0000]
Train E70: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.5586, E=0.8208,
KL=0.0528, wKL=1.0000]
Train E70: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4006, E=0.8217,
KL=0.0516, wKL=1.0000]
Train E70: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4006, E=0.8217,
KL=0.0516, wKL=1.0000]
Train E70: 36%|███▌ | 9/25 [00:14<00:22, 1.38s/batch, N=1.4971, E=0.8154,
KL=0.0528, wKL=1.0000]
Train E70: 40%|████ | 10/25 [00:14<00:22, 1.47s/batch, N=1.4971, E=0.8154,
KL=0.0528, wKL=1.0000]
Train E70: 40%|████ | 10/25 [00:15<00:22, 1.47s/batch, N=1.4002, E=0.8201,
KL=0.0516, wKL=1.0000]
Train E70: 44%|████▍ | 11/25 [00:15<00:20, 1.47s/batch, N=1.4002, E=0.8201,
KL=0.0516, wKL=1.0000]
Train E70: 44%|████▍ | 11/25 [00:17<00:20, 1.47s/batch, N=1.4754, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E70: 48%|████▊ | 12/25 [00:17<00:21, 1.63s/batch, N=1.4754, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E70: 48%|████▊ | 12/25 [00:18<00:21, 1.63s/batch, N=1.4908, E=0.8217,
KL=0.0511, wKL=1.0000]
Train E70: 52%|█████▏ | 13/25 [00:18<00:18, 1.57s/batch, N=1.4908, E=0.8217,
KL=0.0511, wKL=1.0000]
Train E70: 52%|█████▏ | 13/25 [00:20<00:18, 1.57s/batch, N=1.4189, E=0.8176,
KL=0.0512, wKL=1.0000]
Train E70: 56%|█████▌ | 14/25 [00:20<00:16, 1.52s/batch, N=1.4189, E=0.8176,
KL=0.0512, wKL=1.0000]
Train E70: 56%|█████▌ | 14/25 [00:21<00:16, 1.52s/batch, N=1.5273, E=0.8194,
KL=0.0522, wKL=1.0000]
Train E70: 60%|██████ | 15/25 [00:21<00:14, 1.48s/batch, N=1.5273, E=0.8194,
KL=0.0522, wKL=1.0000]
Train E70: 60%|██████ | 15/25 [00:23<00:14, 1.48s/batch, N=1.5741, E=0.8246,
KL=0.0512, wKL=1.0000]
Train E70: 64%|██████▍ | 16/25 [00:23<00:13, 1.46s/batch, N=1.5741, E=0.8246,
KL=0.0512, wKL=1.0000]
Train E70: 64%|██████▍ | 16/25 [00:24<00:13, 1.46s/batch, N=1.3660, E=0.8159,
KL=0.0520, wKL=1.0000]
Train E70: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.3660, E=0.8159,
KL=0.0520, wKL=1.0000]
Train E70: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4965, E=0.8236,
KL=0.0509, wKL=1.0000]
Train E70: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4965, E=0.8236,
KL=0.0509, wKL=1.0000]
Train E70: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5127, E=0.8236,
KL=0.0520, wKL=1.0000]
Train E70: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5127, E=0.8236,
KL=0.0520, wKL=1.0000]
Train E70: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5366, E=0.8263,
KL=0.0521, wKL=1.0000]
Train E70: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.5366, E=0.8263,
KL=0.0521, wKL=1.0000]
Train E70: 80%|████████ | 20/25 [00:30<00:07, 1.43s/batch, N=1.5674, E=0.8269,
KL=0.0514, wKL=1.0000]
Train E70: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5674, E=0.8269,
KL=0.0514, wKL=1.0000]
Train E70: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.4979, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E70: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4979, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E70: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5145, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E70: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5145, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E70: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4828, E=0.8227,
KL=0.0514, wKL=1.0000]
Train E70: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4828, E=0.8227,
KL=0.0514, wKL=1.0000]
Train E70: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.3971, E=0.8226,
KL=0.0511, wKL=1.0000]
Train E70: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.3971, E=0.8226,
KL=0.0511, wKL=1.0000]
Train E70: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.3971, E=0.8226,
KL=0.0511, wKL=1.0000]
2497.0s 217 [Epoch 070] Total: 2.3282 | N: 1.4807 | E: 0.8215 | KL(1.00×0.5):
0.0518
2497.0s 218 Saved checkpoint: /kaggle/working/checkpoints/gvae_70_epoch070.pt
2531.7s 219 Train E71: 0%| | 0/25 [00:00<?, ?batch/s]
Train E71: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4885, E=0.8187, KL=0.0518,
wKL=1.0000]
Train E71: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4885, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E71: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5137, E=0.8227,
KL=0.0518, wKL=1.0000]
Train E71: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.5137, E=0.8227,
KL=0.0518, wKL=1.0000]
Train E71: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4715, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E71: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4715, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E71: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4851, E=0.8207,
KL=0.0518, wKL=1.0000]
Train E71: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4851, E=0.8207,
KL=0.0518, wKL=1.0000]
Train E71: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.5717, E=0.8205,
KL=0.0526, wKL=1.0000]
Train E71: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5717, E=0.8205,
KL=0.0526, wKL=1.0000]
Train E71: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4767, E=0.8250,
KL=0.0501, wKL=1.0000]
Train E71: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4767, E=0.8250,
KL=0.0501, wKL=1.0000]
Train E71: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5371, E=0.8274,
KL=0.0509, wKL=1.0000]
Train E71: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.5371, E=0.8274,
KL=0.0509, wKL=1.0000]
Train E71: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5079, E=0.8209,
KL=0.0507, wKL=1.0000]
Train E71: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.5079, E=0.8209,
KL=0.0507, wKL=1.0000]
Train E71: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.4509, E=0.8211,
KL=0.0504, wKL=1.0000]
Train E71: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.4509, E=0.8211,
KL=0.0504, wKL=1.0000]
Train E71: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4039, E=0.8212,
KL=0.0507, wKL=1.0000]
Train E71: 40%|████ | 10/25 [00:14<00:21, 1.43s/batch, N=1.4039, E=0.8212,
KL=0.0507, wKL=1.0000]
Train E71: 40%|████ | 10/25 [00:15<00:21, 1.43s/batch, N=1.5231, E=0.8212,
KL=0.0515, wKL=1.0000]
Train E71: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5231, E=0.8212,
KL=0.0515, wKL=1.0000]
Train E71: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4813, E=0.8233,
KL=0.0516, wKL=1.0000]
Train E71: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4813, E=0.8233,
KL=0.0516, wKL=1.0000]
Train E71: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5059, E=0.8209,
KL=0.0514, wKL=1.0000]
Train E71: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5059, E=0.8209,
KL=0.0514, wKL=1.0000]
Train E71: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4986, E=0.8165,
KL=0.0522, wKL=1.0000]
Train E71: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4986, E=0.8165,
KL=0.0522, wKL=1.0000]
Train E71: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4642, E=0.8193,
KL=0.0522, wKL=1.0000]
Train E71: 60%|██████ | 15/25 [00:21<00:15, 1.58s/batch, N=1.4642, E=0.8193,
KL=0.0522, wKL=1.0000]
Train E71: 60%|██████ | 15/25 [00:22<00:15, 1.58s/batch, N=1.6071, E=0.8248,
KL=0.0518, wKL=1.0000]
Train E71: 64%|██████▍ | 16/25 [00:22<00:13, 1.51s/batch, N=1.6071, E=0.8248,
KL=0.0518, wKL=1.0000]
Train E71: 64%|██████▍ | 16/25 [00:24<00:13, 1.51s/batch, N=1.4109, E=0.8160,
KL=0.0515, wKL=1.0000]
Train E71: 68%|██████▊ | 17/25 [00:24<00:11, 1.50s/batch, N=1.4109, E=0.8160,
KL=0.0515, wKL=1.0000]
Train E71: 68%|██████▊ | 17/25 [00:25<00:11, 1.50s/batch, N=1.4252, E=0.8210,
KL=0.0519, wKL=1.0000]
Train E71: 72%|███████▏ | 18/25 [00:25<00:10, 1.46s/batch, N=1.4252, E=0.8210,
KL=0.0519, wKL=1.0000]
Train E71: 72%|███████▏ | 18/25 [00:27<00:10, 1.46s/batch, N=1.3896, E=0.8222,
KL=0.0506, wKL=1.0000]
Train E71: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.3896, E=0.8222,
KL=0.0506, wKL=1.0000]
Train E71: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.4720, E=0.8213,
KL=0.0513, wKL=1.0000]
Train E71: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4720, E=0.8213,
KL=0.0513, wKL=1.0000]
Train E71: 80%|████████ | 20/25 [00:29<00:07, 1.42s/batch, N=1.4525, E=0.8218,
KL=0.0499, wKL=1.0000]
Train E71: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4525, E=0.8218,
KL=0.0499, wKL=1.0000]
Train E71: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4048, E=0.8229,
KL=0.0488, wKL=1.0000]
Train E71: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4048, E=0.8229,
KL=0.0488, wKL=1.0000]
Train E71: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.5306, E=0.8235,
KL=0.0520, wKL=1.0000]
Train E71: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.5306, E=0.8235,
KL=0.0520, wKL=1.0000]
Train E71: 92%|█████████▏| 23/25 [00:34<00:02, 1.38s/batch, N=1.4583, E=0.8227,
KL=0.0504, wKL=1.0000]
Train E71: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4583, E=0.8227,
KL=0.0504, wKL=1.0000]
Train E71: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4900, E=0.8249,
KL=0.0509, wKL=1.0000]
Train E71: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4900, E=0.8249,
KL=0.0509, wKL=1.0000]
Train E71: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4900, E=0.8249,
KL=0.0509, wKL=1.0000]
2531.7s 220 [Epoch 071] Total: 2.3279 | N: 1.4806 | E: 0.8216 | KL(1.00×0.5):
0.0513
2566.6s 221 Train E72: 0%| | 0/25 [00:00<?, ?batch/s]
Train E72: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5392, E=0.8222, KL=0.0510,
wKL=1.0000]
Train E72: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5392, E=0.8222,
KL=0.0510, wKL=1.0000]
Train E72: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4445, E=0.8234,
KL=0.0521, wKL=1.0000]
Train E72: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4445, E=0.8234,
KL=0.0521, wKL=1.0000]
Train E72: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.5931, E=0.8193,
KL=0.0512, wKL=1.0000]
Train E72: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5931, E=0.8193,
KL=0.0512, wKL=1.0000]
Train E72: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4898, E=0.8196,
KL=0.0515, wKL=1.0000]
Train E72: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4898, E=0.8196,
KL=0.0515, wKL=1.0000]
Train E72: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4524, E=0.8223,
KL=0.0511, wKL=1.0000]
Train E72: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4524, E=0.8223,
KL=0.0511, wKL=1.0000]
Train E72: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5143, E=0.8240,
KL=0.0513, wKL=1.0000]
Train E72: 24%|██▍ | 6/25 [00:08<00:28, 1.49s/batch, N=1.5143, E=0.8240,
KL=0.0513, wKL=1.0000]
Train E72: 24%|██▍ | 6/25 [00:09<00:28, 1.49s/batch, N=1.3630, E=0.8152,
KL=0.0510, wKL=1.0000]
Train E72: 28%|██▊ | 7/25 [00:09<00:26, 1.46s/batch, N=1.3630, E=0.8152,
KL=0.0510, wKL=1.0000]
Train E72: 28%|██▊ | 7/25 [00:11<00:26, 1.46s/batch, N=1.5034, E=0.8159,
KL=0.0514, wKL=1.0000]
Train E72: 32%|███▏ | 8/25 [00:11<00:24, 1.43s/batch, N=1.5034, E=0.8159,
KL=0.0514, wKL=1.0000]
Train E72: 32%|███▏ | 8/25 [00:12<00:24, 1.43s/batch, N=1.4933, E=0.8216,
KL=0.0519, wKL=1.0000]
Train E72: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.4933, E=0.8216,
KL=0.0519, wKL=1.0000]
Train E72: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.4813, E=0.8274,
KL=0.0518, wKL=1.0000]
Train E72: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.4813, E=0.8274,
KL=0.0518, wKL=1.0000]
Train E72: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4800, E=0.8232,
KL=0.0513, wKL=1.0000]
Train E72: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4800, E=0.8232,
KL=0.0513, wKL=1.0000]
Train E72: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4841, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E72: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4841, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E72: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4525, E=0.8244,
KL=0.0521, wKL=1.0000]
Train E72: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4525, E=0.8244,
KL=0.0521, wKL=1.0000]
Train E72: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5348, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E72: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5348, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E72: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4985, E=0.8216,
KL=0.0500, wKL=1.0000]
Train E72: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4985, E=0.8216,
KL=0.0500, wKL=1.0000]
Train E72: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4263, E=0.8124,
KL=0.0503, wKL=1.0000]
Train E72: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4263, E=0.8124,
KL=0.0503, wKL=1.0000]
Train E72: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5210, E=0.8230,
KL=0.0511, wKL=1.0000]
Train E72: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5210, E=0.8230,
KL=0.0511, wKL=1.0000]
Train E72: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.3812, E=0.8220,
KL=0.0502, wKL=1.0000]
Train E72: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.3812, E=0.8220,
KL=0.0502, wKL=1.0000]
Train E72: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.5435, E=0.8246,
KL=0.0517, wKL=1.0000]
Train E72: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.5435, E=0.8246,
KL=0.0517, wKL=1.0000]
Train E72: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4436, E=0.8201,
KL=0.0509, wKL=1.0000]
Train E72: 80%|████████ | 20/25 [00:28<00:07, 1.57s/batch, N=1.4436, E=0.8201,
KL=0.0509, wKL=1.0000]
Train E72: 80%|████████ | 20/25 [00:30<00:07, 1.57s/batch, N=1.4697, E=0.8236,
KL=0.0513, wKL=1.0000]
Train E72: 84%|████████▍ | 21/25 [00:30<00:06, 1.53s/batch, N=1.4697, E=0.8236,
KL=0.0513, wKL=1.0000]
Train E72: 84%|████████▍ | 21/25 [00:31<00:06, 1.53s/batch, N=1.4481, E=0.8239,
KL=0.0509, wKL=1.0000]
Train E72: 88%|████████▊ | 22/25 [00:31<00:04, 1.50s/batch, N=1.4481, E=0.8239,
KL=0.0509, wKL=1.0000]
Train E72: 88%|████████▊ | 22/25 [00:32<00:04, 1.50s/batch, N=1.5138, E=0.8246,
KL=0.0506, wKL=1.0000]
Train E72: 92%|█████████▏| 23/25 [00:32<00:02, 1.47s/batch, N=1.5138, E=0.8246,
KL=0.0506, wKL=1.0000]
Train E72: 92%|█████████▏| 23/25 [00:34<00:02, 1.47s/batch, N=1.4484, E=0.8187,
KL=0.0503, wKL=1.0000]
Train E72: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4484, E=0.8187,
KL=0.0503, wKL=1.0000]
Train E72: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5191, E=0.8198,
KL=0.0500, wKL=1.0000]
Train E72: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.5191, E=0.8198,
KL=0.0500, wKL=1.0000]
Train E72: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5191, E=0.8198,
KL=0.0500, wKL=1.0000]
2566.6s 222 [Epoch 072] Total: 2.3278 | N: 1.4806 | E: 0.8215 | KL(1.00×0.5):
0.0513
2600.9s 223 Train E73: 0%| | 0/25 [00:00<?, ?batch/s]
Train E73: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5606, E=0.8234, KL=0.0511,
wKL=1.0000]
Train E73: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5606, E=0.8234,
KL=0.0511, wKL=1.0000]
Train E73: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5031, E=0.8254,
KL=0.0506, wKL=1.0000]
Train E73: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5031, E=0.8254,
KL=0.0506, wKL=1.0000]
Train E73: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4293, E=0.8217,
KL=0.0494, wKL=1.0000]
Train E73: 12%|█▏ | 3/25 [00:04<00:31, 1.44s/batch, N=1.4293, E=0.8217,
KL=0.0494, wKL=1.0000]
Train E73: 12%|█▏ | 3/25 [00:05<00:31, 1.44s/batch, N=1.4637, E=0.8216,
KL=0.0514, wKL=1.0000]
Train E73: 16%|█▌ | 4/25 [00:05<00:30, 1.47s/batch, N=1.4637, E=0.8216,
KL=0.0514, wKL=1.0000]
Train E73: 16%|█▌ | 4/25 [00:07<00:30, 1.47s/batch, N=1.4619, E=0.8200,
KL=0.0503, wKL=1.0000]
Train E73: 20%|██ | 5/25 [00:07<00:28, 1.44s/batch, N=1.4619, E=0.8200,
KL=0.0503, wKL=1.0000]
Train E73: 20%|██ | 5/25 [00:08<00:28, 1.44s/batch, N=1.4228, E=0.8201,
KL=0.0505, wKL=1.0000]
Train E73: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.4228, E=0.8201,
KL=0.0505, wKL=1.0000]
Train E73: 24%|██▍ | 6/25 [00:09<00:26, 1.42s/batch, N=1.4939, E=0.8231,
KL=0.0539, wKL=1.0000]
Train E73: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4939, E=0.8231,
KL=0.0539, wKL=1.0000]
Train E73: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.5489, E=0.8213,
KL=0.0510, wKL=1.0000]
Train E73: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5489, E=0.8213,
KL=0.0510, wKL=1.0000]
Train E73: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4093, E=0.8269,
KL=0.0517, wKL=1.0000]
Train E73: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4093, E=0.8269,
KL=0.0517, wKL=1.0000]
Train E73: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.4586, E=0.8234,
KL=0.0511, wKL=1.0000]
Train E73: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4586, E=0.8234,
KL=0.0511, wKL=1.0000]
Train E73: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.5106, E=0.8188,
KL=0.0524, wKL=1.0000]
Train E73: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5106, E=0.8188,
KL=0.0524, wKL=1.0000]
Train E73: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5593, E=0.8169,
KL=0.0517, wKL=1.0000]
Train E73: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.5593, E=0.8169,
KL=0.0517, wKL=1.0000]
Train E73: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.5169, E=0.8190,
KL=0.0506, wKL=1.0000]
Train E73: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5169, E=0.8190,
KL=0.0506, wKL=1.0000]
Train E73: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4921, E=0.8224,
KL=0.0503, wKL=1.0000]
Train E73: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4921, E=0.8224,
KL=0.0503, wKL=1.0000]
Train E73: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4549, E=0.8210,
KL=0.0503, wKL=1.0000]
Train E73: 60%|██████ | 15/25 [00:21<00:13, 1.39s/batch, N=1.4549, E=0.8210,
KL=0.0503, wKL=1.0000]
Train E73: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.5237, E=0.8199,
KL=0.0500, wKL=1.0000]
Train E73: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.5237, E=0.8199,
KL=0.0500, wKL=1.0000]
Train E73: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4984, E=0.8209,
KL=0.0505, wKL=1.0000]
Train E73: 68%|██████▊ | 17/25 [00:23<00:11, 1.38s/batch, N=1.4984, E=0.8209,
KL=0.0505, wKL=1.0000]
Train E73: 68%|██████▊ | 17/25 [00:25<00:11, 1.38s/batch, N=1.4418, E=0.8245,
KL=0.0503, wKL=1.0000]
Train E73: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.4418, E=0.8245,
KL=0.0503, wKL=1.0000]
Train E73: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4649, E=0.8229,
KL=0.0504, wKL=1.0000]
Train E73: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4649, E=0.8229,
KL=0.0504, wKL=1.0000]
Train E73: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4549, E=0.8197,
KL=0.0524, wKL=1.0000]
Train E73: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4549, E=0.8197,
KL=0.0524, wKL=1.0000]
Train E73: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.4323, E=0.8183,
KL=0.0504, wKL=1.0000]
Train E73: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4323, E=0.8183,
KL=0.0504, wKL=1.0000]
Train E73: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.4617, E=0.8210,
KL=0.0512, wKL=1.0000]
Train E73: 88%|████████▊ | 22/25 [00:30<00:04, 1.39s/batch, N=1.4617, E=0.8210,
KL=0.0512, wKL=1.0000]
Train E73: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4627, E=0.8226,
KL=0.0499, wKL=1.0000]
Train E73: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4627, E=0.8226,
KL=0.0499, wKL=1.0000]
Train E73: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4611, E=0.8196,
KL=0.0508, wKL=1.0000]
Train E73: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.4611, E=0.8196,
KL=0.0508, wKL=1.0000]
Train E73: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5872, E=0.8274,
KL=0.0519, wKL=1.0000]
Train E73: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5872, E=0.8274,
KL=0.0519, wKL=1.0000]
Train E73: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.5872, E=0.8274,
KL=0.0519, wKL=1.0000]
2600.9s 224 [Epoch 073] Total: 2.3275 | N: 1.4805 | E: 0.8215 | KL(1.00×0.5):
0.0510
2635.8s 225 Train E74: 0%| | 0/25 [00:00<?, ?batch/s]
Train E74: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4255, E=0.8204, KL=0.0494,
wKL=1.0000]
Train E74: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.4255, E=0.8204,
KL=0.0494, wKL=1.0000]
Train E74: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.4596, E=0.8215,
KL=0.0499, wKL=1.0000]
Train E74: 8%|▊ | 2/25 [00:02<00:33, 1.45s/batch, N=1.4596, E=0.8215,
KL=0.0499, wKL=1.0000]
Train E74: 8%|▊ | 2/25 [00:04<00:33, 1.45s/batch, N=1.4867, E=0.8251,
KL=0.0512, wKL=1.0000]
Train E74: 12%|█▏ | 3/25 [00:04<00:30, 1.41s/batch, N=1.4867, E=0.8251,
KL=0.0512, wKL=1.0000]
Train E74: 12%|█▏ | 3/25 [00:05<00:30, 1.41s/batch, N=1.4231, E=0.8200,
KL=0.0516, wKL=1.0000]
Train E74: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4231, E=0.8200,
KL=0.0516, wKL=1.0000]
Train E74: 16%|█▌ | 4/25 [00:07<00:29, 1.39s/batch, N=1.4953, E=0.8210,
KL=0.0507, wKL=1.0000]
Train E74: 20%|██ | 5/25 [00:07<00:32, 1.61s/batch, N=1.4953, E=0.8210,
KL=0.0507, wKL=1.0000]
Train E74: 20%|██ | 5/25 [00:09<00:32, 1.61s/batch, N=1.4717, E=0.8190,
KL=0.0514, wKL=1.0000]
Train E74: 24%|██▍ | 6/25 [00:09<00:29, 1.54s/batch, N=1.4717, E=0.8190,
KL=0.0514, wKL=1.0000]
Train E74: 24%|██▍ | 6/25 [00:10<00:29, 1.54s/batch, N=1.4962, E=0.8263,
KL=0.0507, wKL=1.0000]
Train E74: 28%|██▊ | 7/25 [00:10<00:27, 1.51s/batch, N=1.4962, E=0.8263,
KL=0.0507, wKL=1.0000]
Train E74: 28%|██▊ | 7/25 [00:11<00:27, 1.51s/batch, N=1.4533, E=0.8196,
KL=0.0510, wKL=1.0000]
Train E74: 32%|███▏ | 8/25 [00:11<00:24, 1.46s/batch, N=1.4533, E=0.8196,
KL=0.0510, wKL=1.0000]
Train E74: 32%|███▏ | 8/25 [00:13<00:24, 1.46s/batch, N=1.5435, E=0.8206,
KL=0.0518, wKL=1.0000]
Train E74: 36%|███▌ | 9/25 [00:13<00:23, 1.44s/batch, N=1.5435, E=0.8206,
KL=0.0518, wKL=1.0000]
Train E74: 36%|███▌ | 9/25 [00:14<00:23, 1.44s/batch, N=1.4780, E=0.8170,
KL=0.0509, wKL=1.0000]
Train E74: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4780, E=0.8170,
KL=0.0509, wKL=1.0000]
Train E74: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4446, E=0.8214,
KL=0.0501, wKL=1.0000]
Train E74: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4446, E=0.8214,
KL=0.0501, wKL=1.0000]
Train E74: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.4319, E=0.8190,
KL=0.0500, wKL=1.0000]
Train E74: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4319, E=0.8190,
KL=0.0500, wKL=1.0000]
Train E74: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5470, E=0.8186,
KL=0.0508, wKL=1.0000]
Train E74: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5470, E=0.8186,
KL=0.0508, wKL=1.0000]
Train E74: 52%|█████▏ | 13/25 [00:20<00:16, 1.39s/batch, N=1.4169, E=0.8249,
KL=0.0500, wKL=1.0000]
Train E74: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.4169, E=0.8249,
KL=0.0500, wKL=1.0000]
Train E74: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5818, E=0.8164,
KL=0.0507, wKL=1.0000]
Train E74: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.5818, E=0.8164,
KL=0.0507, wKL=1.0000]
Train E74: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.3786, E=0.8226,
KL=0.0491, wKL=1.0000]
Train E74: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.3786, E=0.8226,
KL=0.0491, wKL=1.0000]
Train E74: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4696, E=0.8205,
KL=0.0497, wKL=1.0000]
Train E74: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4696, E=0.8205,
KL=0.0497, wKL=1.0000]
Train E74: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5118, E=0.8256,
KL=0.0519, wKL=1.0000]
Train E74: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5118, E=0.8256,
KL=0.0519, wKL=1.0000]
Train E74: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5294, E=0.8264,
KL=0.0526, wKL=1.0000]
Train E74: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5294, E=0.8264,
KL=0.0526, wKL=1.0000]
Train E74: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4659, E=0.8189,
KL=0.0497, wKL=1.0000]
Train E74: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4659, E=0.8189,
KL=0.0497, wKL=1.0000]
Train E74: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.5021, E=0.8262,
KL=0.0498, wKL=1.0000]
Train E74: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5021, E=0.8262,
KL=0.0498, wKL=1.0000]
Train E74: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4769, E=0.8176,
KL=0.0514, wKL=1.0000]
Train E74: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4769, E=0.8176,
KL=0.0514, wKL=1.0000]
Train E74: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5423, E=0.8241,
KL=0.0504, wKL=1.0000]
Train E74: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5423, E=0.8241,
KL=0.0504, wKL=1.0000]
Train E74: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4966, E=0.8250,
KL=0.0527, wKL=1.0000]
Train E74: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4966, E=0.8250,
KL=0.0527, wKL=1.0000]
Train E74: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4734, E=0.8194,
KL=0.0516, wKL=1.0000]
Train E74: 100%|██████████| 25/25 [00:34<00:00, 1.19s/batch, N=1.4734, E=0.8194,
KL=0.0516, wKL=1.0000]
Train E74: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4734, E=0.8194,
KL=0.0516, wKL=1.0000]
2635.8s 226 [Epoch 074] Total: 2.3271 | N: 1.4802 | E: 0.8215 | KL(1.00×0.5):
0.0507
2670.4s 227 Train E75: 0%| | 0/25 [00:00<?, ?batch/s]
Train E75: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4984, E=0.8229, KL=0.0501,
wKL=1.0000]
Train E75: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4984, E=0.8229,
KL=0.0501, wKL=1.0000]
Train E75: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5112, E=0.8231,
KL=0.0519, wKL=1.0000]
Train E75: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5112, E=0.8231,
KL=0.0519, wKL=1.0000]
Train E75: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4278, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E75: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4278, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E75: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5089, E=0.8180,
KL=0.0519, wKL=1.0000]
Train E75: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5089, E=0.8180,
KL=0.0519, wKL=1.0000]
Train E75: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5336, E=0.8226,
KL=0.0509, wKL=1.0000]
Train E75: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5336, E=0.8226,
KL=0.0509, wKL=1.0000]
Train E75: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5005, E=0.8233,
KL=0.0508, wKL=1.0000]
Train E75: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5005, E=0.8233,
KL=0.0508, wKL=1.0000]
Train E75: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4836, E=0.8254,
KL=0.0513, wKL=1.0000]
Train E75: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4836, E=0.8254,
KL=0.0513, wKL=1.0000]
Train E75: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.3761, E=0.8194,
KL=0.0495, wKL=1.0000]
Train E75: 32%|███▏ | 8/25 [00:11<00:26, 1.58s/batch, N=1.3761, E=0.8194,
KL=0.0495, wKL=1.0000]
Train E75: 32%|███▏ | 8/25 [00:12<00:26, 1.58s/batch, N=1.4250, E=0.8231,
KL=0.0502, wKL=1.0000]
Train E75: 36%|███▌ | 9/25 [00:12<00:24, 1.51s/batch, N=1.4250, E=0.8231,
KL=0.0502, wKL=1.0000]
Train E75: 36%|███▌ | 9/25 [00:14<00:24, 1.51s/batch, N=1.6067, E=0.8184,
KL=0.0509, wKL=1.0000]
Train E75: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.6067, E=0.8184,
KL=0.0509, wKL=1.0000]
Train E75: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4531, E=0.8261,
KL=0.0503, wKL=1.0000]
Train E75: 44%|████▍ | 11/25 [00:15<00:20, 1.44s/batch, N=1.4531, E=0.8261,
KL=0.0503, wKL=1.0000]
Train E75: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.4294, E=0.8223,
KL=0.0500, wKL=1.0000]
Train E75: 48%|████▊ | 12/25 [00:17<00:18, 1.43s/batch, N=1.4294, E=0.8223,
KL=0.0500, wKL=1.0000]
Train E75: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.5157, E=0.8268,
KL=0.0504, wKL=1.0000]
Train E75: 52%|█████▏ | 13/25 [00:18<00:16, 1.42s/batch, N=1.5157, E=0.8268,
KL=0.0504, wKL=1.0000]
Train E75: 52%|█████▏ | 13/25 [00:19<00:16, 1.42s/batch, N=1.4675, E=0.8186,
KL=0.0502, wKL=1.0000]
Train E75: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.4675, E=0.8186,
KL=0.0502, wKL=1.0000]
Train E75: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4542, E=0.8238,
KL=0.0522, wKL=1.0000]
Train E75: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4542, E=0.8238,
KL=0.0522, wKL=1.0000]
Train E75: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.5485, E=0.8220,
KL=0.0505, wKL=1.0000]
Train E75: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5485, E=0.8220,
KL=0.0505, wKL=1.0000]
Train E75: 64%|██████▍ | 16/25 [00:24<00:12, 1.39s/batch, N=1.4422, E=0.8217,
KL=0.0500, wKL=1.0000]
Train E75: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4422, E=0.8217,
KL=0.0500, wKL=1.0000]
Train E75: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4407, E=0.8212,
KL=0.0504, wKL=1.0000]
Train E75: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4407, E=0.8212,
KL=0.0504, wKL=1.0000]
Train E75: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.5007, E=0.8179,
KL=0.0536, wKL=1.0000]
Train E75: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.5007, E=0.8179,
KL=0.0536, wKL=1.0000]
Train E75: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5921, E=0.8197,
KL=0.0515, wKL=1.0000]
Train E75: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.5921, E=0.8197,
KL=0.0515, wKL=1.0000]
Train E75: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.4277, E=0.8196,
KL=0.0496, wKL=1.0000]
Train E75: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.4277, E=0.8196,
KL=0.0496, wKL=1.0000]
Train E75: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.4748, E=0.8243,
KL=0.0501, wKL=1.0000]
Train E75: 88%|████████▊ | 22/25 [00:31<00:04, 1.43s/batch, N=1.4748, E=0.8243,
KL=0.0501, wKL=1.0000]
Train E75: 88%|████████▊ | 22/25 [00:32<00:04, 1.43s/batch, N=1.5231, E=0.8189,
KL=0.0505, wKL=1.0000]
Train E75: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5231, E=0.8189,
KL=0.0505, wKL=1.0000]
Train E75: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.3832, E=0.8174,
KL=0.0500, wKL=1.0000]
Train E75: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.3832, E=0.8174,
KL=0.0500, wKL=1.0000]
Train E75: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4909, E=0.8258,
KL=0.0498, wKL=1.0000]
Train E75: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4909, E=0.8258,
KL=0.0498, wKL=1.0000]
Train E75: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4909, E=0.8258,
KL=0.0498, wKL=1.0000]
2670.4s 228 [Epoch 075] Total: 2.3274 | N: 1.4804 | E: 0.8217 | KL(1.00×0.5):
0.0507
2705.2s 229 Train E76: 0%| | 0/25 [00:00<?, ?batch/s]
Train E76: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5499, E=0.8181, KL=0.0509,
wKL=1.0000]
Train E76: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5499, E=0.8181,
KL=0.0509, wKL=1.0000]
Train E76: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4101, E=0.8147,
KL=0.0522, wKL=1.0000]
Train E76: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4101, E=0.8147,
KL=0.0522, wKL=1.0000]
Train E76: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.3817, E=0.8211,
KL=0.0489, wKL=1.0000]
Train E76: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.3817, E=0.8211,
KL=0.0489, wKL=1.0000]
Train E76: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5799, E=0.8213,
KL=0.0503, wKL=1.0000]
Train E76: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.5799, E=0.8213,
KL=0.0503, wKL=1.0000]
Train E76: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4443, E=0.8210,
KL=0.0489, wKL=1.0000]
Train E76: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4443, E=0.8210,
KL=0.0489, wKL=1.0000]
Train E76: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4719, E=0.8234,
KL=0.0490, wKL=1.0000]
Train E76: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4719, E=0.8234,
KL=0.0490, wKL=1.0000]
Train E76: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4775, E=0.8224,
KL=0.0498, wKL=1.0000]
Train E76: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4775, E=0.8224,
KL=0.0498, wKL=1.0000]
Train E76: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4095, E=0.8186,
KL=0.0503, wKL=1.0000]
Train E76: 32%|███▏ | 8/25 [00:11<00:24, 1.41s/batch, N=1.4095, E=0.8186,
KL=0.0503, wKL=1.0000]
Train E76: 32%|███▏ | 8/25 [00:12<00:24, 1.41s/batch, N=1.4750, E=0.8236,
KL=0.0497, wKL=1.0000]
Train E76: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.4750, E=0.8236,
KL=0.0497, wKL=1.0000]
Train E76: 36%|███▌ | 9/25 [00:13<00:22, 1.42s/batch, N=1.5015, E=0.8166,
KL=0.0512, wKL=1.0000]
Train E76: 40%|████ | 10/25 [00:13<00:21, 1.40s/batch, N=1.5015, E=0.8166,
KL=0.0512, wKL=1.0000]
Train E76: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.4204, E=0.8121,
KL=0.0505, wKL=1.0000]
Train E76: 44%|████▍ | 11/25 [00:15<00:22, 1.60s/batch, N=1.4204, E=0.8121,
KL=0.0505, wKL=1.0000]
Train E76: 44%|████▍ | 11/25 [00:17<00:22, 1.60s/batch, N=1.5312, E=0.8230,
KL=0.0522, wKL=1.0000]
Train E76: 48%|████▊ | 12/25 [00:17<00:19, 1.52s/batch, N=1.5312, E=0.8230,
KL=0.0522, wKL=1.0000]
Train E76: 48%|████▊ | 12/25 [00:18<00:19, 1.52s/batch, N=1.4733, E=0.8218,
KL=0.0507, wKL=1.0000]
Train E76: 52%|█████▏ | 13/25 [00:18<00:17, 1.48s/batch, N=1.4733, E=0.8218,
KL=0.0507, wKL=1.0000]
Train E76: 52%|█████▏ | 13/25 [00:20<00:17, 1.48s/batch, N=1.4294, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E76: 56%|█████▌ | 14/25 [00:20<00:16, 1.47s/batch, N=1.4294, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E76: 56%|█████▌ | 14/25 [00:21<00:16, 1.47s/batch, N=1.4731, E=0.8239,
KL=0.0492, wKL=1.0000]
Train E76: 60%|██████ | 15/25 [00:21<00:14, 1.45s/batch, N=1.4731, E=0.8239,
KL=0.0492, wKL=1.0000]
Train E76: 60%|██████ | 15/25 [00:22<00:14, 1.45s/batch, N=1.5662, E=0.8267,
KL=0.0522, wKL=1.0000]
Train E76: 64%|██████▍ | 16/25 [00:22<00:12, 1.43s/batch, N=1.5662, E=0.8267,
KL=0.0522, wKL=1.0000]
Train E76: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.4209, E=0.8257,
KL=0.0494, wKL=1.0000]
Train E76: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.4209, E=0.8257,
KL=0.0494, wKL=1.0000]
Train E76: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4601, E=0.8292,
KL=0.0495, wKL=1.0000]
Train E76: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4601, E=0.8292,
KL=0.0495, wKL=1.0000]
Train E76: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5706, E=0.8185,
KL=0.0517, wKL=1.0000]
Train E76: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.5706, E=0.8185,
KL=0.0517, wKL=1.0000]
Train E76: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.5314, E=0.8275,
KL=0.0500, wKL=1.0000]
Train E76: 80%|████████ | 20/25 [00:28<00:07, 1.45s/batch, N=1.5314, E=0.8275,
KL=0.0500, wKL=1.0000]
Train E76: 80%|████████ | 20/25 [00:30<00:07, 1.45s/batch, N=1.4314, E=0.8211,
KL=0.0508, wKL=1.0000]
Train E76: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4314, E=0.8211,
KL=0.0508, wKL=1.0000]
Train E76: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.5584, E=0.8225,
KL=0.0524, wKL=1.0000]
Train E76: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.5584, E=0.8225,
KL=0.0524, wKL=1.0000]
Train E76: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.4465, E=0.8243,
KL=0.0499, wKL=1.0000]
Train E76: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4465, E=0.8243,
KL=0.0499, wKL=1.0000]
Train E76: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.5302, E=0.8239,
KL=0.0511, wKL=1.0000]
Train E76: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5302, E=0.8239,
KL=0.0511, wKL=1.0000]
Train E76: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4286, E=0.8236,
KL=0.0516, wKL=1.0000]
Train E76: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.4286, E=0.8236,
KL=0.0516, wKL=1.0000]
Train E76: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4286, E=0.8236,
KL=0.0516, wKL=1.0000]
2705.2s 230 [Epoch 076] Total: 2.3271 | N: 1.4801 | E: 0.8218 | KL(1.00×0.5):
0.0505
2740.2s 231 Train E77: 0%| | 0/25 [00:00<?, ?batch/s]
Train E77: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4402, E=0.8251, KL=0.0501,
wKL=1.0000]
Train E77: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.4402, E=0.8251,
KL=0.0501, wKL=1.0000]
Train E77: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.4384, E=0.8166,
KL=0.0504, wKL=1.0000]
Train E77: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4384, E=0.8166,
KL=0.0504, wKL=1.0000]
Train E77: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4111, E=0.8184,
KL=0.0512, wKL=1.0000]
Train E77: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4111, E=0.8184,
KL=0.0512, wKL=1.0000]
Train E77: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.5541, E=0.8270,
KL=0.0502, wKL=1.0000]
Train E77: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.5541, E=0.8270,
KL=0.0502, wKL=1.0000]
Train E77: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.5219, E=0.8230,
KL=0.0510, wKL=1.0000]
Train E77: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5219, E=0.8230,
KL=0.0510, wKL=1.0000]
Train E77: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4935, E=0.8219,
KL=0.0499, wKL=1.0000]
Train E77: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.4935, E=0.8219,
KL=0.0499, wKL=1.0000]
Train E77: 24%|██▍ | 6/25 [00:09<00:26, 1.42s/batch, N=1.4855, E=0.8237,
KL=0.0501, wKL=1.0000]
Train E77: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4855, E=0.8237,
KL=0.0501, wKL=1.0000]
Train E77: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4879, E=0.8198,
KL=0.0498, wKL=1.0000]
Train E77: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4879, E=0.8198,
KL=0.0498, wKL=1.0000]
Train E77: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4957, E=0.8204,
KL=0.0507, wKL=1.0000]
Train E77: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4957, E=0.8204,
KL=0.0507, wKL=1.0000]
Train E77: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4915, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E77: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.4915, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E77: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4456, E=0.8176,
KL=0.0496, wKL=1.0000]
Train E77: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4456, E=0.8176,
KL=0.0496, wKL=1.0000]
Train E77: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4377, E=0.8213,
KL=0.0509, wKL=1.0000]
Train E77: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4377, E=0.8213,
KL=0.0509, wKL=1.0000]
Train E77: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4169, E=0.8195,
KL=0.0499, wKL=1.0000]
Train E77: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4169, E=0.8195,
KL=0.0499, wKL=1.0000]
Train E77: 52%|█████▏ | 13/25 [00:20<00:16, 1.39s/batch, N=1.5260, E=0.8213,
KL=0.0511, wKL=1.0000]
Train E77: 56%|█████▌ | 14/25 [00:20<00:17, 1.58s/batch, N=1.5260, E=0.8213,
KL=0.0511, wKL=1.0000]
Train E77: 56%|█████▌ | 14/25 [00:21<00:17, 1.58s/batch, N=1.5324, E=0.8215,
KL=0.0511, wKL=1.0000]
Train E77: 60%|██████ | 15/25 [00:21<00:15, 1.52s/batch, N=1.5324, E=0.8215,
KL=0.0511, wKL=1.0000]
Train E77: 60%|██████ | 15/25 [00:22<00:15, 1.52s/batch, N=1.4643, E=0.8220,
KL=0.0504, wKL=1.0000]
Train E77: 64%|██████▍ | 16/25 [00:22<00:13, 1.47s/batch, N=1.4643, E=0.8220,
KL=0.0504, wKL=1.0000]
Train E77: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5139, E=0.8208,
KL=0.0509, wKL=1.0000]
Train E77: 68%|██████▊ | 17/25 [00:24<00:12, 1.54s/batch, N=1.5139, E=0.8208,
KL=0.0509, wKL=1.0000]
Train E77: 68%|██████▊ | 17/25 [00:26<00:12, 1.54s/batch, N=1.4211, E=0.8217,
KL=0.0515, wKL=1.0000]
Train E77: 72%|███████▏ | 18/25 [00:26<00:10, 1.52s/batch, N=1.4211, E=0.8217,
KL=0.0515, wKL=1.0000]
Train E77: 72%|███████▏ | 18/25 [00:27<00:10, 1.52s/batch, N=1.5577, E=0.8197,
KL=0.0503, wKL=1.0000]
Train E77: 76%|███████▌ | 19/25 [00:27<00:08, 1.47s/batch, N=1.5577, E=0.8197,
KL=0.0503, wKL=1.0000]
Train E77: 76%|███████▌ | 19/25 [00:28<00:08, 1.47s/batch, N=1.4170, E=0.8211,
KL=0.0517, wKL=1.0000]
Train E77: 80%|████████ | 20/25 [00:28<00:07, 1.46s/batch, N=1.4170, E=0.8211,
KL=0.0517, wKL=1.0000]
Train E77: 80%|████████ | 20/25 [00:30<00:07, 1.46s/batch, N=1.4397, E=0.8257,
KL=0.0501, wKL=1.0000]
Train E77: 84%|████████▍ | 21/25 [00:30<00:05, 1.45s/batch, N=1.4397, E=0.8257,
KL=0.0501, wKL=1.0000]
Train E77: 84%|████████▍ | 21/25 [00:31<00:05, 1.45s/batch, N=1.5025, E=0.8231,
KL=0.0496, wKL=1.0000]
Train E77: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.5025, E=0.8231,
KL=0.0496, wKL=1.0000]
Train E77: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5516, E=0.8212,
KL=0.0503, wKL=1.0000]
Train E77: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5516, E=0.8212,
KL=0.0503, wKL=1.0000]
Train E77: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4884, E=0.8194,
KL=0.0519, wKL=1.0000]
Train E77: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4884, E=0.8194,
KL=0.0519, wKL=1.0000]
Train E77: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4567, E=0.8260,
KL=0.0481, wKL=1.0000]
Train E77: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4567, E=0.8260,
KL=0.0481, wKL=1.0000]
Train E77: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4567, E=0.8260,
KL=0.0481, wKL=1.0000]
2740.2s 232 [Epoch 077] Total: 2.3270 | N: 1.4802 | E: 0.8215 | KL(1.00×0.5):
0.0505
2774.9s 233 Train E78: 0%| | 0/25 [00:00<?, ?batch/s]
Train E78: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5463, E=0.8264, KL=0.0496,
wKL=1.0000]
Train E78: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5463, E=0.8264,
KL=0.0496, wKL=1.0000]
Train E78: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4583, E=0.8186,
KL=0.0511, wKL=1.0000]
Train E78: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4583, E=0.8186,
KL=0.0511, wKL=1.0000]
Train E78: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.5744, E=0.8274,
KL=0.0504, wKL=1.0000]
Train E78: 12%|█▏ | 3/25 [00:04<00:30, 1.36s/batch, N=1.5744, E=0.8274,
KL=0.0504, wKL=1.0000]
Train E78: 12%|█▏ | 3/25 [00:05<00:30, 1.36s/batch, N=1.4490, E=0.8217,
KL=0.0498, wKL=1.0000]
Train E78: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4490, E=0.8217,
KL=0.0498, wKL=1.0000]
Train E78: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.5076, E=0.8250,
KL=0.0500, wKL=1.0000]
Train E78: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5076, E=0.8250,
KL=0.0500, wKL=1.0000]
Train E78: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4512, E=0.8222,
KL=0.0518, wKL=1.0000]
Train E78: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4512, E=0.8222,
KL=0.0518, wKL=1.0000]
Train E78: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5049, E=0.8197,
KL=0.0497, wKL=1.0000]
Train E78: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5049, E=0.8197,
KL=0.0497, wKL=1.0000]
Train E78: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4420, E=0.8207,
KL=0.0530, wKL=1.0000]
Train E78: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4420, E=0.8207,
KL=0.0530, wKL=1.0000]
Train E78: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4918, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4918, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.5346, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.5346, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4990, E=0.8259,
KL=0.0492, wKL=1.0000]
Train E78: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4990, E=0.8259,
KL=0.0492, wKL=1.0000]
Train E78: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5064, E=0.8189,
KL=0.0500, wKL=1.0000]
Train E78: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5064, E=0.8189,
KL=0.0500, wKL=1.0000]
Train E78: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4709, E=0.8208,
KL=0.0512, wKL=1.0000]
Train E78: 52%|█████▏ | 13/25 [00:17<00:16, 1.38s/batch, N=1.4709, E=0.8208,
KL=0.0512, wKL=1.0000]
Train E78: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.5565, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E78: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.5565, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E78: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4205, E=0.8260,
KL=0.0507, wKL=1.0000]
Train E78: 60%|██████ | 15/25 [00:20<00:14, 1.42s/batch, N=1.4205, E=0.8260,
KL=0.0507, wKL=1.0000]
Train E78: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.4347, E=0.8199,
KL=0.0495, wKL=1.0000]
Train E78: 64%|██████▍ | 16/25 [00:22<00:13, 1.45s/batch, N=1.4347, E=0.8199,
KL=0.0495, wKL=1.0000]
Train E78: 64%|██████▍ | 16/25 [00:23<00:13, 1.45s/batch, N=1.4745, E=0.8217,
KL=0.0490, wKL=1.0000]
Train E78: 68%|██████▊ | 17/25 [00:23<00:11, 1.44s/batch, N=1.4745, E=0.8217,
KL=0.0490, wKL=1.0000]
Train E78: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4519, E=0.8218,
KL=0.0496, wKL=1.0000]
Train E78: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4519, E=0.8218,
KL=0.0496, wKL=1.0000]
Train E78: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5019, E=0.8206,
KL=0.0499, wKL=1.0000]
Train E78: 76%|███████▌ | 19/25 [00:27<00:09, 1.60s/batch, N=1.5019, E=0.8206,
KL=0.0499, wKL=1.0000]
Train E78: 76%|███████▌ | 19/25 [00:28<00:09, 1.60s/batch, N=1.4589, E=0.8231,
KL=0.0501, wKL=1.0000]
Train E78: 80%|████████ | 20/25 [00:28<00:07, 1.53s/batch, N=1.4589, E=0.8231,
KL=0.0501, wKL=1.0000]
Train E78: 80%|████████ | 20/25 [00:29<00:07, 1.53s/batch, N=1.4548, E=0.8233,
KL=0.0495, wKL=1.0000]
Train E78: 84%|████████▍ | 21/25 [00:29<00:05, 1.48s/batch, N=1.4548, E=0.8233,
KL=0.0495, wKL=1.0000]
Train E78: 84%|████████▍ | 21/25 [00:31<00:05, 1.48s/batch, N=1.4272, E=0.8163,
KL=0.0499, wKL=1.0000]
Train E78: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4272, E=0.8163,
KL=0.0499, wKL=1.0000]
Train E78: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.4642, E=0.8257,
KL=0.0502, wKL=1.0000]
Train E78: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.4642, E=0.8257,
KL=0.0502, wKL=1.0000]
Train E78: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.4457, E=0.8170,
KL=0.0506, wKL=1.0000]
Train E78: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4457, E=0.8170,
KL=0.0506, wKL=1.0000]
Train E78: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4779, E=0.8169,
KL=0.0514, wKL=1.0000]
Train E78: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4779, E=0.8169,
KL=0.0514, wKL=1.0000]
Train E78: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4779, E=0.8169,
KL=0.0514, wKL=1.0000]
2774.9s 234 [Epoch 078] Total: 2.3271 | N: 1.4803 | E: 0.8217 | KL(1.00×0.5):
0.0504
2809.7s 235 Train E79: 0%| | 0/25 [00:00<?, ?batch/s]
Train E79: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4136, E=0.8232, KL=0.0514,
wKL=1.0000]
Train E79: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4136, E=0.8232,
KL=0.0514, wKL=1.0000]
Train E79: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5013, E=0.8214,
KL=0.0517, wKL=1.0000]
Train E79: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5013, E=0.8214,
KL=0.0517, wKL=1.0000]
Train E79: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4968, E=0.8208,
KL=0.0500, wKL=1.0000]
Train E79: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4968, E=0.8208,
KL=0.0500, wKL=1.0000]
Train E79: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4443, E=0.8207,
KL=0.0495, wKL=1.0000]
Train E79: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4443, E=0.8207,
KL=0.0495, wKL=1.0000]
Train E79: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.3959, E=0.8221,
KL=0.0484, wKL=1.0000]
Train E79: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.3959, E=0.8221,
KL=0.0484, wKL=1.0000]
Train E79: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4460, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E79: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4460, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E79: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5543, E=0.8239,
KL=0.0501, wKL=1.0000]
Train E79: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5543, E=0.8239,
KL=0.0501, wKL=1.0000]
Train E79: 28%|██▊ | 7/25 [00:10<00:24, 1.38s/batch, N=1.5059, E=0.8186,
KL=0.0489, wKL=1.0000]
Train E79: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.5059, E=0.8186,
KL=0.0489, wKL=1.0000]
Train E79: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4985, E=0.8177,
KL=0.0497, wKL=1.0000]
Train E79: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.4985, E=0.8177,
KL=0.0497, wKL=1.0000]
Train E79: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.4382, E=0.8216,
KL=0.0495, wKL=1.0000]
Train E79: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4382, E=0.8216,
KL=0.0495, wKL=1.0000]
Train E79: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5302, E=0.8216,
KL=0.0527, wKL=1.0000]
Train E79: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5302, E=0.8216,
KL=0.0527, wKL=1.0000]
Train E79: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4568, E=0.8188,
KL=0.0506, wKL=1.0000]
Train E79: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4568, E=0.8188,
KL=0.0506, wKL=1.0000]
Train E79: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4980, E=0.8250,
KL=0.0511, wKL=1.0000]
Train E79: 52%|█████▏ | 13/25 [00:18<00:17, 1.45s/batch, N=1.4980, E=0.8250,
KL=0.0511, wKL=1.0000]
Train E79: 52%|█████▏ | 13/25 [00:19<00:17, 1.45s/batch, N=1.4282, E=0.8166,
KL=0.0507, wKL=1.0000]
Train E79: 56%|█████▌ | 14/25 [00:19<00:16, 1.48s/batch, N=1.4282, E=0.8166,
KL=0.0507, wKL=1.0000]
Train E79: 56%|█████▌ | 14/25 [00:21<00:16, 1.48s/batch, N=1.5073, E=0.8244,
KL=0.0500, wKL=1.0000]
Train E79: 60%|██████ | 15/25 [00:21<00:14, 1.46s/batch, N=1.5073, E=0.8244,
KL=0.0500, wKL=1.0000]
Train E79: 60%|██████ | 15/25 [00:22<00:14, 1.46s/batch, N=1.5327, E=0.8258,
KL=0.0499, wKL=1.0000]
Train E79: 64%|██████▍ | 16/25 [00:22<00:12, 1.44s/batch, N=1.5327, E=0.8258,
KL=0.0499, wKL=1.0000]
Train E79: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.5054, E=0.8209,
KL=0.0494, wKL=1.0000]
Train E79: 68%|██████▊ | 17/25 [00:23<00:11, 1.43s/batch, N=1.5054, E=0.8209,
KL=0.0494, wKL=1.0000]
Train E79: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.4753, E=0.8193,
KL=0.0501, wKL=1.0000]
Train E79: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4753, E=0.8193,
KL=0.0501, wKL=1.0000]
Train E79: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.4056, E=0.8216,
KL=0.0496, wKL=1.0000]
Train E79: 76%|███████▌ | 19/25 [00:26<00:08, 1.41s/batch, N=1.4056, E=0.8216,
KL=0.0496, wKL=1.0000]
Train E79: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4801, E=0.8217,
KL=0.0519, wKL=1.0000]
Train E79: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4801, E=0.8217,
KL=0.0519, wKL=1.0000]
Train E79: 80%|████████ | 20/25 [00:29<00:06, 1.40s/batch, N=1.5106, E=0.8235,
KL=0.0512, wKL=1.0000]
Train E79: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5106, E=0.8235,
KL=0.0512, wKL=1.0000]
Train E79: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.5250, E=0.8225,
KL=0.0507, wKL=1.0000]
Train E79: 88%|████████▊ | 22/25 [00:31<00:04, 1.57s/batch, N=1.5250, E=0.8225,
KL=0.0507, wKL=1.0000]
Train E79: 88%|████████▊ | 22/25 [00:32<00:04, 1.57s/batch, N=1.5053, E=0.8187,
KL=0.0509, wKL=1.0000]
Train E79: 92%|█████████▏| 23/25 [00:32<00:03, 1.51s/batch, N=1.5053, E=0.8187,
KL=0.0509, wKL=1.0000]
Train E79: 92%|█████████▏| 23/25 [00:34<00:03, 1.51s/batch, N=1.4286, E=0.8237,
KL=0.0495, wKL=1.0000]
Train E79: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.4286, E=0.8237,
KL=0.0495, wKL=1.0000]
Train E79: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.5679, E=0.8251,
KL=0.0498, wKL=1.0000]
Train E79: 100%|██████████| 25/25 [00:34<00:00, 1.22s/batch, N=1.5679, E=0.8251,
KL=0.0498, wKL=1.0000]
Train E79: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5679, E=0.8251,
KL=0.0498, wKL=1.0000]
2809.7s 236 [Epoch 079] Total: 2.3267 | N: 1.4800 | E: 0.8215 | KL(1.00×0.5):
0.0503
2844.9s 237 [Epoch 080] Total: 2.3266 | N: 1.4798 | E: 0.8216 | KL(1.00×0.5):
0.0503
2844.9s 238 Saved checkpoint: /kaggle/working/checkpoints/gvae_80_epoch080.pt
2844.9s 239 Train E80: 0%| | 0/25 [00:00<?, ?batch/s]
Train E80: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4574, E=0.8178, KL=0.0497,
wKL=1.0000]
Train E80: 4%|▍ | 1/25 [00:01<00:34, 1.43s/batch, N=1.4574, E=0.8178,
KL=0.0497, wKL=1.0000]
Train E80: 4%|▍ | 1/25 [00:02<00:34, 1.43s/batch, N=1.3826, E=0.8243,
KL=0.0490, wKL=1.0000]
Train E80: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.3826, E=0.8243,
KL=0.0490, wKL=1.0000]
Train E80: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.5051, E=0.8204,
KL=0.0506, wKL=1.0000]
Train E80: 12%|█▏ | 3/25 [00:04<00:31, 1.41s/batch, N=1.5051, E=0.8204,
KL=0.0506, wKL=1.0000]
Train E80: 12%|█▏ | 3/25 [00:05<00:31, 1.41s/batch, N=1.4334, E=0.8193,
KL=0.0493, wKL=1.0000]
Train E80: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4334, E=0.8193,
KL=0.0493, wKL=1.0000]
Train E80: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.5351, E=0.8275,
KL=0.0508, wKL=1.0000]
Train E80: 20%|██ | 5/25 [00:07<00:27, 1.39s/batch, N=1.5351, E=0.8275,
KL=0.0508, wKL=1.0000]
Train E80: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4755, E=0.8143,
KL=0.0506, wKL=1.0000]
Train E80: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.4755, E=0.8143,
KL=0.0506, wKL=1.0000]
Train E80: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4700, E=0.8232,
KL=0.0499, wKL=1.0000]
Train E80: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4700, E=0.8232,
KL=0.0499, wKL=1.0000]
Train E80: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5489, E=0.8210,
KL=0.0514, wKL=1.0000]
Train E80: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.5489, E=0.8210,
KL=0.0514, wKL=1.0000]
Train E80: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.4681, E=0.8261,
KL=0.0494, wKL=1.0000]
Train E80: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4681, E=0.8261,
KL=0.0494, wKL=1.0000]
Train E80: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.4855, E=0.8211,
KL=0.0502, wKL=1.0000]
Train E80: 40%|████ | 10/25 [00:14<00:21, 1.40s/batch, N=1.4855, E=0.8211,
KL=0.0502, wKL=1.0000]
Train E80: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.4895, E=0.8198,
KL=0.0510, wKL=1.0000]
Train E80: 44%|████▍ | 11/25 [00:15<00:20, 1.46s/batch, N=1.4895, E=0.8198,
KL=0.0510, wKL=1.0000]
Train E80: 44%|████▍ | 11/25 [00:17<00:20, 1.46s/batch, N=1.4731, E=0.8225,
KL=0.0498, wKL=1.0000]
Train E80: 48%|████▊ | 12/25 [00:17<00:18, 1.43s/batch, N=1.4731, E=0.8225,
KL=0.0498, wKL=1.0000]
Train E80: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.5002, E=0.8224,
KL=0.0494, wKL=1.0000]
Train E80: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5002, E=0.8224,
KL=0.0494, wKL=1.0000]
Train E80: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.5430, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E80: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5430, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E80: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4453, E=0.8180,
KL=0.0504, wKL=1.0000]
Train E80: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.4453, E=0.8180,
KL=0.0504, wKL=1.0000]
Train E80: 60%|██████ | 15/25 [00:22<00:14, 1.43s/batch, N=1.4463, E=0.8236,
KL=0.0489, wKL=1.0000]
Train E80: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.4463, E=0.8236,
KL=0.0489, wKL=1.0000]
Train E80: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.5152, E=0.8207,
KL=0.0513, wKL=1.0000]
Train E80: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.5152, E=0.8207,
KL=0.0513, wKL=1.0000]
Train E80: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4201, E=0.8185,
KL=0.0498, wKL=1.0000]
Train E80: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4201, E=0.8185,
KL=0.0498, wKL=1.0000]
Train E80: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.6035, E=0.8281,
KL=0.0501, wKL=1.0000]
Train E80: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.6035, E=0.8281,
KL=0.0501, wKL=1.0000]
Train E80: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4994, E=0.8254,
KL=0.0507, wKL=1.0000]
Train E80: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4994, E=0.8254,
KL=0.0507, wKL=1.0000]
Train E80: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.4604, E=0.8212,
KL=0.0498, wKL=1.0000]
Train E80: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4604, E=0.8212,
KL=0.0498, wKL=1.0000]
Train E80: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.3982, E=0.8171,
KL=0.0502, wKL=1.0000]
Train E80: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.3982, E=0.8171,
KL=0.0502, wKL=1.0000]
Train E80: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.4754, E=0.8217,
KL=0.0520, wKL=1.0000]
Train E80: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4754, E=0.8217,
KL=0.0520, wKL=1.0000]
Train E80: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4833, E=0.8209,
KL=0.0503, wKL=1.0000]
Train E80: 96%|█████████▌| 24/25 [00:34<00:01, 1.59s/batch, N=1.4833, E=0.8209,
KL=0.0503, wKL=1.0000]
Train E80: 96%|█████████▌| 24/25 [00:35<00:01, 1.59s/batch, N=1.4833, E=0.8231,
KL=0.0486, wKL=1.0000]
Train E80: 100%|██████████| 25/25 [00:35<00:00, 1.31s/batch, N=1.4833, E=0.8231,
KL=0.0486, wKL=1.0000]
Train E80: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4833, E=0.8231,
KL=0.0486, wKL=1.0000]
2850.7s 240 /usr/local/lib/python3.11/dist-packages/traitlets/traitlets.py:2915:
FutureWarning: --
Exporter.preprocessors=["remove_papermill_header.RemovePapermillHeader"] for
containers is deprecated in traitlets 5.0. You can pass `--Exporter.preprocessors
item` ... multiple times to add items to a list.
2850.7s 241 warn(
2850.7s 242 [NbConvertApp] Converting notebook __notebook__.ipynb to notebook
2851.5s 243 [NbConvertApp] Writing 67931 bytes to __notebook__.ipynb
2852.8s 244 /usr/local/lib/python3.11/dist-packages/traitlets/traitlets.py:2915:
FutureWarning: --
Exporter.preprocessors=["nbconvert.preprocessors.ExtractOutputPreprocessor"] for
containers is deprecated in traitlets 5.0. You can pass `--Exporter.preprocessors
item` ... multiple times to add items to a list.
2852.8s 245 warn(
2852.8s 246 [NbConvertApp] Converting notebook __notebook__.ipynb to html
2853.6s 247 [NbConvertApp] Writing 409106 bytes to __results__.html