0% found this document useful (0 votes)
16 views146 pages

Graph Vae Training - Log

The document contains warnings related to the use of frozen modules in a Python debugger, suggesting to disable them for better breakpoint functionality. It details the installation process of the 'torch_geometric' and 'rdkit' packages, confirming successful installations. Additionally, it mentions the use of multiple GPUs for training, along with a warning about the performance of 'DataParallel' compared to 'DistributedDataParallel'.

Uploaded by

ahnd6474
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views146 pages

Graph Vae Training - Log

The document contains warnings related to the use of frozen modules in a Python debugger, suggesting to disable them for better breakpoint functionality. It details the installation process of the 'torch_geometric' and 'rdkit' packages, confirming successful installations. Additionally, it mentions the use of multiple GPUs for training, along with a warning about the performance of 'DataParallel' compared to 'DistributedDataParallel'.

Uploaded by

ahnd6474
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 146

5.3s 1 0.

00s - Debugger warning: It seems that frozen modules are being used, which
may
5.3s 2 0.00s - make the debugger miss breakpoints. Please pass -Xfrozen_modules=off
5.3s 3 0.00s - to python to disable frozen modules.
5.3s 4 0.00s - Note: Debugging will proceed. Set PYDEVD_DISABLE_FILE_VALIDATION=1
to disable this validation.
6.0s 5 0.00s - Debugger warning: It seems that frozen modules are being used, which
may
6.0s 6 0.00s - make the debugger miss breakpoints. Please pass -Xfrozen_modules=off
6.0s 7 0.00s - to python to disable frozen modules.
6.0s 8 0.00s - Note: Debugging will proceed. Set PYDEVD_DISABLE_FILE_VALIDATION=1
to disable this validation.
10.0s 9 Collecting torch_geometric
10.1s 10 Downloading torch_geometric-2.6.1-py3-none-any.whl.metadata (63 kB)
10.1s 11 #[?25l #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m0.0/63.1 kB#[0m #[31m?#[0m eta #[36m-:--:--#[0m
#[2K #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m63.1/63.1 kB#[0m
#[31m2.7 MB/s#[0m eta #[36m0:00:00#[0m
10.1s 12 #[?25hRequirement already satisfied: aiohttp in
/usr/local/lib/python3.11/dist-packages (from torch_geometric) (3.11.18)
10.1s 13 Requirement already satisfied: fsspec in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (2025.3.2)
10.1s 14 Requirement already satisfied: jinja2 in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (3.1.6)
10.1s 15 Requirement already satisfied: numpy in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (1.26.4)
10.1s 16 Requirement already satisfied: psutil>=5.8.0 in
/usr/local/lib/python3.11/dist-packages (from torch_geometric) (7.0.0)
10.1s 17 Requirement already satisfied: pyparsing in
/usr/local/lib/python3.11/dist-packages (from torch_geometric) (3.0.9)
10.1s 18 Requirement already satisfied: requests in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (2.32.3)
10.1s 19 Requirement already satisfied: tqdm in /usr/local/lib/python3.11/dist-
packages (from torch_geometric) (4.67.1)
10.1s 20 Requirement already satisfied: aiohappyeyeballs>=2.3.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (2.6.1)
10.2s 21 Requirement already satisfied: aiosignal>=1.1.2 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (1.3.2)
10.2s 22 Requirement already satisfied: attrs>=17.3.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (25.3.0)
10.2s 23 Requirement already satisfied: frozenlist>=1.1.1 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (1.6.0)
10.2s 24 Requirement already satisfied: multidict<7.0,>=4.5 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (6.4.3)
10.2s 25 Requirement already satisfied: propcache>=0.2.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (0.3.1)
10.2s 26 Requirement already satisfied: yarl<2.0,>=1.17.0 in
/usr/local/lib/python3.11/dist-packages (from aiohttp->torch_geometric) (1.20.0)
10.2s 27 Requirement already satisfied: MarkupSafe>=2.0 in
/usr/local/lib/python3.11/dist-packages (from jinja2->torch_geometric) (3.0.2)
10.2s 28 Requirement already satisfied: mkl_fft in /usr/local/lib/python3.11/dist-
packages (from numpy->torch_geometric) (1.3.8)
10.2s 29 Requirement already satisfied: mkl_random in
/usr/local/lib/python3.11/dist-packages (from numpy->torch_geometric) (1.2.4)
10.2s 30 Requirement already satisfied: mkl_umath in
/usr/local/lib/python3.11/dist-packages (from numpy->torch_geometric) (0.1.1)
10.2s 31 Requirement already satisfied: mkl in /usr/local/lib/python3.11/dist-
packages (from numpy->torch_geometric) (2025.1.0)
10.2s 32 Requirement already satisfied: tbb4py in /usr/local/lib/python3.11/dist-
packages (from numpy->torch_geometric) (2022.1.0)
10.2s 33 Requirement already satisfied: mkl-service in
/usr/local/lib/python3.11/dist-packages (from numpy->torch_geometric) (2.4.1)
10.2s 34 Requirement already satisfied: charset-normalizer<4,>=2 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric) (3.4.2)
10.2s 35 Requirement already satisfied: idna<4,>=2.5 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric) (3.10)
10.2s 36 Requirement already satisfied: urllib3<3,>=1.21.1 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric) (2.4.0)
10.2s 37 Requirement already satisfied: certifi>=2017.4.17 in
/usr/local/lib/python3.11/dist-packages (from requests->torch_geometric)
(2025.4.26)
10.2s 38 Requirement already satisfied: intel-openmp<2026,>=2024 in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->torch_geometric)
(2024.2.0)
10.2s 39 Requirement already satisfied: tbb==2022.* in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->torch_geometric)
(2022.1.0)
10.2s 40 Requirement already satisfied: tcmlib==1.* in
/usr/local/lib/python3.11/dist-packages (from tbb==2022.*->mkl->numpy-
>torch_geometric) (1.3.0)
10.2s 41 Requirement already satisfied: intel-cmplr-lib-rt in
/usr/local/lib/python3.11/dist-packages (from mkl_umath->numpy->torch_geometric)
(2024.2.0)
10.2s 42 Requirement already satisfied: intel-cmplr-lib-ur==2024.2.0 in
/usr/local/lib/python3.11/dist-packages (from intel-openmp<2026,>=2024->mkl->numpy-
>torch_geometric) (2024.2.0)
10.2s 43 Downloading torch_geometric-2.6.1-py3-none-any.whl (1.1 MB)
10.3s 44 #[?25l #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m0.0/1.1
MB#[0m #[31m?#[0m eta #[36m-:--:--#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m1.1/1.1
MB#[0m #[31m34.0 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m1.1/1.1 MB#[0m
#[31m23.0 MB/s#[0m eta #[36m0:00:00#[0m
11.9s 45 #[?25hInstalling collected packages: torch_geometric
12.7s 46 Successfully installed torch_geometric-2.6.1
14.1s 47 Requirement already satisfied: tqdm in /usr/local/lib/python3.11/dist-
packages (4.67.1)
17.1s 48 Collecting rdkit
17.1s 49 Downloading rdkit-2025.3.3-cp311-cp311-manylinux_2_28_x86_64.whl.metadata
(4.0 kB)
17.2s 50 Requirement already satisfied: numpy in /usr/local/lib/python3.11/dist-
packages (from rdkit) (1.26.4)
17.2s 51 Requirement already satisfied: Pillow in /usr/local/lib/python3.11/dist-
packages (from rdkit) (11.1.0)
17.2s 52 Requirement already satisfied: mkl_fft in /usr/local/lib/python3.11/dist-
packages (from numpy->rdkit) (1.3.8)
17.2s 53 Requirement already satisfied: mkl_random in
/usr/local/lib/python3.11/dist-packages (from numpy->rdkit) (1.2.4)
17.2s 54 Requirement already satisfied: mkl_umath in
/usr/local/lib/python3.11/dist-packages (from numpy->rdkit) (0.1.1)
17.2s 55 Requirement already satisfied: mkl in /usr/local/lib/python3.11/dist-
packages (from numpy->rdkit) (2025.1.0)
17.2s 56 Requirement already satisfied: tbb4py in /usr/local/lib/python3.11/dist-
packages (from numpy->rdkit) (2022.1.0)
17.2s 57 Requirement already satisfied: mkl-service in
/usr/local/lib/python3.11/dist-packages (from numpy->rdkit) (2.4.1)
17.2s 58 Requirement already satisfied: intel-openmp<2026,>=2024 in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->rdkit) (2024.2.0)
17.2s 59 Requirement already satisfied: tbb==2022.* in
/usr/local/lib/python3.11/dist-packages (from mkl->numpy->rdkit) (2022.1.0)
17.2s 60 Requirement already satisfied: tcmlib==1.* in
/usr/local/lib/python3.11/dist-packages (from tbb==2022.*->mkl->numpy->rdkit)
(1.3.0)
17.2s 61 Requirement already satisfied: intel-cmplr-lib-rt in
/usr/local/lib/python3.11/dist-packages (from mkl_umath->numpy->rdkit) (2024.2.0)
17.2s 62 Requirement already satisfied: intel-cmplr-lib-ur==2024.2.0 in
/usr/local/lib/python3.11/dist-packages (from intel-openmp<2026,>=2024->mkl->numpy-
>rdkit) (2024.2.0)
17.2s 63 Downloading rdkit-2025.3.3-cp311-cp311-manylinux_2_28_x86_64.whl (34.9 MB)
17.6s 64 #[?25l #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m0.0/34.9
MB#[0m #[31m?#[0m eta #[36m-:--:--#[0m
#[2K #[91m╸#[0m#[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m0.5/34.9
MB#[0m #[31m14.2 MB/s#[0m eta #[36m0:00:03#[0m
#[2K #[91m━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m3.8/34.9 MB#[0m #[31m55.7 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m9.7/34.9 MB#[0m #[31m93.5 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━━━━━━━━#[0m
#[32m16.1/34.9 MB#[0m #[31m184.5 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[90m╺#[0m#[90m━━━━━━━━━━━━━━#[0m
#[32m22.1/34.9 MB#[0m #[31m178.2 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m#[90m━━━━━━━#[0m
#[32m28.4/34.9 MB#[0m #[31m181.2 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m#[91m╸#[0m #[32m34.9/34.9
MB#[0m #[31m206.4 MB/s#[0m eta #[36m0:00:01#[0m
#[2K #[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━#[0m #[32m34.9/34.9 MB#[0m
#[31m53.3 MB/s#[0m eta #[36m0:00:00#[0m
19.2s 65 #[?25hInstalling collected packages: rdkit
20.4s 66 Successfully installed rdkit-2025.3.3
46.2s 67 총 로드된 그래프 개수: 100000
46.2s 68 Logical CPU cores: 4
46.5s 69 Using 2 GPUs with PyG DataParallel
46.5s 70
/usr/local/lib/python3.11/dist-packages/torch_geometric/nn/data_parallel.py:60:
UserWarning: 'DataParallel' is usually much slower than 'DistributedDataParallel'
even on a single machine. Please consider switching to 'DistributedDataParallel'
for multi-GPU training.
46.5s 71 warnings.warn("'DataParallel' is usually much slower than "
91.1s 72 Train E01: 0%| | 0/25 [00:00<?, ?batch/s]
Train E01: 0%| | 0/25 [00:03<?, ?batch/s, N=16.3778, E=1.3234,
KL=0.0500, wKL=0.0250]
Train E01: 4%|▍ | 1/25 [00:03<01:21, 3.38s/batch, N=16.3778, E=1.3234,
KL=0.0500, wKL=0.0250]
Train E01: 4%|▍ | 1/25 [00:05<01:21, 3.38s/batch, N=14.4663, E=1.2715,
KL=0.0678, wKL=0.0250]
Train E01: 8%|▊ | 2/25 [00:05<00:55, 2.43s/batch, N=14.4663, E=1.2715,
KL=0.0678, wKL=0.0250]
Train E01: 8%|▊ | 2/25 [00:06<00:55, 2.43s/batch, N=12.2940, E=1.1990,
KL=0.1175, wKL=0.0250]
Train E01: 12%|█▏ | 3/25 [00:06<00:45, 2.08s/batch, N=12.2940, E=1.1990,
KL=0.1175, wKL=0.0250]
Train E01: 12%|█▏ | 3/25 [00:08<00:45, 2.08s/batch, N=9.4547, E=1.0925,
KL=0.2018, wKL=0.0250]
Train E01: 16%|█▌ | 4/25 [00:08<00:40, 1.91s/batch, N=9.4547, E=1.0925,
KL=0.2018, wKL=0.0250]
Train E01: 16%|█▌ | 4/25 [00:10<00:40, 1.91s/batch, N=6.4252, E=0.9867,
KL=0.3325, wKL=0.0250]
Train E01: 20%|██ | 5/25 [00:10<00:36, 1.82s/batch, N=6.4252, E=0.9867,
KL=0.3325, wKL=0.0250]
Train E01: 20%|██ | 5/25 [00:11<00:36, 1.82s/batch, N=3.5332, E=0.9469,
KL=0.5261, wKL=0.0250]
Train E01: 24%|██▍ | 6/25 [00:11<00:34, 1.81s/batch, N=3.5332, E=0.9469,
KL=0.5261, wKL=0.0250]
Train E01: 24%|██▍ | 6/25 [00:13<00:34, 1.81s/batch, N=1.8069, E=0.9927,
KL=0.7991, wKL=0.0250]
Train E01: 28%|██▊ | 7/25 [00:13<00:32, 1.78s/batch, N=1.8069, E=0.9927,
KL=0.7991, wKL=0.0250]
Train E01: 28%|██▊ | 7/25 [00:15<00:32, 1.78s/batch, N=2.3330, E=1.0492,
KL=1.1288, wKL=0.0250]
Train E01: 32%|███▏ | 8/25 [00:15<00:30, 1.79s/batch, N=2.3330, E=1.0492,
KL=1.1288, wKL=0.0250]
Train E01: 32%|███▏ | 8/25 [00:17<00:30, 1.79s/batch, N=3.8587, E=1.0467,
KL=1.3643, wKL=0.0250]
Train E01: 36%|███▌ | 9/25 [00:17<00:28, 1.78s/batch, N=3.8587, E=1.0467,
KL=1.3643, wKL=0.0250]
Train E01: 36%|███▌ | 9/25 [00:18<00:28, 1.78s/batch, N=3.8348, E=0.9965,
KL=1.4050, wKL=0.0250]
Train E01: 40%|████ | 10/25 [00:18<00:26, 1.76s/batch, N=3.8348, E=0.9965,
KL=1.4050, wKL=0.0250]
Train E01: 40%|████ | 10/25 [00:20<00:26, 1.76s/batch, N=2.8877, E=0.9400,
KL=1.3302, wKL=0.0250]
Train E01: 44%|████▍ | 11/25 [00:20<00:24, 1.75s/batch, N=2.8877, E=0.9400,
KL=1.3302, wKL=0.0250]
Train E01: 44%|████▍ | 11/25 [00:22<00:24, 1.75s/batch, N=2.0811, E=0.9114,
KL=1.2041, wKL=0.0250]
Train E01: 48%|████▊ | 12/25 [00:22<00:23, 1.80s/batch, N=2.0811, E=0.9114,
KL=1.2041, wKL=0.0250]
Train E01: 48%|████▊ | 12/25 [00:24<00:23, 1.80s/batch, N=1.7251, E=0.9285,
KL=1.0838, wKL=0.0250]
Train E01: 52%|█████▏ | 13/25 [00:24<00:21, 1.78s/batch, N=1.7251, E=0.9285,
KL=1.0838, wKL=0.0250]
Train E01: 52%|█████▏ | 13/25 [00:25<00:21, 1.78s/batch, N=1.7046, E=0.9585,
KL=0.9776, wKL=0.0250]
Train E01: 56%|█████▌ | 14/25 [00:25<00:19, 1.75s/batch, N=1.7046, E=0.9585,
KL=0.9776, wKL=0.0250]
Train E01: 56%|█████▌ | 14/25 [00:27<00:19, 1.75s/batch, N=1.9528, E=0.9682,
KL=0.9068, wKL=0.0250]
Train E01: 60%|██████ | 15/25 [00:27<00:17, 1.72s/batch, N=1.9528, E=0.9682,
KL=0.9068, wKL=0.0250]
Train E01: 60%|██████ | 15/25 [00:29<00:17, 1.72s/batch, N=2.1785, E=0.9598,
KL=0.8592, wKL=0.0250]
Train E01: 64%|██████▍ | 16/25 [00:29<00:15, 1.71s/batch, N=2.1785, E=0.9598,
KL=0.8592, wKL=0.0250]
Train E01: 64%|██████▍ | 16/25 [00:31<00:15, 1.71s/batch, N=2.3986, E=0.9408,
KL=0.8431, wKL=0.0250]
Train E01: 68%|██████▊ | 17/25 [00:31<00:13, 1.71s/batch, N=2.3986, E=0.9408,
KL=0.8431, wKL=0.0250]
Train E01: 68%|██████▊ | 17/25 [00:32<00:13, 1.71s/batch, N=2.4209, E=0.9233,
KL=0.8474, wKL=0.0250]
Train E01: 72%|███████▏ | 18/25 [00:32<00:11, 1.70s/batch, N=2.4209, E=0.9233,
KL=0.8474, wKL=0.0250]
Train E01: 72%|███████▏ | 18/25 [00:34<00:11, 1.70s/batch, N=2.1500, E=0.9119,
KL=0.8644, wKL=0.0250]
Train E01: 76%|███████▌ | 19/25 [00:34<00:10, 1.71s/batch, N=2.1500, E=0.9119,
KL=0.8644, wKL=0.0250]
Train E01: 76%|███████▌ | 19/25 [00:36<00:10, 1.71s/batch, N=1.9749, E=0.9128,
KL=0.9021, wKL=0.0250]
Train E01: 80%|████████ | 20/25 [00:36<00:09, 1.92s/batch, N=1.9749, E=0.9128,
KL=0.9021, wKL=0.0250]
Train E01: 80%|████████ | 20/25 [00:38<00:09, 1.92s/batch, N=1.9611, E=0.9146,
KL=0.9556, wKL=0.0250]
Train E01: 84%|████████▍ | 21/25 [00:38<00:07, 1.86s/batch, N=1.9611, E=0.9146,
KL=0.9556, wKL=0.0250]
Train E01: 84%|████████▍ | 21/25 [00:40<00:07, 1.86s/batch, N=1.6205, E=0.9207,
KL=1.0094, wKL=0.0250]
Train E01: 88%|████████▊ | 22/25 [00:40<00:05, 1.83s/batch, N=1.6205, E=0.9207,
KL=1.0094, wKL=0.0250]
Train E01: 88%|████████▊ | 22/25 [00:42<00:05, 1.83s/batch, N=1.6097, E=0.9339,
KL=1.0726, wKL=0.0250]
Train E01: 92%|█████████▏| 23/25 [00:42<00:03, 1.79s/batch, N=1.6097, E=0.9339,
KL=1.0726, wKL=0.0250]
Train E01: 92%|█████████▏| 23/25 [00:43<00:03, 1.79s/batch, N=1.6439, E=0.9437,
KL=1.1365, wKL=0.0250]
Train E01: 96%|█████████▌| 24/25 [00:43<00:01, 1.78s/batch, N=1.6439, E=0.9437,
KL=1.1365, wKL=0.0250]
Train E01: 96%|█████████▌| 24/25 [00:44<00:01, 1.78s/batch, N=1.7865, E=0.9421,
KL=1.1876, wKL=0.0250]
Train E01: 100%|██████████| 25/25 [00:44<00:00, 1.49s/batch, N=1.7865, E=0.9421,
KL=1.1876, wKL=0.0250]
Train E01: 100%|██████████| 25/25 [00:44<00:00, 1.78s/batch, N=1.7865, E=0.9421,
KL=1.1876, wKL=0.0250]
91.1s 73 [Epoch 001] Total: 5.2451 | N: 4.2366 | E: 0.9979 | KL(0.03×0.5): 0.8388
126.1s 74 Train E02: 0%| | 0/25 [00:00<?, ?batch/s]
Train E02: 0%| | 0/25 [00:01<?, ?batch/s, N=1.7216, E=0.9424, KL=1.2219,
wKL=0.0500]
Train E02: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.7216, E=0.9424,
KL=1.2219, wKL=0.0500]
Train E02: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.6840, E=0.9336,
KL=1.2407, wKL=0.0500]
Train E02: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.6840, E=0.9336,
KL=1.2407, wKL=0.0500]
Train E02: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.6079, E=0.9265,
KL=1.2377, wKL=0.0500]
Train E02: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.6079, E=0.9265,
KL=1.2377, wKL=0.0500]
Train E02: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.7404, E=0.9155,
KL=1.2212, wKL=0.0500]
Train E02: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.7404, E=0.9155,
KL=1.2212, wKL=0.0500]
Train E02: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.6856, E=0.9111,
KL=1.1927, wKL=0.0500]
Train E02: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.6856, E=0.9111,
KL=1.1927, wKL=0.0500]
Train E02: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.6592, E=0.9112,
KL=1.1547, wKL=0.0500]
Train E02: 24%|██▍ | 6/25 [00:08<00:27, 1.46s/batch, N=1.6592, E=0.9112,
KL=1.1547, wKL=0.0500]
Train E02: 24%|██▍ | 6/25 [00:10<00:27, 1.46s/batch, N=1.5871, E=0.9136,
KL=1.1167, wKL=0.0500]
Train E02: 28%|██▊ | 7/25 [00:10<00:26, 1.46s/batch, N=1.5871, E=0.9136,
KL=1.1167, wKL=0.0500]
Train E02: 28%|██▊ | 7/25 [00:11<00:26, 1.46s/batch, N=1.6743, E=0.9214,
KL=1.0801, wKL=0.0500]
Train E02: 32%|███▏ | 8/25 [00:11<00:24, 1.44s/batch, N=1.6743, E=0.9214,
KL=1.0801, wKL=0.0500]
Train E02: 32%|███▏ | 8/25 [00:12<00:24, 1.44s/batch, N=1.5113, E=0.9265,
KL=1.0507, wKL=0.0500]
Train E02: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.5113, E=0.9265,
KL=1.0507, wKL=0.0500]
Train E02: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.5577, E=0.9259,
KL=1.0311, wKL=0.0500]
Train E02: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.5577, E=0.9259,
KL=1.0311, wKL=0.0500]
Train E02: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.5992, E=0.9226,
KL=1.0194, wKL=0.0500]
Train E02: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5992, E=0.9226,
KL=1.0194, wKL=0.0500]
Train E02: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5659, E=0.9180,
KL=1.0132, wKL=0.0500]
Train E02: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.5659, E=0.9180,
KL=1.0132, wKL=0.0500]
Train E02: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.7059, E=0.9148,
KL=1.0194, wKL=0.0500]
Train E02: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.7059, E=0.9148,
KL=1.0194, wKL=0.0500]
Train E02: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5493, E=0.9080,
KL=1.0261, wKL=0.0500]
Train E02: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5493, E=0.9080,
KL=1.0261, wKL=0.0500]
Train E02: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5759, E=0.9085,
KL=1.0412, wKL=0.0500]
Train E02: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.5759, E=0.9085,
KL=1.0412, wKL=0.0500]
Train E02: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.5779, E=0.9083,
KL=1.0570, wKL=0.0500]
Train E02: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5779, E=0.9083,
KL=1.0570, wKL=0.0500]
Train E02: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.5443, E=0.9110,
KL=1.0723, wKL=0.0500]
Train E02: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.5443, E=0.9110,
KL=1.0723, wKL=0.0500]
Train E02: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.4236, E=0.9160,
KL=1.0849, wKL=0.0500]
Train E02: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4236, E=0.9160,
KL=1.0849, wKL=0.0500]
Train E02: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.5439, E=0.9182,
KL=1.0960, wKL=0.0500]
Train E02: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.5439, E=0.9182,
KL=1.0960, wKL=0.0500]
Train E02: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5512, E=0.9174,
KL=1.1042, wKL=0.0500]
Train E02: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.5512, E=0.9174,
KL=1.1042, wKL=0.0500]
Train E02: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5165, E=0.9108,
KL=1.1034, wKL=0.0500]
Train E02: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5165, E=0.9108,
KL=1.1034, wKL=0.0500]
Train E02: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.6609, E=0.9121,
KL=1.1046, wKL=0.0500]
Train E02: 88%|████████▊ | 22/25 [00:30<00:04, 1.40s/batch, N=1.6609, E=0.9121,
KL=1.1046, wKL=0.0500]
Train E02: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5487, E=0.9143,
KL=1.0942, wKL=0.0500]
Train E02: 92%|█████████▏| 23/25 [00:32<00:03, 1.58s/batch, N=1.5487, E=0.9143,
KL=1.0942, wKL=0.0500]
Train E02: 92%|█████████▏| 23/25 [00:34<00:03, 1.58s/batch, N=1.5304, E=0.9107,
KL=1.0848, wKL=0.0500]
Train E02: 96%|█████████▌| 24/25 [00:34<00:01, 1.53s/batch, N=1.5304, E=0.9107,
KL=1.0848, wKL=0.0500]
Train E02: 96%|█████████▌| 24/25 [00:35<00:01, 1.53s/batch, N=1.5503, E=0.9086,
KL=1.0723, wKL=0.0500]
Train E02: 100%|██████████| 25/25 [00:35<00:00, 1.25s/batch, N=1.5503, E=0.9086,
KL=1.0723, wKL=0.0500]
Train E02: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5503, E=0.9086,
KL=1.0723, wKL=0.0500]
126.1s 75 [Epoch 002] Total: 2.5408 | N: 1.5960 | E: 0.9173 | KL(0.05×0.5): 1.1023
160.5s 76 Train E03: 0%| | 0/25 [00:00<?, ?batch/s]
Train E03: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4887, E=0.9101, KL=1.0587,
wKL=0.0750]
Train E03: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4887, E=0.9101,
KL=1.0587, wKL=0.0750]
Train E03: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5675, E=0.9102,
KL=1.0493, wKL=0.0750]
Train E03: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5675, E=0.9102,
KL=1.0493, wKL=0.0750]
Train E03: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.7098, E=0.9097,
KL=1.0406, wKL=0.0750]
Train E03: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.7098, E=0.9097,
KL=1.0406, wKL=0.0750]
Train E03: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5659, E=0.9111,
KL=1.0251, wKL=0.0750]
Train E03: 16%|█▌ | 4/25 [00:05<00:30, 1.45s/batch, N=1.5659, E=0.9111,
KL=1.0251, wKL=0.0750]
Train E03: 16%|█▌ | 4/25 [00:07<00:30, 1.45s/batch, N=1.5190, E=0.9094,
KL=1.0161, wKL=0.0750]
Train E03: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.5190, E=0.9094,
KL=1.0161, wKL=0.0750]
Train E03: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.4910, E=0.9135,
KL=1.0106, wKL=0.0750]
Train E03: 24%|██▍ | 6/25 [00:08<00:27, 1.43s/batch, N=1.4910, E=0.9135,
KL=1.0106, wKL=0.0750]
Train E03: 24%|██▍ | 6/25 [00:09<00:27, 1.43s/batch, N=1.4577, E=0.9107,
KL=1.0042, wKL=0.0750]
Train E03: 28%|██▊ | 7/25 [00:09<00:25, 1.42s/batch, N=1.4577, E=0.9107,
KL=1.0042, wKL=0.0750]
Train E03: 28%|██▊ | 7/25 [00:11<00:25, 1.42s/batch, N=1.4954, E=0.9083,
KL=1.0020, wKL=0.0750]
Train E03: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.4954, E=0.9083,
KL=1.0020, wKL=0.0750]
Train E03: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.4463, E=0.9097,
KL=0.9997, wKL=0.0750]
Train E03: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4463, E=0.9097,
KL=0.9997, wKL=0.0750]
Train E03: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.4520, E=0.9138,
KL=0.9958, wKL=0.0750]
Train E03: 40%|████ | 10/25 [00:14<00:21, 1.40s/batch, N=1.4520, E=0.9138,
KL=0.9958, wKL=0.0750]
Train E03: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.5459, E=0.9066,
KL=0.9952, wKL=0.0750]
Train E03: 44%|████▍ | 11/25 [00:15<00:19, 1.42s/batch, N=1.5459, E=0.9066,
KL=0.9952, wKL=0.0750]
Train E03: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.5385, E=0.9100,
KL=0.9893, wKL=0.0750]
Train E03: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.5385, E=0.9100,
KL=0.9893, wKL=0.0750]
Train E03: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.5819, E=0.9125,
KL=0.9865, wKL=0.0750]
Train E03: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5819, E=0.9125,
KL=0.9865, wKL=0.0750]
Train E03: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.5051, E=0.9091,
KL=0.9792, wKL=0.0750]
Train E03: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5051, E=0.9091,
KL=0.9792, wKL=0.0750]
Train E03: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.6486, E=0.9072,
KL=0.9758, wKL=0.0750]
Train E03: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.6486, E=0.9072,
KL=0.9758, wKL=0.0750]
Train E03: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5611, E=0.9140,
KL=0.9682, wKL=0.0750]
Train E03: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.5611, E=0.9140,
KL=0.9682, wKL=0.0750]
Train E03: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5306, E=0.9119,
KL=0.9608, wKL=0.0750]
Train E03: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5306, E=0.9119,
KL=0.9608, wKL=0.0750]
Train E03: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4701, E=0.9095,
KL=0.9516, wKL=0.0750]
Train E03: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4701, E=0.9095,
KL=0.9516, wKL=0.0750]
Train E03: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.5197, E=0.9085,
KL=0.9443, wKL=0.0750]
Train E03: 76%|███████▌ | 19/25 [00:26<00:08, 1.41s/batch, N=1.5197, E=0.9085,
KL=0.9443, wKL=0.0750]
Train E03: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5303, E=0.9090,
KL=0.9365, wKL=0.0750]
Train E03: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5303, E=0.9090,
KL=0.9365, wKL=0.0750]
Train E03: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.5594, E=0.9114,
KL=0.9334, wKL=0.0750]
Train E03: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5594, E=0.9114,
KL=0.9334, wKL=0.0750]
Train E03: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5389, E=0.9087,
KL=0.9238, wKL=0.0750]
Train E03: 88%|████████▊ | 22/25 [00:30<00:04, 1.41s/batch, N=1.5389, E=0.9087,
KL=0.9238, wKL=0.0750]
Train E03: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5034, E=0.9084,
KL=0.9167, wKL=0.0750]
Train E03: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5034, E=0.9084,
KL=0.9167, wKL=0.0750]
Train E03: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.5665, E=0.9107,
KL=0.9115, wKL=0.0750]
Train E03: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.5665, E=0.9107,
KL=0.9115, wKL=0.0750]
Train E03: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4971, E=0.9087,
KL=0.9031, wKL=0.0750]
Train E03: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4971, E=0.9087,
KL=0.9031, wKL=0.0750]
Train E03: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.4971, E=0.9087,
KL=0.9031, wKL=0.0750]
160.5s 77 [Epoch 003] Total: 2.4794 | N: 1.5324 | E: 0.9101 | KL(0.07×0.5): 0.9809
195.8s 78 Train E04: 0%| | 0/25 [00:00<?, ?batch/s]
Train E04: 0%| | 0/25 [00:02<?, ?batch/s, N=1.4919, E=0.9081, KL=0.8946,
wKL=0.1000]
Train E04: 4%|▍ | 1/25 [00:02<00:50, 2.10s/batch, N=1.4919, E=0.9081,
KL=0.8946, wKL=0.1000]
Train E04: 4%|▍ | 1/25 [00:03<00:50, 2.10s/batch, N=1.5228, E=0.9096,
KL=0.8925, wKL=0.1000]
Train E04: 8%|▊ | 2/25 [00:03<00:40, 1.74s/batch, N=1.5228, E=0.9096,
KL=0.8925, wKL=0.1000]
Train E04: 8%|▊ | 2/25 [00:04<00:40, 1.74s/batch, N=1.6339, E=0.9111,
KL=0.8877, wKL=0.1000]
Train E04: 12%|█▏ | 3/25 [00:04<00:34, 1.58s/batch, N=1.6339, E=0.9111,
KL=0.8877, wKL=0.1000]
Train E04: 12%|█▏ | 3/25 [00:06<00:34, 1.58s/batch, N=1.4708, E=0.9086,
KL=0.8760, wKL=0.1000]
Train E04: 16%|█▌ | 4/25 [00:06<00:32, 1.53s/batch, N=1.4708, E=0.9086,
KL=0.8760, wKL=0.1000]
Train E04: 16%|█▌ | 4/25 [00:07<00:32, 1.53s/batch, N=1.5492, E=0.9121,
KL=0.8724, wKL=0.1000]
Train E04: 20%|██ | 5/25 [00:07<00:29, 1.47s/batch, N=1.5492, E=0.9121,
KL=0.8724, wKL=0.1000]
Train E04: 20%|██ | 5/25 [00:09<00:29, 1.47s/batch, N=1.5148, E=0.9088,
KL=0.8661, wKL=0.1000]
Train E04: 24%|██▍ | 6/25 [00:09<00:27, 1.45s/batch, N=1.5148, E=0.9088,
KL=0.8661, wKL=0.1000]
Train E04: 24%|██▍ | 6/25 [00:10<00:27, 1.45s/batch, N=1.6846, E=0.9087,
KL=0.8655, wKL=0.1000]
Train E04: 28%|██▊ | 7/25 [00:10<00:25, 1.43s/batch, N=1.6846, E=0.9087,
KL=0.8655, wKL=0.1000]
Train E04: 28%|██▊ | 7/25 [00:12<00:25, 1.43s/batch, N=1.4793, E=0.9089,
KL=0.8535, wKL=0.1000]
Train E04: 32%|███▏ | 8/25 [00:12<00:24, 1.44s/batch, N=1.4793, E=0.9089,
KL=0.8535, wKL=0.1000]
Train E04: 32%|███▏ | 8/25 [00:13<00:24, 1.44s/batch, N=1.6417, E=0.9090,
KL=0.8506, wKL=0.1000]
Train E04: 36%|███▌ | 9/25 [00:13<00:22, 1.43s/batch, N=1.6417, E=0.9090,
KL=0.8506, wKL=0.1000]
Train E04: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4429, E=0.9072,
KL=0.8400, wKL=0.1000]
Train E04: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.4429, E=0.9072,
KL=0.8400, wKL=0.1000]
Train E04: 40%|████ | 10/25 [00:16<00:21, 1.42s/batch, N=1.4622, E=0.9084,
KL=0.8355, wKL=0.1000]
Train E04: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4622, E=0.9084,
KL=0.8355, wKL=0.1000]
Train E04: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.5165, E=0.9050,
KL=0.8283, wKL=0.1000]
Train E04: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.5165, E=0.9050,
KL=0.8283, wKL=0.1000]
Train E04: 48%|████▊ | 12/25 [00:19<00:18, 1.41s/batch, N=1.5154, E=0.9093,
KL=0.8258, wKL=0.1000]
Train E04: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5154, E=0.9093,
KL=0.8258, wKL=0.1000]
Train E04: 52%|█████▏ | 13/25 [00:20<00:16, 1.40s/batch, N=1.5523, E=0.9083,
KL=0.8189, wKL=0.1000]
Train E04: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.5523, E=0.9083,
KL=0.8189, wKL=0.1000]
Train E04: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5825, E=0.9117,
KL=0.8147, wKL=0.1000]
Train E04: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.5825, E=0.9117,
KL=0.8147, wKL=0.1000]
Train E04: 60%|██████ | 15/25 [00:23<00:14, 1.40s/batch, N=1.5627, E=0.9093,
KL=0.8064, wKL=0.1000]
Train E04: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5627, E=0.9093,
KL=0.8064, wKL=0.1000]
Train E04: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4209, E=0.9089,
KL=0.7959, wKL=0.1000]
Train E04: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4209, E=0.9089,
KL=0.7959, wKL=0.1000]
Train E04: 68%|██████▊ | 17/25 [00:26<00:11, 1.40s/batch, N=1.4956, E=0.9084,
KL=0.7935, wKL=0.1000]
Train E04: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4956, E=0.9084,
KL=0.7935, wKL=0.1000]
Train E04: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5084, E=0.9093,
KL=0.7862, wKL=0.1000]
Train E04: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5084, E=0.9093,
KL=0.7862, wKL=0.1000]
Train E04: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4661, E=0.9107,
KL=0.7790, wKL=0.1000]
Train E04: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4661, E=0.9107,
KL=0.7790, wKL=0.1000]
Train E04: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.5435, E=0.9080,
KL=0.7766, wKL=0.1000]
Train E04: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5435, E=0.9080,
KL=0.7766, wKL=0.1000]
Train E04: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.6161, E=0.9101,
KL=0.7749, wKL=0.1000]
Train E04: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.6161, E=0.9101,
KL=0.7749, wKL=0.1000]
Train E04: 88%|████████▊ | 22/25 [00:33<00:04, 1.40s/batch, N=1.4536, E=0.9089,
KL=0.7679, wKL=0.1000]
Train E04: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4536, E=0.9089,
KL=0.7679, wKL=0.1000]
Train E04: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4589, E=0.9092,
KL=0.7637, wKL=0.1000]
Train E04: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.4589, E=0.9092,
KL=0.7637, wKL=0.1000]
Train E04: 96%|█████████▌| 24/25 [00:35<00:01, 1.44s/batch, N=1.5813, E=0.9093,
KL=0.7598, wKL=0.1000]
Train E04: 100%|██████████| 25/25 [00:35<00:00, 1.22s/batch, N=1.5813, E=0.9093,
KL=0.7598, wKL=0.1000]
Train E04: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5813, E=0.9093,
KL=0.7598, wKL=0.1000]
195.8s 79 [Epoch 004] Total: 2.4758 | N: 1.5254 | E: 0.9091 | KL(0.10×0.5): 0.8266
230.9s 80 Train E05: 0%| | 0/25 [00:00<?, ?batch/s]
Train E05: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5352, E=0.9091, KL=0.7598,
wKL=0.1250]
Train E05: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5352, E=0.9091,
KL=0.7598, wKL=0.1250]
Train E05: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4937, E=0.9088,
KL=0.7541, wKL=0.1250]
Train E05: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4937, E=0.9088,
KL=0.7541, wKL=0.1250]
Train E05: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5630, E=0.9097,
KL=0.7527, wKL=0.1250]
Train E05: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5630, E=0.9097,
KL=0.7527, wKL=0.1250]
Train E05: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5584, E=0.9069,
KL=0.7492, wKL=0.1250]
Train E05: 16%|█▌ | 4/25 [00:05<00:29, 1.40s/batch, N=1.5584, E=0.9069,
KL=0.7492, wKL=0.1250]
Train E05: 16%|█▌ | 4/25 [00:06<00:29, 1.40s/batch, N=1.4394, E=0.9086,
KL=0.7398, wKL=0.1250]
Train E05: 20%|██ | 5/25 [00:06<00:28, 1.40s/batch, N=1.4394, E=0.9086,
KL=0.7398, wKL=0.1250]
Train E05: 20%|██ | 5/25 [00:09<00:28, 1.40s/batch, N=1.4983, E=0.9105,
KL=0.7357, wKL=0.1250]
Train E05: 24%|██▍ | 6/25 [00:09<00:30, 1.62s/batch, N=1.4983, E=0.9105,
KL=0.7357, wKL=0.1250]
Train E05: 24%|██▍ | 6/25 [00:10<00:30, 1.62s/batch, N=1.5274, E=0.9065,
KL=0.7334, wKL=0.1250]
Train E05: 28%|██▊ | 7/25 [00:10<00:27, 1.54s/batch, N=1.5274, E=0.9065,
KL=0.7334, wKL=0.1250]
Train E05: 28%|██▊ | 7/25 [00:11<00:27, 1.54s/batch, N=1.4989, E=0.9075,
KL=0.7254, wKL=0.1250]
Train E05: 32%|███▏ | 8/25 [00:11<00:25, 1.49s/batch, N=1.4989, E=0.9075,
KL=0.7254, wKL=0.1250]
Train E05: 32%|███▏ | 8/25 [00:13<00:25, 1.49s/batch, N=1.6445, E=0.9059,
KL=0.7254, wKL=0.1250]
Train E05: 36%|███▌ | 9/25 [00:13<00:23, 1.46s/batch, N=1.6445, E=0.9059,
KL=0.7254, wKL=0.1250]
Train E05: 36%|███▌ | 9/25 [00:14<00:23, 1.46s/batch, N=1.5174, E=0.9107,
KL=0.7170, wKL=0.1250]
Train E05: 40%|████ | 10/25 [00:14<00:21, 1.45s/batch, N=1.5174, E=0.9107,
KL=0.7170, wKL=0.1250]
Train E05: 40%|████ | 10/25 [00:15<00:21, 1.45s/batch, N=1.4871, E=0.9095,
KL=0.7117, wKL=0.1250]
Train E05: 44%|████▍ | 11/25 [00:15<00:20, 1.43s/batch, N=1.4871, E=0.9095,
KL=0.7117, wKL=0.1250]
Train E05: 44%|████▍ | 11/25 [00:17<00:20, 1.43s/batch, N=1.4996, E=0.9079,
KL=0.7057, wKL=0.1250]
Train E05: 48%|████▊ | 12/25 [00:17<00:18, 1.42s/batch, N=1.4996, E=0.9079,
KL=0.7057, wKL=0.1250]
Train E05: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.5169, E=0.9118,
KL=0.7030, wKL=0.1250]
Train E05: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5169, E=0.9118,
KL=0.7030, wKL=0.1250]
Train E05: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.5501, E=0.9077,
KL=0.7008, wKL=0.1250]
Train E05: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.5501, E=0.9077,
KL=0.7008, wKL=0.1250]
Train E05: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.6121, E=0.9076,
KL=0.6999, wKL=0.1250]
Train E05: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.6121, E=0.9076,
KL=0.6999, wKL=0.1250]
Train E05: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4994, E=0.9103,
KL=0.6936, wKL=0.1250]
Train E05: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4994, E=0.9103,
KL=0.6936, wKL=0.1250]
Train E05: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4794, E=0.9101,
KL=0.6906, wKL=0.1250]
Train E05: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4794, E=0.9101,
KL=0.6906, wKL=0.1250]
Train E05: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4975, E=0.9085,
KL=0.6867, wKL=0.1250]
Train E05: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4975, E=0.9085,
KL=0.6867, wKL=0.1250]
Train E05: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4718, E=0.9120,
KL=0.6830, wKL=0.1250]
Train E05: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.4718, E=0.9120,
KL=0.6830, wKL=0.1250]
Train E05: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5897, E=0.9087,
KL=0.6813, wKL=0.1250]
Train E05: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5897, E=0.9087,
KL=0.6813, wKL=0.1250]
Train E05: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5195, E=0.9116,
KL=0.6753, wKL=0.1250]
Train E05: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5195, E=0.9116,
KL=0.6753, wKL=0.1250]
Train E05: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5650, E=0.9081,
KL=0.6727, wKL=0.1250]
Train E05: 88%|████████▊ | 22/25 [00:31<00:04, 1.47s/batch, N=1.5650, E=0.9081,
KL=0.6727, wKL=0.1250]
Train E05: 88%|████████▊ | 22/25 [00:32<00:04, 1.47s/batch, N=1.4760, E=0.9096,
KL=0.6654, wKL=0.1250]
Train E05: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.4760, E=0.9096,
KL=0.6654, wKL=0.1250]
Train E05: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.5494, E=0.9055,
KL=0.6638, wKL=0.1250]
Train E05: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.5494, E=0.9055,
KL=0.6638, wKL=0.1250]
Train E05: 96%|█████████▌| 24/25 [00:35<00:01, 1.43s/batch, N=1.3126, E=0.9043,
KL=0.6551, wKL=0.1250]
Train E05: 100%|██████████| 25/25 [00:35<00:00, 1.20s/batch, N=1.3126, E=0.9043,
KL=0.6551, wKL=0.1250]
Train E05: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.3126, E=0.9043,
KL=0.6551, wKL=0.1250]
230.9s 81 [Epoch 005] Total: 2.4741 | N: 1.5210 | E: 0.9088 | KL(0.12×0.5): 0.7085
266.1s 82 Train E06: 0%| | 0/25 [00:00<?, ?batch/s]
Train E06: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5278, E=0.9074, KL=0.6585,
wKL=0.1500]
Train E06: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5278, E=0.9074,
KL=0.6585, wKL=0.1500]
Train E06: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.5546, E=0.9078,
KL=0.6563, wKL=0.1500]
Train E06: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5546, E=0.9078,
KL=0.6563, wKL=0.1500]
Train E06: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.4643, E=0.9064,
KL=0.6520, wKL=0.1500]
Train E06: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4643, E=0.9064,
KL=0.6520, wKL=0.1500]
Train E06: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.6365, E=0.9066,
KL=0.6535, wKL=0.1500]
Train E06: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.6365, E=0.9066,
KL=0.6535, wKL=0.1500]
Train E06: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5112, E=0.9068,
KL=0.6485, wKL=0.1500]
Train E06: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5112, E=0.9068,
KL=0.6485, wKL=0.1500]
Train E06: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5094, E=0.9100,
KL=0.6439, wKL=0.1500]
Train E06: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5094, E=0.9100,
KL=0.6439, wKL=0.1500]
Train E06: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.6024, E=0.9074,
KL=0.6449, wKL=0.1500]
Train E06: 28%|██▊ | 7/25 [00:09<00:25, 1.43s/batch, N=1.6024, E=0.9074,
KL=0.6449, wKL=0.1500]
Train E06: 28%|██▊ | 7/25 [00:11<00:25, 1.43s/batch, N=1.5592, E=0.9093,
KL=0.6426, wKL=0.1500]
Train E06: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5592, E=0.9093,
KL=0.6426, wKL=0.1500]
Train E06: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.5442, E=0.9066,
KL=0.6378, wKL=0.1500]
Train E06: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.5442, E=0.9066,
KL=0.6378, wKL=0.1500]
Train E06: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5272, E=0.9080,
KL=0.6323, wKL=0.1500]
Train E06: 40%|████ | 10/25 [00:14<00:23, 1.60s/batch, N=1.5272, E=0.9080,
KL=0.6323, wKL=0.1500]
Train E06: 40%|████ | 10/25 [00:16<00:23, 1.60s/batch, N=1.5376, E=0.9090,
KL=0.6292, wKL=0.1500]
Train E06: 44%|████▍ | 11/25 [00:16<00:21, 1.53s/batch, N=1.5376, E=0.9090,
KL=0.6292, wKL=0.1500]
Train E06: 44%|████▍ | 11/25 [00:17<00:21, 1.53s/batch, N=1.5777, E=0.9072,
KL=0.6285, wKL=0.1500]
Train E06: 48%|████▊ | 12/25 [00:17<00:19, 1.50s/batch, N=1.5777, E=0.9072,
KL=0.6285, wKL=0.1500]
Train E06: 48%|████▊ | 12/25 [00:18<00:19, 1.50s/batch, N=1.5136, E=0.9070,
KL=0.6209, wKL=0.1500]
Train E06: 52%|█████▏ | 13/25 [00:18<00:17, 1.46s/batch, N=1.5136, E=0.9070,
KL=0.6209, wKL=0.1500]
Train E06: 52%|█████▏ | 13/25 [00:20<00:17, 1.46s/batch, N=1.4749, E=0.9081,
KL=0.6189, wKL=0.1500]
Train E06: 56%|█████▌ | 14/25 [00:20<00:16, 1.46s/batch, N=1.4749, E=0.9081,
KL=0.6189, wKL=0.1500]
Train E06: 56%|█████▌ | 14/25 [00:21<00:16, 1.46s/batch, N=1.5319, E=0.9099,
KL=0.6171, wKL=0.1500]
Train E06: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.5319, E=0.9099,
KL=0.6171, wKL=0.1500]
Train E06: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.5148, E=0.9138,
KL=0.6159, wKL=0.1500]
Train E06: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.5148, E=0.9138,
KL=0.6159, wKL=0.1500]
Train E06: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.5262, E=0.9109,
KL=0.6101, wKL=0.1500]
Train E06: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.5262, E=0.9109,
KL=0.6101, wKL=0.1500]
Train E06: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.4758, E=0.9099,
KL=0.6075, wKL=0.1500]
Train E06: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4758, E=0.9099,
KL=0.6075, wKL=0.1500]
Train E06: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.4526, E=0.9077,
KL=0.6067, wKL=0.1500]
Train E06: 76%|███████▌ | 19/25 [00:27<00:08, 1.46s/batch, N=1.4526, E=0.9077,
KL=0.6067, wKL=0.1500]
Train E06: 76%|███████▌ | 19/25 [00:28<00:08, 1.46s/batch, N=1.5113, E=0.9078,
KL=0.6046, wKL=0.1500]
Train E06: 80%|████████ | 20/25 [00:28<00:07, 1.44s/batch, N=1.5113, E=0.9078,
KL=0.6046, wKL=0.1500]
Train E06: 80%|████████ | 20/25 [00:30<00:07, 1.44s/batch, N=1.4894, E=0.9093,
KL=0.6019, wKL=0.1500]
Train E06: 84%|████████▍ | 21/25 [00:30<00:05, 1.44s/batch, N=1.4894, E=0.9093,
KL=0.6019, wKL=0.1500]
Train E06: 84%|████████▍ | 21/25 [00:31<00:05, 1.44s/batch, N=1.4704, E=0.9106,
KL=0.6000, wKL=0.1500]
Train E06: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.4704, E=0.9106,
KL=0.6000, wKL=0.1500]
Train E06: 88%|████████▊ | 22/25 [00:33<00:04, 1.44s/batch, N=1.4772, E=0.9106,
KL=0.5981, wKL=0.1500]
Train E06: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4772, E=0.9106,
KL=0.5981, wKL=0.1500]
Train E06: 92%|█████████▏| 23/25 [00:34<00:02, 1.42s/batch, N=1.4279, E=0.9098,
KL=0.5927, wKL=0.1500]
Train E06: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4279, E=0.9098,
KL=0.5927, wKL=0.1500]
Train E06: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.5154, E=0.9113,
KL=0.5940, wKL=0.1500]
Train E06: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.5154, E=0.9113,
KL=0.5940, wKL=0.1500]
Train E06: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5154, E=0.9113,
KL=0.5940, wKL=0.1500]
266.1s 83 [Epoch 006] Total: 2.4730 | N: 1.5174 | E: 0.9087 | KL(0.15×0.5): 0.6254
301.2s 84 Train E07: 0%| | 0/25 [00:00<?, ?batch/s]
Train E07: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5009, E=0.9096, KL=0.5899,
wKL=0.1750]
Train E07: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5009, E=0.9096,
KL=0.5899, wKL=0.1750]
Train E07: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5950, E=0.9089,
KL=0.5922, wKL=0.1750]
Train E07: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.5950, E=0.9089,
KL=0.5922, wKL=0.1750]
Train E07: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4751, E=0.9052,
KL=0.5861, wKL=0.1750]
Train E07: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4751, E=0.9052,
KL=0.5861, wKL=0.1750]
Train E07: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5119, E=0.9115,
KL=0.5841, wKL=0.1750]
Train E07: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5119, E=0.9115,
KL=0.5841, wKL=0.1750]
Train E07: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4745, E=0.9074,
KL=0.5823, wKL=0.1750]
Train E07: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4745, E=0.9074,
KL=0.5823, wKL=0.1750]
Train E07: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4651, E=0.9084,
KL=0.5792, wKL=0.1750]
Train E07: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4651, E=0.9084,
KL=0.5792, wKL=0.1750]
Train E07: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5603, E=0.9081,
KL=0.5796, wKL=0.1750]
Train E07: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.5603, E=0.9081,
KL=0.5796, wKL=0.1750]
Train E07: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.5033, E=0.9087,
KL=0.5755, wKL=0.1750]
Train E07: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5033, E=0.9087,
KL=0.5755, wKL=0.1750]
Train E07: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.5339, E=0.9093,
KL=0.5742, wKL=0.1750]
Train E07: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5339, E=0.9093,
KL=0.5742, wKL=0.1750]
Train E07: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5276, E=0.9080,
KL=0.5712, wKL=0.1750]
Train E07: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5276, E=0.9080,
KL=0.5712, wKL=0.1750]
Train E07: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5078, E=0.9102,
KL=0.5665, wKL=0.1750]
Train E07: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5078, E=0.9102,
KL=0.5665, wKL=0.1750]
Train E07: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4465, E=0.9064,
KL=0.5595, wKL=0.1750]
Train E07: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4465, E=0.9064,
KL=0.5595, wKL=0.1750]
Train E07: 48%|████▊ | 12/25 [00:18<00:17, 1.38s/batch, N=1.4720, E=0.9075,
KL=0.5578, wKL=0.1750]
Train E07: 52%|█████▏ | 13/25 [00:18<00:18, 1.58s/batch, N=1.4720, E=0.9075,
KL=0.5578, wKL=0.1750]
Train E07: 52%|█████▏ | 13/25 [00:20<00:18, 1.58s/batch, N=1.5465, E=0.9065,
KL=0.5582, wKL=0.1750]
Train E07: 56%|█████▌ | 14/25 [00:20<00:16, 1.54s/batch, N=1.5465, E=0.9065,
KL=0.5582, wKL=0.1750]
Train E07: 56%|█████▌ | 14/25 [00:21<00:16, 1.54s/batch, N=1.5190, E=0.9098,
KL=0.5540, wKL=0.1750]
Train E07: 60%|██████ | 15/25 [00:21<00:15, 1.51s/batch, N=1.5190, E=0.9098,
KL=0.5540, wKL=0.1750]
Train E07: 60%|██████ | 15/25 [00:22<00:15, 1.51s/batch, N=1.4534, E=0.9071,
KL=0.5502, wKL=0.1750]
Train E07: 64%|██████▍ | 16/25 [00:22<00:13, 1.48s/batch, N=1.4534, E=0.9071,
KL=0.5502, wKL=0.1750]
Train E07: 64%|██████▍ | 16/25 [00:24<00:13, 1.48s/batch, N=1.5150, E=0.9059,
KL=0.5491, wKL=0.1750]
Train E07: 68%|██████▊ | 17/25 [00:24<00:12, 1.54s/batch, N=1.5150, E=0.9059,
KL=0.5491, wKL=0.1750]
Train E07: 68%|██████▊ | 17/25 [00:26<00:12, 1.54s/batch, N=1.5505, E=0.9097,
KL=0.5493, wKL=0.1750]
Train E07: 72%|███████▏ | 18/25 [00:26<00:10, 1.50s/batch, N=1.5505, E=0.9097,
KL=0.5493, wKL=0.1750]
Train E07: 72%|███████▏ | 18/25 [00:27<00:10, 1.50s/batch, N=1.6138, E=0.9095,
KL=0.5517, wKL=0.1750]
Train E07: 76%|███████▌ | 19/25 [00:27<00:08, 1.48s/batch, N=1.6138, E=0.9095,
KL=0.5517, wKL=0.1750]
Train E07: 76%|███████▌ | 19/25 [00:28<00:08, 1.48s/batch, N=1.4713, E=0.9096,
KL=0.5429, wKL=0.1750]
Train E07: 80%|████████ | 20/25 [00:28<00:07, 1.46s/batch, N=1.4713, E=0.9096,
KL=0.5429, wKL=0.1750]
Train E07: 80%|████████ | 20/25 [00:30<00:07, 1.46s/batch, N=1.5617, E=0.9105,
KL=0.5440, wKL=0.1750]
Train E07: 84%|████████▍ | 21/25 [00:30<00:05, 1.44s/batch, N=1.5617, E=0.9105,
KL=0.5440, wKL=0.1750]
Train E07: 84%|████████▍ | 21/25 [00:31<00:05, 1.44s/batch, N=1.5187, E=0.9084,
KL=0.5384, wKL=0.1750]
Train E07: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.5187, E=0.9084,
KL=0.5384, wKL=0.1750]
Train E07: 88%|████████▊ | 22/25 [00:33<00:04, 1.44s/batch, N=1.5278, E=0.9096,
KL=0.5393, wKL=0.1750]
Train E07: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.5278, E=0.9096,
KL=0.5393, wKL=0.1750]
Train E07: 92%|█████████▏| 23/25 [00:34<00:02, 1.42s/batch, N=1.4801, E=0.9059,
KL=0.5331, wKL=0.1750]
Train E07: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4801, E=0.9059,
KL=0.5331, wKL=0.1750]
Train E07: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.5016, E=0.9108,
KL=0.5318, wKL=0.1750]
Train E07: 100%|██████████| 25/25 [00:35<00:00, 1.18s/batch, N=1.5016, E=0.9108,
KL=0.5318, wKL=0.1750]
Train E07: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5016, E=0.9108,
KL=0.5318, wKL=0.1750]
301.2s 85 [Epoch 007] Total: 2.4713 | N: 1.5136 | E: 0.9084 | KL(0.17×0.5): 0.5623
336.5s 86 Train E08: 0%| | 0/25 [00:00<?, ?batch/s]
Train E08: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5724, E=0.9097, KL=0.5331,
wKL=0.2000]
Train E08: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5724, E=0.9097,
KL=0.5331, wKL=0.2000]
Train E08: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5567, E=0.9082,
KL=0.5304, wKL=0.2000]
Train E08: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5567, E=0.9082,
KL=0.5304, wKL=0.2000]
Train E08: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4774, E=0.9098,
KL=0.5264, wKL=0.2000]
Train E08: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4774, E=0.9098,
KL=0.5264, wKL=0.2000]
Train E08: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4698, E=0.9063,
KL=0.5226, wKL=0.2000]
Train E08: 16%|█▌ | 4/25 [00:05<00:29, 1.40s/batch, N=1.4698, E=0.9063,
KL=0.5226, wKL=0.2000]
Train E08: 16%|█▌ | 4/25 [00:06<00:29, 1.40s/batch, N=1.5310, E=0.9075,
KL=0.5223, wKL=0.2000]
Train E08: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5310, E=0.9075,
KL=0.5223, wKL=0.2000]
Train E08: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.5091, E=0.9071,
KL=0.5220, wKL=0.2000]
Train E08: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5091, E=0.9071,
KL=0.5220, wKL=0.2000]
Train E08: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5232, E=0.9106,
KL=0.5167, wKL=0.2000]
Train E08: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.5232, E=0.9106,
KL=0.5167, wKL=0.2000]
Train E08: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.4647, E=0.9099,
KL=0.5146, wKL=0.2000]
Train E08: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4647, E=0.9099,
KL=0.5146, wKL=0.2000]
Train E08: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5403, E=0.9082,
KL=0.5144, wKL=0.2000]
Train E08: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5403, E=0.9082,
KL=0.5144, wKL=0.2000]
Train E08: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.5633, E=0.9072,
KL=0.5148, wKL=0.2000]
Train E08: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.5633, E=0.9072,
KL=0.5148, wKL=0.2000]
Train E08: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.5001, E=0.9083,
KL=0.5095, wKL=0.2000]
Train E08: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.5001, E=0.9083,
KL=0.5095, wKL=0.2000]
Train E08: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.5121, E=0.9070,
KL=0.5085, wKL=0.2000]
Train E08: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.5121, E=0.9070,
KL=0.5085, wKL=0.2000]
Train E08: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5549, E=0.9069,
KL=0.5061, wKL=0.2000]
Train E08: 52%|█████▏ | 13/25 [00:18<00:16, 1.42s/batch, N=1.5549, E=0.9069,
KL=0.5061, wKL=0.2000]
Train E08: 52%|█████▏ | 13/25 [00:19<00:16, 1.42s/batch, N=1.5448, E=0.9084,
KL=0.5035, wKL=0.2000]
Train E08: 56%|█████▌ | 14/25 [00:19<00:15, 1.43s/batch, N=1.5448, E=0.9084,
KL=0.5035, wKL=0.2000]
Train E08: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.5042, E=0.9080,
KL=0.5001, wKL=0.2000]
Train E08: 60%|██████ | 15/25 [00:21<00:14, 1.46s/batch, N=1.5042, E=0.9080,
KL=0.5001, wKL=0.2000]
Train E08: 60%|██████ | 15/25 [00:22<00:14, 1.46s/batch, N=1.4334, E=0.9122,
KL=0.4966, wKL=0.2000]
Train E08: 64%|██████▍ | 16/25 [00:22<00:13, 1.45s/batch, N=1.4334, E=0.9122,
KL=0.4966, wKL=0.2000]
Train E08: 64%|██████▍ | 16/25 [00:24<00:13, 1.45s/batch, N=1.4171, E=0.9093,
KL=0.4909, wKL=0.2000]
Train E08: 68%|██████▊ | 17/25 [00:24<00:13, 1.64s/batch, N=1.4171, E=0.9093,
KL=0.4909, wKL=0.2000]
Train E08: 68%|██████▊ | 17/25 [00:26<00:13, 1.64s/batch, N=1.5226, E=0.9073,
KL=0.4930, wKL=0.2000]
Train E08: 72%|███████▏ | 18/25 [00:26<00:11, 1.57s/batch, N=1.5226, E=0.9073,
KL=0.4930, wKL=0.2000]
Train E08: 72%|███████▏ | 18/25 [00:27<00:11, 1.57s/batch, N=1.5728, E=0.9074,
KL=0.4952, wKL=0.2000]
Train E08: 76%|███████▌ | 19/25 [00:27<00:09, 1.52s/batch, N=1.5728, E=0.9074,
KL=0.4952, wKL=0.2000]
Train E08: 76%|███████▌ | 19/25 [00:28<00:09, 1.52s/batch, N=1.5308, E=0.9089,
KL=0.4907, wKL=0.2000]
Train E08: 80%|████████ | 20/25 [00:28<00:07, 1.48s/batch, N=1.5308, E=0.9089,
KL=0.4907, wKL=0.2000]
Train E08: 80%|████████ | 20/25 [00:30<00:07, 1.48s/batch, N=1.4863, E=0.9056,
KL=0.4877, wKL=0.2000]
Train E08: 84%|████████▍ | 21/25 [00:30<00:05, 1.47s/batch, N=1.4863, E=0.9056,
KL=0.4877, wKL=0.2000]
Train E08: 84%|████████▍ | 21/25 [00:31<00:05, 1.47s/batch, N=1.5131, E=0.9081,
KL=0.4868, wKL=0.2000]
Train E08: 88%|████████▊ | 22/25 [00:31<00:04, 1.45s/batch, N=1.5131, E=0.9081,
KL=0.4868, wKL=0.2000]
Train E08: 88%|████████▊ | 22/25 [00:33<00:04, 1.45s/batch, N=1.5669, E=0.9091,
KL=0.4863, wKL=0.2000]
Train E08: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.5669, E=0.9091,
KL=0.4863, wKL=0.2000]
Train E08: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.5075, E=0.9076,
KL=0.4836, wKL=0.2000]
Train E08: 96%|█████████▌| 24/25 [00:34<00:01, 1.47s/batch, N=1.5075, E=0.9076,
KL=0.4836, wKL=0.2000]
Train E08: 96%|█████████▌| 24/25 [00:35<00:01, 1.47s/batch, N=1.3417, E=0.9077,
KL=0.4760, wKL=0.2000]
Train E08: 100%|██████████| 25/25 [00:35<00:00, 1.21s/batch, N=1.3417, E=0.9077,
KL=0.4760, wKL=0.2000]
Train E08: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.3417, E=0.9077,
KL=0.4760, wKL=0.2000]
336.5s 87 [Epoch 008] Total: 2.4715 | N: 1.5127 | E: 0.9083 | KL(0.20×0.5): 0.5060
371.7s 88 Train E09: 0%| | 0/25 [00:00<?, ?batch/s]
Train E09: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4321, E=0.9076, KL=0.4758,
wKL=0.2250]
Train E09: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4321, E=0.9076,
KL=0.4758, wKL=0.2250]
Train E09: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4787, E=0.9071,
KL=0.4755, wKL=0.2250]
Train E09: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4787, E=0.9071,
KL=0.4755, wKL=0.2250]
Train E09: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.5512, E=0.9064,
KL=0.4755, wKL=0.2250]
Train E09: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5512, E=0.9064,
KL=0.4755, wKL=0.2250]
Train E09: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5518, E=0.9114,
KL=0.4766, wKL=0.2250]
Train E09: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5518, E=0.9114,
KL=0.4766, wKL=0.2250]
Train E09: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5973, E=0.9080,
KL=0.4739, wKL=0.2250]
Train E09: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5973, E=0.9080,
KL=0.4739, wKL=0.2250]
Train E09: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.4820, E=0.9074,
KL=0.4710, wKL=0.2250]
Train E09: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4820, E=0.9074,
KL=0.4710, wKL=0.2250]
Train E09: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.4524, E=0.9067,
KL=0.4678, wKL=0.2250]
Train E09: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4524, E=0.9067,
KL=0.4678, wKL=0.2250]
Train E09: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.5318, E=0.9083,
KL=0.4692, wKL=0.2250]
Train E09: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5318, E=0.9083,
KL=0.4692, wKL=0.2250]
Train E09: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4136, E=0.9061,
KL=0.4603, wKL=0.2250]
Train E09: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4136, E=0.9061,
KL=0.4603, wKL=0.2250]
Train E09: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5337, E=0.9092,
KL=0.4624, wKL=0.2250]
Train E09: 40%|████ | 10/25 [00:13<00:21, 1.41s/batch, N=1.5337, E=0.9092,
KL=0.4624, wKL=0.2250]
Train E09: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.5145, E=0.9106,
KL=0.4593, wKL=0.2250]
Train E09: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5145, E=0.9106,
KL=0.4593, wKL=0.2250]
Train E09: 44%|████▍ | 11/25 [00:17<00:19, 1.41s/batch, N=1.5478, E=0.9084,
KL=0.4569, wKL=0.2250]
Train E09: 48%|████▊ | 12/25 [00:17<00:19, 1.50s/batch, N=1.5478, E=0.9084,
KL=0.4569, wKL=0.2250]
Train E09: 48%|████▊ | 12/25 [00:18<00:19, 1.50s/batch, N=1.3977, E=0.9088,
KL=0.4489, wKL=0.2250]
Train E09: 52%|█████▏ | 13/25 [00:18<00:17, 1.48s/batch, N=1.3977, E=0.9088,
KL=0.4489, wKL=0.2250]
Train E09: 52%|█████▏ | 13/25 [00:19<00:17, 1.48s/batch, N=1.5172, E=0.9097,
KL=0.4514, wKL=0.2250]
Train E09: 56%|█████▌ | 14/25 [00:19<00:15, 1.45s/batch, N=1.5172, E=0.9097,
KL=0.4514, wKL=0.2250]
Train E09: 56%|█████▌ | 14/25 [00:21<00:15, 1.45s/batch, N=1.6038, E=0.9063,
KL=0.4527, wKL=0.2250]
Train E09: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.6038, E=0.9063,
KL=0.4527, wKL=0.2250]
Train E09: 60%|██████ | 15/25 [00:22<00:14, 1.44s/batch, N=1.5834, E=0.9080,
KL=0.4499, wKL=0.2250]
Train E09: 64%|██████▍ | 16/25 [00:22<00:12, 1.43s/batch, N=1.5834, E=0.9080,
KL=0.4499, wKL=0.2250]
Train E09: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.6092, E=0.9077,
KL=0.4526, wKL=0.2250]
Train E09: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.6092, E=0.9077,
KL=0.4526, wKL=0.2250]
Train E09: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.5172, E=0.9066,
KL=0.4459, wKL=0.2250]
Train E09: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.5172, E=0.9066,
KL=0.4459, wKL=0.2250]
Train E09: 72%|███████▏ | 18/25 [00:27<00:10, 1.44s/batch, N=1.3617, E=0.9073,
KL=0.4378, wKL=0.2250]
Train E09: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.3617, E=0.9073,
KL=0.4378, wKL=0.2250]
Train E09: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5348, E=0.9109,
KL=0.4392, wKL=0.2250]
Train E09: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5348, E=0.9109,
KL=0.4392, wKL=0.2250]
Train E09: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.5045, E=0.9064,
KL=0.4363, wKL=0.2250]
Train E09: 84%|████████▍ | 21/25 [00:30<00:06, 1.59s/batch, N=1.5045, E=0.9064,
KL=0.4363, wKL=0.2250]
Train E09: 84%|████████▍ | 21/25 [00:31<00:06, 1.59s/batch, N=1.4986, E=0.9074,
KL=0.4331, wKL=0.2250]
Train E09: 88%|████████▊ | 22/25 [00:31<00:04, 1.53s/batch, N=1.4986, E=0.9074,
KL=0.4331, wKL=0.2250]
Train E09: 88%|████████▊ | 22/25 [00:33<00:04, 1.53s/batch, N=1.5344, E=0.9078,
KL=0.4335, wKL=0.2250]
Train E09: 92%|█████████▏| 23/25 [00:33<00:02, 1.49s/batch, N=1.5344, E=0.9078,
KL=0.4335, wKL=0.2250]
Train E09: 92%|█████████▏| 23/25 [00:34<00:02, 1.49s/batch, N=1.5347, E=0.9076,
KL=0.4352, wKL=0.2250]
Train E09: 96%|█████████▌| 24/25 [00:34<00:01, 1.46s/batch, N=1.5347, E=0.9076,
KL=0.4352, wKL=0.2250]
Train E09: 96%|█████████▌| 24/25 [00:35<00:01, 1.46s/batch, N=1.5151, E=0.9070,
KL=0.4336, wKL=0.2250]
Train E09: 100%|██████████| 25/25 [00:35<00:00, 1.21s/batch, N=1.5151, E=0.9070,
KL=0.4336, wKL=0.2250]
Train E09: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5151, E=0.9070,
KL=0.4336, wKL=0.2250]
371.7s 89 [Epoch 009] Total: 2.4711 | N: 1.5119 | E: 0.9080 | KL(0.23×0.5): 0.4555
406.2s 90 Train E10: 0%| | 0/25 [00:00<?, ?batch/s]
Train E10: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4690, E=0.9054, KL=0.4296,
wKL=0.2500]
Train E10: 4%|▍ | 1/25 [00:01<00:33, 1.40s/batch, N=1.4690, E=0.9054,
KL=0.4296, wKL=0.2500]
Train E10: 4%|▍ | 1/25 [00:02<00:33, 1.40s/batch, N=1.4634, E=0.9072,
KL=0.4265, wKL=0.2500]
Train E10: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4634, E=0.9072,
KL=0.4265, wKL=0.2500]
Train E10: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4797, E=0.9109,
KL=0.4262, wKL=0.2500]
Train E10: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4797, E=0.9109,
KL=0.4262, wKL=0.2500]
Train E10: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5110, E=0.9097,
KL=0.4245, wKL=0.2500]
Train E10: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5110, E=0.9097,
KL=0.4245, wKL=0.2500]
Train E10: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5853, E=0.9086,
KL=0.4267, wKL=0.2500]
Train E10: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5853, E=0.9086,
KL=0.4267, wKL=0.2500]
Train E10: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.6000, E=0.9051,
KL=0.4240, wKL=0.2500]
Train E10: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.6000, E=0.9051,
KL=0.4240, wKL=0.2500]
Train E10: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4176, E=0.9059,
KL=0.4163, wKL=0.2500]
Train E10: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4176, E=0.9059,
KL=0.4163, wKL=0.2500]
Train E10: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.4576, E=0.9106,
KL=0.4187, wKL=0.2500]
Train E10: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4576, E=0.9106,
KL=0.4187, wKL=0.2500]
Train E10: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5136, E=0.9086,
KL=0.4176, wKL=0.2500]
Train E10: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.5136, E=0.9086,
KL=0.4176, wKL=0.2500]
Train E10: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.5597, E=0.9090,
KL=0.4163, wKL=0.2500]
Train E10: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.5597, E=0.9090,
KL=0.4163, wKL=0.2500]
Train E10: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.5447, E=0.9070,
KL=0.4131, wKL=0.2500]
Train E10: 44%|████▍ | 11/25 [00:15<00:20, 1.45s/batch, N=1.5447, E=0.9070,
KL=0.4131, wKL=0.2500]
Train E10: 44%|████▍ | 11/25 [00:17<00:20, 1.45s/batch, N=1.4955, E=0.9083,
KL=0.4092, wKL=0.2500]
Train E10: 48%|████▊ | 12/25 [00:17<00:18, 1.43s/batch, N=1.4955, E=0.9083,
KL=0.4092, wKL=0.2500]
Train E10: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.4767, E=0.9067,
KL=0.4062, wKL=0.2500]
Train E10: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4767, E=0.9067,
KL=0.4062, wKL=0.2500]
Train E10: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.4844, E=0.9062,
KL=0.4063, wKL=0.2500]
Train E10: 56%|█████▌ | 14/25 [00:19<00:15, 1.42s/batch, N=1.4844, E=0.9062,
KL=0.4063, wKL=0.2500]
Train E10: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.5586, E=0.9089,
KL=0.4069, wKL=0.2500]
Train E10: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5586, E=0.9089,
KL=0.4069, wKL=0.2500]
Train E10: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.5103, E=0.9074,
KL=0.4062, wKL=0.2500]
Train E10: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.5103, E=0.9074,
KL=0.4062, wKL=0.2500]
Train E10: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5587, E=0.9036,
KL=0.4040, wKL=0.2500]
Train E10: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5587, E=0.9036,
KL=0.4040, wKL=0.2500]
Train E10: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.5260, E=0.9070,
KL=0.4021, wKL=0.2500]
Train E10: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5260, E=0.9070,
KL=0.4021, wKL=0.2500]
Train E10: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4618, E=0.9084,
KL=0.3976, wKL=0.2500]
Train E10: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4618, E=0.9084,
KL=0.3976, wKL=0.2500]
Train E10: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5870, E=0.9060,
KL=0.3978, wKL=0.2500]
Train E10: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5870, E=0.9060,
KL=0.3978, wKL=0.2500]
Train E10: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.5556, E=0.9082,
KL=0.3980, wKL=0.2500]
Train E10: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5556, E=0.9082,
KL=0.3980, wKL=0.2500]
Train E10: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5588, E=0.9043,
KL=0.3977, wKL=0.2500]
Train E10: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5588, E=0.9043,
KL=0.3977, wKL=0.2500]
Train E10: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4508, E=0.9054,
KL=0.3936, wKL=0.2500]
Train E10: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4508, E=0.9054,
KL=0.3936, wKL=0.2500]
Train E10: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4932, E=0.9082,
KL=0.3970, wKL=0.2500]
Train E10: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.4932, E=0.9082,
KL=0.3970, wKL=0.2500]
Train E10: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.3929, E=0.9011,
KL=0.3913, wKL=0.2500]
Train E10: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.3929, E=0.9011,
KL=0.3913, wKL=0.2500]
Train E10: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.3929, E=0.9011,
KL=0.3913, wKL=0.2500]
406.2s 91 [Epoch 010] Total: 2.4698 | N: 1.5113 | E: 0.9073 | KL(0.25×0.5): 0.4106
406.2s 92 Saved checkpoint: /kaggle/working/checkpoints/gvae_10_epoch010.pt
441.4s 93 Train E11: 0%| | 0/25 [00:00<?, ?batch/s]
Train E11: 0%| | 0/25 [00:02<?, ?batch/s, N=1.5345, E=0.9069, KL=0.3948,
wKL=0.2750]
Train E11: 4%|▍ | 1/25 [00:02<00:48, 2.00s/batch, N=1.5345, E=0.9069,
KL=0.3948, wKL=0.2750]
Train E11: 4%|▍ | 1/25 [00:03<00:48, 2.00s/batch, N=1.4803, E=0.9087,
KL=0.3910, wKL=0.2750]
Train E11: 8%|▊ | 2/25 [00:03<00:37, 1.63s/batch, N=1.4803, E=0.9087,
KL=0.3910, wKL=0.2750]
Train E11: 8%|▊ | 2/25 [00:04<00:37, 1.63s/batch, N=1.5300, E=0.9080,
KL=0.3887, wKL=0.2750]
Train E11: 12%|█▏ | 3/25 [00:04<00:34, 1.55s/batch, N=1.5300, E=0.9080,
KL=0.3887, wKL=0.2750]
Train E11: 12%|█▏ | 3/25 [00:06<00:34, 1.55s/batch, N=1.4870, E=0.9037,
KL=0.3850, wKL=0.2750]
Train E11: 16%|█▌ | 4/25 [00:06<00:30, 1.47s/batch, N=1.4870, E=0.9037,
KL=0.3850, wKL=0.2750]
Train E11: 16%|█▌ | 4/25 [00:07<00:30, 1.47s/batch, N=1.5864, E=0.9059,
KL=0.3862, wKL=0.2750]
Train E11: 20%|██ | 5/25 [00:07<00:28, 1.44s/batch, N=1.5864, E=0.9059,
KL=0.3862, wKL=0.2750]
Train E11: 20%|██ | 5/25 [00:08<00:28, 1.44s/batch, N=1.5561, E=0.9089,
KL=0.3830, wKL=0.2750]
Train E11: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.5561, E=0.9089,
KL=0.3830, wKL=0.2750]
Train E11: 24%|██▍ | 6/25 [00:10<00:26, 1.42s/batch, N=1.4536, E=0.9064,
KL=0.3812, wKL=0.2750]
Train E11: 28%|██▊ | 7/25 [00:10<00:26, 1.49s/batch, N=1.4536, E=0.9064,
KL=0.3812, wKL=0.2750]
Train E11: 28%|██▊ | 7/25 [00:12<00:26, 1.49s/batch, N=1.5450, E=0.9063,
KL=0.3824, wKL=0.2750]
Train E11: 32%|███▏ | 8/25 [00:12<00:25, 1.47s/batch, N=1.5450, E=0.9063,
KL=0.3824, wKL=0.2750]
Train E11: 32%|███▏ | 8/25 [00:13<00:25, 1.47s/batch, N=1.4679, E=0.9051,
KL=0.3778, wKL=0.2750]
Train E11: 36%|███▌ | 9/25 [00:13<00:23, 1.45s/batch, N=1.4679, E=0.9051,
KL=0.3778, wKL=0.2750]
Train E11: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.5206, E=0.9088,
KL=0.3784, wKL=0.2750]
Train E11: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.5206, E=0.9088,
KL=0.3784, wKL=0.2750]
Train E11: 40%|████ | 10/25 [00:16<00:21, 1.46s/batch, N=1.4415, E=0.9062,
KL=0.3729, wKL=0.2750]
Train E11: 44%|████▍ | 11/25 [00:16<00:20, 1.44s/batch, N=1.4415, E=0.9062,
KL=0.3729, wKL=0.2750]
Train E11: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.4638, E=0.9084,
KL=0.3720, wKL=0.2750]
Train E11: 48%|████▊ | 12/25 [00:17<00:18, 1.42s/batch, N=1.4638, E=0.9084,
KL=0.3720, wKL=0.2750]
Train E11: 48%|████▊ | 12/25 [00:19<00:18, 1.42s/batch, N=1.4451, E=0.9071,
KL=0.3712, wKL=0.2750]
Train E11: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4451, E=0.9071,
KL=0.3712, wKL=0.2750]
Train E11: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.5063, E=0.9057,
KL=0.3700, wKL=0.2750]
Train E11: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.5063, E=0.9057,
KL=0.3700, wKL=0.2750]
Train E11: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.5015, E=0.9045,
KL=0.3686, wKL=0.2750]
Train E11: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.5015, E=0.9045,
KL=0.3686, wKL=0.2750]
Train E11: 60%|██████ | 15/25 [00:23<00:14, 1.40s/batch, N=1.4404, E=0.9050,
KL=0.3650, wKL=0.2750]
Train E11: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4404, E=0.9050,
KL=0.3650, wKL=0.2750]
Train E11: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.5326, E=0.9045,
KL=0.3673, wKL=0.2750]
Train E11: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5326, E=0.9045,
KL=0.3673, wKL=0.2750]
Train E11: 68%|██████▊ | 17/25 [00:26<00:11, 1.44s/batch, N=1.4966, E=0.9034,
KL=0.3638, wKL=0.2750]
Train E11: 72%|███████▏ | 18/25 [00:26<00:09, 1.43s/batch, N=1.4966, E=0.9034,
KL=0.3638, wKL=0.2750]
Train E11: 72%|███████▏ | 18/25 [00:27<00:09, 1.43s/batch, N=1.5357, E=0.9048,
KL=0.3651, wKL=0.2750]
Train E11: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.5357, E=0.9048,
KL=0.3651, wKL=0.2750]
Train E11: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5272, E=0.9049,
KL=0.3638, wKL=0.2750]
Train E11: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5272, E=0.9049,
KL=0.3638, wKL=0.2750]
Train E11: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.5028, E=0.9075,
KL=0.3620, wKL=0.2750]
Train E11: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5028, E=0.9075,
KL=0.3620, wKL=0.2750]
Train E11: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5610, E=0.9036,
KL=0.3607, wKL=0.2750]
Train E11: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5610, E=0.9036,
KL=0.3607, wKL=0.2750]
Train E11: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.5753, E=0.9076,
KL=0.3611, wKL=0.2750]
Train E11: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.5753, E=0.9076,
KL=0.3611, wKL=0.2750]
Train E11: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5363, E=0.9063,
KL=0.3592, wKL=0.2750]
Train E11: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.5363, E=0.9063,
KL=0.3592, wKL=0.2750]
Train E11: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.5575, E=0.9084,
KL=0.3619, wKL=0.2750]
Train E11: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.5575, E=0.9084,
KL=0.3619, wKL=0.2750]
Train E11: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5575, E=0.9084,
KL=0.3619, wKL=0.2750]
441.4s 94 [Epoch 011] Total: 2.4679 | N: 1.5103 | E: 0.9062 | KL(0.28×0.5): 0.3736
476.3s 95 Train E12: 0%| | 0/25 [00:00<?, ?batch/s]
Train E12: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4128, E=0.9053, KL=0.3562,
wKL=0.3000]
Train E12: 4%|▍ | 1/25 [00:01<00:33, 1.40s/batch, N=1.4128, E=0.9053,
KL=0.3562, wKL=0.3000]
Train E12: 4%|▍ | 1/25 [00:02<00:33, 1.40s/batch, N=1.4161, E=0.9053,
KL=0.3549, wKL=0.3000]
Train E12: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.4161, E=0.9053,
KL=0.3549, wKL=0.3000]
Train E12: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.5206, E=0.9017,
KL=0.3554, wKL=0.3000]
Train E12: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5206, E=0.9017,
KL=0.3554, wKL=0.3000]
Train E12: 12%|█▏ | 3/25 [00:06<00:29, 1.36s/batch, N=1.4862, E=0.9044,
KL=0.3527, wKL=0.3000]
Train E12: 16%|█▌ | 4/25 [00:06<00:34, 1.64s/batch, N=1.4862, E=0.9044,
KL=0.3527, wKL=0.3000]
Train E12: 16%|█▌ | 4/25 [00:07<00:34, 1.64s/batch, N=1.5436, E=0.9061,
KL=0.3525, wKL=0.3000]
Train E12: 20%|██ | 5/25 [00:07<00:31, 1.60s/batch, N=1.5436, E=0.9061,
KL=0.3525, wKL=0.3000]
Train E12: 20%|██ | 5/25 [00:09<00:31, 1.60s/batch, N=1.4792, E=0.9045,
KL=0.3508, wKL=0.3000]
Train E12: 24%|██▍ | 6/25 [00:09<00:29, 1.53s/batch, N=1.4792, E=0.9045,
KL=0.3508, wKL=0.3000]
Train E12: 24%|██▍ | 6/25 [00:10<00:29, 1.53s/batch, N=1.5652, E=0.9024,
KL=0.3523, wKL=0.3000]
Train E12: 28%|██▊ | 7/25 [00:10<00:26, 1.50s/batch, N=1.5652, E=0.9024,
KL=0.3523, wKL=0.3000]
Train E12: 28%|██▊ | 7/25 [00:11<00:26, 1.50s/batch, N=1.4485, E=0.9041,
KL=0.3477, wKL=0.3000]
Train E12: 32%|███▏ | 8/25 [00:11<00:24, 1.47s/batch, N=1.4485, E=0.9041,
KL=0.3477, wKL=0.3000]
Train E12: 32%|███▏ | 8/25 [00:13<00:24, 1.47s/batch, N=1.5496, E=0.9024,
KL=0.3475, wKL=0.3000]
Train E12: 36%|███▌ | 9/25 [00:13<00:23, 1.45s/batch, N=1.5496, E=0.9024,
KL=0.3475, wKL=0.3000]
Train E12: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.5340, E=0.9005,
KL=0.3429, wKL=0.3000]
Train E12: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.5340, E=0.9005,
KL=0.3429, wKL=0.3000]
Train E12: 40%|████ | 10/25 [00:16<00:21, 1.44s/batch, N=1.5123, E=0.9035,
KL=0.3440, wKL=0.3000]
Train E12: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.5123, E=0.9035,
KL=0.3440, wKL=0.3000]
Train E12: 44%|████▍ | 11/25 [00:17<00:19, 1.42s/batch, N=1.4561, E=0.9068,
KL=0.3427, wKL=0.3000]
Train E12: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4561, E=0.9068,
KL=0.3427, wKL=0.3000]
Train E12: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5383, E=0.9027,
KL=0.3449, wKL=0.3000]
Train E12: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5383, E=0.9027,
KL=0.3449, wKL=0.3000]
Train E12: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.5454, E=0.9028,
KL=0.3441, wKL=0.3000]
Train E12: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.5454, E=0.9028,
KL=0.3441, wKL=0.3000]
Train E12: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4459, E=0.9030,
KL=0.3395, wKL=0.3000]
Train E12: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4459, E=0.9030,
KL=0.3395, wKL=0.3000]
Train E12: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.5298, E=0.9042,
KL=0.3398, wKL=0.3000]
Train E12: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.5298, E=0.9042,
KL=0.3398, wKL=0.3000]
Train E12: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.5703, E=0.9021,
KL=0.3371, wKL=0.3000]
Train E12: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5703, E=0.9021,
KL=0.3371, wKL=0.3000]
Train E12: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5296, E=0.9030,
KL=0.3369, wKL=0.3000]
Train E12: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5296, E=0.9030,
KL=0.3369, wKL=0.3000]
Train E12: 72%|███████▏ | 18/25 [00:27<00:09, 1.39s/batch, N=1.5632, E=0.9040,
KL=0.3377, wKL=0.3000]
Train E12: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.5632, E=0.9040,
KL=0.3377, wKL=0.3000]
Train E12: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5168, E=0.9033,
KL=0.3349, wKL=0.3000]
Train E12: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5168, E=0.9033,
KL=0.3349, wKL=0.3000]
Train E12: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5792, E=0.9022,
KL=0.3360, wKL=0.3000]
Train E12: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5792, E=0.9022,
KL=0.3360, wKL=0.3000]
Train E12: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5169, E=0.8997,
KL=0.3342, wKL=0.3000]
Train E12: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.5169, E=0.8997,
KL=0.3342, wKL=0.3000]
Train E12: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4897, E=0.9009,
KL=0.3325, wKL=0.3000]
Train E12: 92%|█████████▏| 23/25 [00:32<00:02, 1.39s/batch, N=1.4897, E=0.9009,
KL=0.3325, wKL=0.3000]
Train E12: 92%|█████████▏| 23/25 [00:34<00:02, 1.39s/batch, N=1.4535, E=0.9016,
KL=0.3297, wKL=0.3000]
Train E12: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4535, E=0.9016,
KL=0.3297, wKL=0.3000]
Train E12: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5575, E=0.8985,
KL=0.3337, wKL=0.3000]
Train E12: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.5575, E=0.8985,
KL=0.3337, wKL=0.3000]
Train E12: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5575, E=0.8985,
KL=0.3337, wKL=0.3000]
476.3s 96 [Epoch 012] Total: 2.4639 | N: 1.5093 | E: 0.9031 | KL(0.30×0.5): 0.3435
511.3s 97 Train E13: 0%| | 0/25 [00:00<?, ?batch/s]
Train E13: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5473, E=0.8999, KL=0.3347,
wKL=0.3250]
Train E13: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5473, E=0.8999,
KL=0.3347, wKL=0.3250]
Train E13: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5427, E=0.8995,
KL=0.3329, wKL=0.3250]
Train E13: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5427, E=0.8995,
KL=0.3329, wKL=0.3250]
Train E13: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4968, E=0.8979,
KL=0.3290, wKL=0.3250]
Train E13: 12%|█▏ | 3/25 [00:04<00:33, 1.51s/batch, N=1.4968, E=0.8979,
KL=0.3290, wKL=0.3250]
Train E13: 12%|█▏ | 3/25 [00:05<00:33, 1.51s/batch, N=1.4971, E=0.9007,
KL=0.3302, wKL=0.3250]
Train E13: 16%|█▌ | 4/25 [00:05<00:31, 1.48s/batch, N=1.4971, E=0.9007,
KL=0.3302, wKL=0.3250]
Train E13: 16%|█▌ | 4/25 [00:07<00:31, 1.48s/batch, N=1.5223, E=0.8996,
KL=0.3284, wKL=0.3250]
Train E13: 20%|██ | 5/25 [00:07<00:28, 1.44s/batch, N=1.5223, E=0.8996,
KL=0.3284, wKL=0.3250]
Train E13: 20%|██ | 5/25 [00:08<00:28, 1.44s/batch, N=1.4452, E=0.8981,
KL=0.3267, wKL=0.3250]
Train E13: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.4452, E=0.8981,
KL=0.3267, wKL=0.3250]
Train E13: 24%|██▍ | 6/25 [00:09<00:26, 1.42s/batch, N=1.4388, E=0.8965,
KL=0.3252, wKL=0.3250]
Train E13: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4388, E=0.8965,
KL=0.3252, wKL=0.3250]
Train E13: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5665, E=0.8962,
KL=0.3252, wKL=0.3250]
Train E13: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5665, E=0.8962,
KL=0.3252, wKL=0.3250]
Train E13: 32%|███▏ | 8/25 [00:13<00:23, 1.39s/batch, N=1.4660, E=0.8937,
KL=0.3210, wKL=0.3250]
Train E13: 36%|███▌ | 9/25 [00:13<00:25, 1.58s/batch, N=1.4660, E=0.8937,
KL=0.3210, wKL=0.3250]
Train E13: 36%|███▌ | 9/25 [00:14<00:25, 1.58s/batch, N=1.4529, E=0.8947,
KL=0.3185, wKL=0.3250]
Train E13: 40%|████ | 10/25 [00:14<00:23, 1.54s/batch, N=1.4529, E=0.8947,
KL=0.3185, wKL=0.3250]
Train E13: 40%|████ | 10/25 [00:16<00:23, 1.54s/batch, N=1.5188, E=0.8961,
KL=0.3200, wKL=0.3250]
Train E13: 44%|████▍ | 11/25 [00:16<00:20, 1.49s/batch, N=1.5188, E=0.8961,
KL=0.3200, wKL=0.3250]
Train E13: 44%|████▍ | 11/25 [00:17<00:20, 1.49s/batch, N=1.5467, E=0.8957,
KL=0.3198, wKL=0.3250]
Train E13: 48%|████▊ | 12/25 [00:17<00:18, 1.46s/batch, N=1.5467, E=0.8957,
KL=0.3198, wKL=0.3250]
Train E13: 48%|████▊ | 12/25 [00:18<00:18, 1.46s/batch, N=1.4774, E=0.8950,
KL=0.3192, wKL=0.3250]
Train E13: 52%|█████▏ | 13/25 [00:18<00:17, 1.43s/batch, N=1.4774, E=0.8950,
KL=0.3192, wKL=0.3250]
Train E13: 52%|█████▏ | 13/25 [00:20<00:17, 1.43s/batch, N=1.4142, E=0.8893,
KL=0.3181, wKL=0.3250]
Train E13: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4142, E=0.8893,
KL=0.3181, wKL=0.3250]
Train E13: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.5587, E=0.8889,
KL=0.3206, wKL=0.3250]
Train E13: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5587, E=0.8889,
KL=0.3206, wKL=0.3250]
Train E13: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4874, E=0.8876,
KL=0.3166, wKL=0.3250]
Train E13: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4874, E=0.8876,
KL=0.3166, wKL=0.3250]
Train E13: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.5069, E=0.8868,
KL=0.3164, wKL=0.3250]
Train E13: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.5069, E=0.8868,
KL=0.3164, wKL=0.3250]
Train E13: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.5411, E=0.8859,
KL=0.3183, wKL=0.3250]
Train E13: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5411, E=0.8859,
KL=0.3183, wKL=0.3250]
Train E13: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.6103, E=0.8889,
KL=0.3208, wKL=0.3250]
Train E13: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.6103, E=0.8889,
KL=0.3208, wKL=0.3250]
Train E13: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4265, E=0.8808,
KL=0.3167, wKL=0.3250]
Train E13: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4265, E=0.8808,
KL=0.3167, wKL=0.3250]
Train E13: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5063, E=0.8819,
KL=0.3182, wKL=0.3250]
Train E13: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5063, E=0.8819,
KL=0.3182, wKL=0.3250]
Train E13: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5402, E=0.8805,
KL=0.3210, wKL=0.3250]
Train E13: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.5402, E=0.8805,
KL=0.3210, wKL=0.3250]
Train E13: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4774, E=0.8791,
KL=0.3202, wKL=0.3250]
Train E13: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4774, E=0.8791,
KL=0.3202, wKL=0.3250]
Train E13: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5410, E=0.8765,
KL=0.3200, wKL=0.3250]
Train E13: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5410, E=0.8765,
KL=0.3200, wKL=0.3250]
Train E13: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5975, E=0.8798,
KL=0.3234, wKL=0.3250]
Train E13: 100%|██████████| 25/25 [00:34<00:00, 1.19s/batch, N=1.5975, E=0.8798,
KL=0.3234, wKL=0.3250]
Train E13: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5975, E=0.8798,
KL=0.3234, wKL=0.3250]
511.3s 98 [Epoch 013] Total: 2.4503 | N: 1.5069 | E: 0.8910 | KL(0.33×0.5): 0.3224
546.4s 99 Train E14: 0%| | 0/25 [00:00<?, ?batch/s]
Train E14: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5399, E=0.8757, KL=0.3247,
wKL=0.3500]
Train E14: 4%|▍ | 1/25 [00:01<00:36, 1.53s/batch, N=1.5399, E=0.8757,
KL=0.3247, wKL=0.3500]
Train E14: 4%|▍ | 1/25 [00:02<00:36, 1.53s/batch, N=1.4990, E=0.8706,
KL=0.3237, wKL=0.3500]
Train E14: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.4990, E=0.8706,
KL=0.3237, wKL=0.3500]
Train E14: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.5140, E=0.8673,
KL=0.3227, wKL=0.3500]
Train E14: 12%|█▏ | 3/25 [00:04<00:31, 1.41s/batch, N=1.5140, E=0.8673,
KL=0.3227, wKL=0.3500]
Train E14: 12%|█▏ | 3/25 [00:05<00:31, 1.41s/batch, N=1.4455, E=0.8663,
KL=0.3225, wKL=0.3500]
Train E14: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4455, E=0.8663,
KL=0.3225, wKL=0.3500]
Train E14: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4900, E=0.8637,
KL=0.3254, wKL=0.3500]
Train E14: 20%|██ | 5/25 [00:07<00:27, 1.40s/batch, N=1.4900, E=0.8637,
KL=0.3254, wKL=0.3500]
Train E14: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.5399, E=0.8619,
KL=0.3268, wKL=0.3500]
Train E14: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5399, E=0.8619,
KL=0.3268, wKL=0.3500]
Train E14: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4421, E=0.8567,
KL=0.3279, wKL=0.3500]
Train E14: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4421, E=0.8567,
KL=0.3279, wKL=0.3500]
Train E14: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4874, E=0.8567,
KL=0.3325, wKL=0.3500]
Train E14: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4874, E=0.8567,
KL=0.3325, wKL=0.3500]
Train E14: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.5812, E=0.8488,
KL=0.3320, wKL=0.3500]
Train E14: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5812, E=0.8488,
KL=0.3320, wKL=0.3500]
Train E14: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.5572, E=0.8514,
KL=0.3318, wKL=0.3500]
Train E14: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.5572, E=0.8514,
KL=0.3318, wKL=0.3500]
Train E14: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4397, E=0.8511,
KL=0.3325, wKL=0.3500]
Train E14: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4397, E=0.8511,
KL=0.3325, wKL=0.3500]
Train E14: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.4607, E=0.8474,
KL=0.3378, wKL=0.3500]
Train E14: 48%|████▊ | 12/25 [00:17<00:20, 1.59s/batch, N=1.4607, E=0.8474,
KL=0.3378, wKL=0.3500]
Train E14: 48%|████▊ | 12/25 [00:18<00:20, 1.59s/batch, N=1.5927, E=0.8502,
KL=0.3428, wKL=0.3500]
Train E14: 52%|█████▏ | 13/25 [00:18<00:18, 1.54s/batch, N=1.5927, E=0.8502,
KL=0.3428, wKL=0.3500]
Train E14: 52%|█████▏ | 13/25 [00:20<00:18, 1.54s/batch, N=1.5155, E=0.8485,
KL=0.3436, wKL=0.3500]
Train E14: 56%|█████▌ | 14/25 [00:20<00:16, 1.51s/batch, N=1.5155, E=0.8485,
KL=0.3436, wKL=0.3500]
Train E14: 56%|█████▌ | 14/25 [00:21<00:16, 1.51s/batch, N=1.4703, E=0.8405,
KL=0.3418, wKL=0.3500]
Train E14: 60%|██████ | 15/25 [00:21<00:14, 1.47s/batch, N=1.4703, E=0.8405,
KL=0.3418, wKL=0.3500]
Train E14: 60%|██████ | 15/25 [00:23<00:14, 1.47s/batch, N=1.4250, E=0.8418,
KL=0.3431, wKL=0.3500]
Train E14: 64%|██████▍ | 16/25 [00:23<00:13, 1.44s/batch, N=1.4250, E=0.8418,
KL=0.3431, wKL=0.3500]
Train E14: 64%|██████▍ | 16/25 [00:24<00:13, 1.44s/batch, N=1.4905, E=0.8388,
KL=0.3465, wKL=0.3500]
Train E14: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.4905, E=0.8388,
KL=0.3465, wKL=0.3500]
Train E14: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.5387, E=0.8415,
KL=0.3508, wKL=0.3500]
Train E14: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5387, E=0.8415,
KL=0.3508, wKL=0.3500]
Train E14: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.5076, E=0.8382,
KL=0.3525, wKL=0.3500]
Train E14: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5076, E=0.8382,
KL=0.3525, wKL=0.3500]
Train E14: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4468, E=0.8417,
KL=0.3493, wKL=0.3500]
Train E14: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4468, E=0.8417,
KL=0.3493, wKL=0.3500]
Train E14: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.4582, E=0.8392,
KL=0.3473, wKL=0.3500]
Train E14: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4582, E=0.8392,
KL=0.3473, wKL=0.3500]
Train E14: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.4248, E=0.8377,
KL=0.3458, wKL=0.3500]
Train E14: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.4248, E=0.8377,
KL=0.3458, wKL=0.3500]
Train E14: 88%|████████▊ | 22/25 [00:33<00:04, 1.44s/batch, N=1.5299, E=0.8386,
KL=0.3474, wKL=0.3500]
Train E14: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.5299, E=0.8386,
KL=0.3474, wKL=0.3500]
Train E14: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5121, E=0.8366,
KL=0.3468, wKL=0.3500]
Train E14: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.5121, E=0.8366,
KL=0.3468, wKL=0.3500]
Train E14: 96%|█████████▌| 24/25 [00:35<00:01, 1.44s/batch, N=1.6547, E=0.8339,
KL=0.3487, wKL=0.3500]
Train E14: 100%|██████████| 25/25 [00:35<00:00, 1.19s/batch, N=1.6547, E=0.8339,
KL=0.3487, wKL=0.3500]
Train E14: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.6547, E=0.8339,
KL=0.3487, wKL=0.3500]
546.4s 100 [Epoch 014] Total: 2.4081 | N: 1.4989 | E: 0.8502 | KL(0.35×0.5): 0.3376
581.5s 101 Train E15: 0%| | 0/25 [00:00<?, ?batch/s]
Train E15: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5293, E=0.8376, KL=0.3426,
wKL=0.3750]
Train E15: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.5293, E=0.8376,
KL=0.3426, wKL=0.3750]
Train E15: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.4947, E=0.8374,
KL=0.3395, wKL=0.3750]
Train E15: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4947, E=0.8374,
KL=0.3395, wKL=0.3750]
Train E15: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.5105, E=0.8318,
KL=0.3359, wKL=0.3750]
Train E15: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5105, E=0.8318,
KL=0.3359, wKL=0.3750]
Train E15: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4658, E=0.8380,
KL=0.3319, wKL=0.3750]
Train E15: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.4658, E=0.8380,
KL=0.3319, wKL=0.3750]
Train E15: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.4792, E=0.8344,
KL=0.3301, wKL=0.3750]
Train E15: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4792, E=0.8344,
KL=0.3301, wKL=0.3750]
Train E15: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5064, E=0.8337,
KL=0.3282, wKL=0.3750]
Train E15: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5064, E=0.8337,
KL=0.3282, wKL=0.3750]
Train E15: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5324, E=0.8321,
KL=0.3251, wKL=0.3750]
Train E15: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5324, E=0.8321,
KL=0.3251, wKL=0.3750]
Train E15: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4746, E=0.8305,
KL=0.3212, wKL=0.3750]
Train E15: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4746, E=0.8305,
KL=0.3212, wKL=0.3750]
Train E15: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5386, E=0.8388,
KL=0.3199, wKL=0.3750]
Train E15: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.5386, E=0.8388,
KL=0.3199, wKL=0.3750]
Train E15: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.6122, E=0.8324,
KL=0.3214, wKL=0.3750]
Train E15: 40%|████ | 10/25 [00:13<00:20, 1.37s/batch, N=1.6122, E=0.8324,
KL=0.3214, wKL=0.3750]
Train E15: 40%|████ | 10/25 [00:15<00:20, 1.37s/batch, N=1.4640, E=0.8278,
KL=0.3193, wKL=0.3750]
Train E15: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4640, E=0.8278,
KL=0.3193, wKL=0.3750]
Train E15: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4052, E=0.8340,
KL=0.3164, wKL=0.3750]
Train E15: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4052, E=0.8340,
KL=0.3164, wKL=0.3750]
Train E15: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4371, E=0.8316,
KL=0.3169, wKL=0.3750]
Train E15: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4371, E=0.8316,
KL=0.3169, wKL=0.3750]
Train E15: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4958, E=0.8341,
KL=0.3133, wKL=0.3750]
Train E15: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4958, E=0.8341,
KL=0.3133, wKL=0.3750]
Train E15: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.4404, E=0.8369,
KL=0.3131, wKL=0.3750]
Train E15: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.4404, E=0.8369,
KL=0.3131, wKL=0.3750]
Train E15: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4911, E=0.8272,
KL=0.3145, wKL=0.3750]
Train E15: 64%|██████▍ | 16/25 [00:22<00:14, 1.59s/batch, N=1.4911, E=0.8272,
KL=0.3145, wKL=0.3750]
Train E15: 64%|██████▍ | 16/25 [00:24<00:14, 1.59s/batch, N=1.5196, E=0.8321,
KL=0.3153, wKL=0.3750]
Train E15: 68%|██████▊ | 17/25 [00:24<00:12, 1.55s/batch, N=1.5196, E=0.8321,
KL=0.3153, wKL=0.3750]
Train E15: 68%|██████▊ | 17/25 [00:25<00:12, 1.55s/batch, N=1.4874, E=0.8312,
KL=0.3143, wKL=0.3750]
Train E15: 72%|███████▏ | 18/25 [00:25<00:10, 1.50s/batch, N=1.4874, E=0.8312,
KL=0.3143, wKL=0.3750]
Train E15: 72%|███████▏ | 18/25 [00:27<00:10, 1.50s/batch, N=1.5481, E=0.8335,
KL=0.3144, wKL=0.3750]
Train E15: 76%|███████▌ | 19/25 [00:27<00:08, 1.47s/batch, N=1.5481, E=0.8335,
KL=0.3144, wKL=0.3750]
Train E15: 76%|███████▌ | 19/25 [00:28<00:08, 1.47s/batch, N=1.4926, E=0.8245,
KL=0.3127, wKL=0.3750]
Train E15: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.4926, E=0.8245,
KL=0.3127, wKL=0.3750]
Train E15: 80%|████████ | 20/25 [00:30<00:07, 1.50s/batch, N=1.4364, E=0.8316,
KL=0.3108, wKL=0.3750]
Train E15: 84%|████████▍ | 21/25 [00:30<00:06, 1.52s/batch, N=1.4364, E=0.8316,
KL=0.3108, wKL=0.3750]
Train E15: 84%|████████▍ | 21/25 [00:31<00:06, 1.52s/batch, N=1.5324, E=0.8264,
KL=0.3127, wKL=0.3750]
Train E15: 88%|████████▊ | 22/25 [00:31<00:04, 1.48s/batch, N=1.5324, E=0.8264,
KL=0.3127, wKL=0.3750]
Train E15: 88%|████████▊ | 22/25 [00:32<00:04, 1.48s/batch, N=1.4754, E=0.8256,
KL=0.3123, wKL=0.3750]
Train E15: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.4754, E=0.8256,
KL=0.3123, wKL=0.3750]
Train E15: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5212, E=0.8307,
KL=0.3116, wKL=0.3750]
Train E15: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5212, E=0.8307,
KL=0.3116, wKL=0.3750]
Train E15: 96%|█████████▌| 24/25 [00:35<00:01, 1.45s/batch, N=1.5164, E=0.8324,
KL=0.3098, wKL=0.3750]
Train E15: 100%|██████████| 25/25 [00:35<00:00, 1.20s/batch, N=1.5164, E=0.8324,
KL=0.3098, wKL=0.3750]
Train E15: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5164, E=0.8324,
KL=0.3098, wKL=0.3750]
581.5s 102 [Epoch 015] Total: 2.3881 | N: 1.4958 | E: 0.8322 | KL(0.38×0.5): 0.3204
616.5s 103 Train E16: 0%| | 0/25 [00:00<?, ?batch/s]
Train E16: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4652, E=0.8296, KL=0.3083,
wKL=0.4000]
Train E16: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4652, E=0.8296,
KL=0.3083, wKL=0.4000]
Train E16: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5902, E=0.8318,
KL=0.3113, wKL=0.4000]
Train E16: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5902, E=0.8318,
KL=0.3113, wKL=0.4000]
Train E16: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4924, E=0.8271,
KL=0.3087, wKL=0.4000]
Train E16: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4924, E=0.8271,
KL=0.3087, wKL=0.4000]
Train E16: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4109, E=0.8294,
KL=0.3025, wKL=0.4000]
Train E16: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4109, E=0.8294,
KL=0.3025, wKL=0.4000]
Train E16: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5012, E=0.8313,
KL=0.3038, wKL=0.4000]
Train E16: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5012, E=0.8313,
KL=0.3038, wKL=0.4000]
Train E16: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.3029, wKL=0.4000]
Train E16: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.3029, wKL=0.4000]
Train E16: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4879, E=0.8262,
KL=0.3038, wKL=0.4000]
Train E16: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4879, E=0.8262,
KL=0.3038, wKL=0.4000]
Train E16: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5264, E=0.8323,
KL=0.2990, wKL=0.4000]
Train E16: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.5264, E=0.8323,
KL=0.2990, wKL=0.4000]
Train E16: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.5038, E=0.8319,
KL=0.2969, wKL=0.4000]
Train E16: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5038, E=0.8319,
KL=0.2969, wKL=0.4000]
Train E16: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5893, E=0.8302,
KL=0.2981, wKL=0.4000]
Train E16: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.5893, E=0.8302,
KL=0.2981, wKL=0.4000]
Train E16: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4951, E=0.8289,
KL=0.2933, wKL=0.4000]
Train E16: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4951, E=0.8289,
KL=0.2933, wKL=0.4000]
Train E16: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4369, E=0.8354,
KL=0.2926, wKL=0.4000]
Train E16: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4369, E=0.8354,
KL=0.2926, wKL=0.4000]
Train E16: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4493, E=0.8349,
KL=0.2913, wKL=0.4000]
Train E16: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4493, E=0.8349,
KL=0.2913, wKL=0.4000]
Train E16: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5137, E=0.8328,
KL=0.2911, wKL=0.4000]
Train E16: 56%|█████▌ | 14/25 [00:19<00:15, 1.42s/batch, N=1.5137, E=0.8328,
KL=0.2911, wKL=0.4000]
Train E16: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.5055, E=0.8258,
KL=0.2906, wKL=0.4000]
Train E16: 60%|██████ | 15/25 [00:20<00:14, 1.43s/batch, N=1.5055, E=0.8258,
KL=0.2906, wKL=0.4000]
Train E16: 60%|██████ | 15/25 [00:22<00:14, 1.43s/batch, N=1.5913, E=0.8269,
KL=0.2929, wKL=0.4000]
Train E16: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.5913, E=0.8269,
KL=0.2929, wKL=0.4000]
Train E16: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.4800, E=0.8300,
KL=0.2900, wKL=0.4000]
Train E16: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.4800, E=0.8300,
KL=0.2900, wKL=0.4000]
Train E16: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5476, E=0.8323,
KL=0.2899, wKL=0.4000]
Train E16: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.5476, E=0.8323,
KL=0.2899, wKL=0.4000]
Train E16: 72%|███████▏ | 18/25 [00:27<00:10, 1.44s/batch, N=1.4325, E=0.8280,
KL=0.2881, wKL=0.4000]
Train E16: 76%|███████▌ | 19/25 [00:27<00:09, 1.64s/batch, N=1.4325, E=0.8280,
KL=0.2881, wKL=0.4000]
Train E16: 76%|███████▌ | 19/25 [00:28<00:09, 1.64s/batch, N=1.4304, E=0.8310,
KL=0.2873, wKL=0.4000]
Train E16: 80%|████████ | 20/25 [00:28<00:07, 1.58s/batch, N=1.4304, E=0.8310,
KL=0.2873, wKL=0.4000]
Train E16: 80%|████████ | 20/25 [00:30<00:07, 1.58s/batch, N=1.5011, E=0.8236,
KL=0.2911, wKL=0.4000]
Train E16: 84%|████████▍ | 21/25 [00:30<00:06, 1.53s/batch, N=1.5011, E=0.8236,
KL=0.2911, wKL=0.4000]
Train E16: 84%|████████▍ | 21/25 [00:31<00:06, 1.53s/batch, N=1.3503, E=0.8260,
KL=0.2881, wKL=0.4000]
Train E16: 88%|████████▊ | 22/25 [00:31<00:04, 1.49s/batch, N=1.3503, E=0.8260,
KL=0.2881, wKL=0.4000]
Train E16: 88%|████████▊ | 22/25 [00:33<00:04, 1.49s/batch, N=1.5412, E=0.8293,
KL=0.2905, wKL=0.4000]
Train E16: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.5412, E=0.8293,
KL=0.2905, wKL=0.4000]
Train E16: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5358, E=0.8297,
KL=0.2887, wKL=0.4000]
Train E16: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.5358, E=0.8297,
KL=0.2887, wKL=0.4000]
Train E16: 96%|█████████▌| 24/25 [00:35<00:01, 1.44s/batch, N=1.5065, E=0.8332,
KL=0.2890, wKL=0.4000]
Train E16: 100%|██████████| 25/25 [00:35<00:00, 1.21s/batch, N=1.5065, E=0.8332,
KL=0.2890, wKL=0.4000]
Train E16: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5065, E=0.8332,
KL=0.2890, wKL=0.4000]
616.5s 104 [Epoch 016] Total: 2.3838 | N: 1.4951 | E: 0.8295 | KL(0.40×0.5): 0.2958
651.5s 105 Train E17: 0%| | 0/25 [00:00<?, ?batch/s]
Train E17: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5101, E=0.8297, KL=0.2871,
wKL=0.4250]
Train E17: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5101, E=0.8297,
KL=0.2871, wKL=0.4250]
Train E17: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5180, E=0.8264,
KL=0.2894, wKL=0.4250]
Train E17: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5180, E=0.8264,
KL=0.2894, wKL=0.4250]
Train E17: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5914, E=0.8280,
KL=0.2868, wKL=0.4250]
Train E17: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5914, E=0.8280,
KL=0.2868, wKL=0.4250]
Train E17: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5319, E=0.8271,
KL=0.2854, wKL=0.4250]
Train E17: 16%|█▌ | 4/25 [00:05<00:29, 1.40s/batch, N=1.5319, E=0.8271,
KL=0.2854, wKL=0.4250]
Train E17: 16%|█▌ | 4/25 [00:06<00:29, 1.40s/batch, N=1.5482, E=0.8274,
KL=0.2864, wKL=0.4250]
Train E17: 20%|██ | 5/25 [00:06<00:28, 1.41s/batch, N=1.5482, E=0.8274,
KL=0.2864, wKL=0.4250]
Train E17: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.3949, E=0.8273,
KL=0.2783, wKL=0.4250]
Train E17: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.3949, E=0.8273,
KL=0.2783, wKL=0.4250]
Train E17: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5255, E=0.8298,
KL=0.2778, wKL=0.4250]
Train E17: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5255, E=0.8298,
KL=0.2778, wKL=0.4250]
Train E17: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4098, E=0.8272,
KL=0.2756, wKL=0.4250]
Train E17: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4098, E=0.8272,
KL=0.2756, wKL=0.4250]
Train E17: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4573, E=0.8311,
KL=0.2768, wKL=0.4250]
Train E17: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4573, E=0.8311,
KL=0.2768, wKL=0.4250]
Train E17: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5705, E=0.8246,
KL=0.2774, wKL=0.4250]
Train E17: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.5705, E=0.8246,
KL=0.2774, wKL=0.4250]
Train E17: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4136, E=0.8305,
KL=0.2737, wKL=0.4250]
Train E17: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4136, E=0.8305,
KL=0.2737, wKL=0.4250]
Train E17: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4269, E=0.8278,
KL=0.2703, wKL=0.4250]
Train E17: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4269, E=0.8278,
KL=0.2703, wKL=0.4250]
Train E17: 48%|████▊ | 12/25 [00:18<00:17, 1.38s/batch, N=1.4903, E=0.8246,
KL=0.2703, wKL=0.4250]
Train E17: 52%|█████▏ | 13/25 [00:18<00:16, 1.38s/batch, N=1.4903, E=0.8246,
KL=0.2703, wKL=0.4250]
Train E17: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.4956, E=0.8299,
KL=0.2703, wKL=0.4250]
Train E17: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.4956, E=0.8299,
KL=0.2703, wKL=0.4250]
Train E17: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4749, E=0.8302,
KL=0.2700, wKL=0.4250]
Train E17: 60%|██████ | 15/25 [00:20<00:13, 1.37s/batch, N=1.4749, E=0.8302,
KL=0.2700, wKL=0.4250]
Train E17: 60%|██████ | 15/25 [00:22<00:13, 1.37s/batch, N=1.4780, E=0.8297,
KL=0.2687, wKL=0.4250]
Train E17: 64%|██████▍ | 16/25 [00:22<00:13, 1.46s/batch, N=1.4780, E=0.8297,
KL=0.2687, wKL=0.4250]
Train E17: 64%|██████▍ | 16/25 [00:23<00:13, 1.46s/batch, N=1.4482, E=0.8310,
KL=0.2663, wKL=0.4250]
Train E17: 68%|██████▊ | 17/25 [00:23<00:11, 1.45s/batch, N=1.4482, E=0.8310,
KL=0.2663, wKL=0.4250]
Train E17: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.4167, E=0.8253,
KL=0.2665, wKL=0.4250]
Train E17: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.4167, E=0.8253,
KL=0.2665, wKL=0.4250]
Train E17: 72%|███████▏ | 18/25 [00:26<00:10, 1.44s/batch, N=1.6420, E=0.8272,
KL=0.2701, wKL=0.4250]
Train E17: 76%|███████▌ | 19/25 [00:26<00:08, 1.43s/batch, N=1.6420, E=0.8272,
KL=0.2701, wKL=0.4250]
Train E17: 76%|███████▌ | 19/25 [00:28<00:08, 1.43s/batch, N=1.4691, E=0.8249,
KL=0.2689, wKL=0.4250]
Train E17: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4691, E=0.8249,
KL=0.2689, wKL=0.4250]
Train E17: 80%|████████ | 20/25 [00:29<00:07, 1.42s/batch, N=1.5187, E=0.8294,
KL=0.2655, wKL=0.4250]
Train E17: 84%|████████▍ | 21/25 [00:29<00:05, 1.42s/batch, N=1.5187, E=0.8294,
KL=0.2655, wKL=0.4250]
Train E17: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5358, E=0.8260,
KL=0.2660, wKL=0.4250]
Train E17: 88%|████████▊ | 22/25 [00:30<00:04, 1.40s/batch, N=1.5358, E=0.8260,
KL=0.2660, wKL=0.4250]
Train E17: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4772, E=0.8265,
KL=0.2688, wKL=0.4250]
Train E17: 92%|█████████▏| 23/25 [00:32<00:03, 1.59s/batch, N=1.4772, E=0.8265,
KL=0.2688, wKL=0.4250]
Train E17: 92%|█████████▏| 23/25 [00:34<00:03, 1.59s/batch, N=1.5117, E=0.8327,
KL=0.2681, wKL=0.4250]
Train E17: 96%|█████████▌| 24/25 [00:34<00:01, 1.54s/batch, N=1.5117, E=0.8327,
KL=0.2681, wKL=0.4250]
Train E17: 96%|█████████▌| 24/25 [00:34<00:01, 1.54s/batch, N=1.4888, E=0.8280,
KL=0.2671, wKL=0.4250]
Train E17: 100%|██████████| 25/25 [00:34<00:00, 1.26s/batch, N=1.4888, E=0.8280,
KL=0.2671, wKL=0.4250]
Train E17: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4888, E=0.8280,
KL=0.2671, wKL=0.4250]
651.5s 106 [Epoch 017] Total: 2.3803 | N: 1.4939 | E: 0.8281 | KL(0.42×0.5): 0.2742
685.8s 107 Train E18: 0%| | 0/25 [00:00<?, ?batch/s]
Train E18: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4879, E=0.8247, KL=0.2658,
wKL=0.4500]
Train E18: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4879, E=0.8247,
KL=0.2658, wKL=0.4500]
Train E18: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4226, E=0.8264,
KL=0.2647, wKL=0.4500]
Train E18: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4226, E=0.8264,
KL=0.2647, wKL=0.4500]
Train E18: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5227, E=0.8289,
KL=0.2678, wKL=0.4500]
Train E18: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.5227, E=0.8289,
KL=0.2678, wKL=0.4500]
Train E18: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5000, E=0.8261,
KL=0.2642, wKL=0.4500]
Train E18: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5000, E=0.8261,
KL=0.2642, wKL=0.4500]
Train E18: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5024, E=0.8298,
KL=0.2628, wKL=0.4500]
Train E18: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5024, E=0.8298,
KL=0.2628, wKL=0.4500]
Train E18: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.5569, E=0.8267,
KL=0.2598, wKL=0.4500]
Train E18: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5569, E=0.8267,
KL=0.2598, wKL=0.4500]
Train E18: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4495, E=0.8282,
KL=0.2607, wKL=0.4500]
Train E18: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4495, E=0.8282,
KL=0.2607, wKL=0.4500]
Train E18: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4642, E=0.8282,
KL=0.2584, wKL=0.4500]
Train E18: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4642, E=0.8282,
KL=0.2584, wKL=0.4500]
Train E18: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5023, E=0.8306,
KL=0.2546, wKL=0.4500]
Train E18: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.5023, E=0.8306,
KL=0.2546, wKL=0.4500]
Train E18: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.4571, E=0.8248,
KL=0.2521, wKL=0.4500]
Train E18: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4571, E=0.8248,
KL=0.2521, wKL=0.4500]
Train E18: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.4748, E=0.8270,
KL=0.2542, wKL=0.4500]
Train E18: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4748, E=0.8270,
KL=0.2542, wKL=0.4500]
Train E18: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5611, E=0.8239,
KL=0.2546, wKL=0.4500]
Train E18: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5611, E=0.8239,
KL=0.2546, wKL=0.4500]
Train E18: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4592, E=0.8272,
KL=0.2513, wKL=0.4500]
Train E18: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4592, E=0.8272,
KL=0.2513, wKL=0.4500]
Train E18: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5321, E=0.8307,
KL=0.2512, wKL=0.4500]
Train E18: 56%|█████▌ | 14/25 [00:19<00:16, 1.47s/batch, N=1.5321, E=0.8307,
KL=0.2512, wKL=0.4500]
Train E18: 56%|█████▌ | 14/25 [00:20<00:16, 1.47s/batch, N=1.4314, E=0.8231,
KL=0.2524, wKL=0.4500]
Train E18: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.4314, E=0.8231,
KL=0.2524, wKL=0.4500]
Train E18: 60%|██████ | 15/25 [00:22<00:14, 1.44s/batch, N=1.4881, E=0.8294,
KL=0.2504, wKL=0.4500]
Train E18: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.4881, E=0.8294,
KL=0.2504, wKL=0.4500]
Train E18: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.5106, E=0.8253,
KL=0.2510, wKL=0.4500]
Train E18: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.5106, E=0.8253,
KL=0.2510, wKL=0.4500]
Train E18: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5054, E=0.8284,
KL=0.2513, wKL=0.4500]
Train E18: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5054, E=0.8284,
KL=0.2513, wKL=0.4500]
Train E18: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.4670, E=0.8261,
KL=0.2505, wKL=0.4500]
Train E18: 76%|███████▌ | 19/25 [00:26<00:08, 1.42s/batch, N=1.4670, E=0.8261,
KL=0.2505, wKL=0.4500]
Train E18: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5448, E=0.8259,
KL=0.2506, wKL=0.4500]
Train E18: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.5448, E=0.8259,
KL=0.2506, wKL=0.4500]
Train E18: 80%|████████ | 20/25 [00:29<00:07, 1.43s/batch, N=1.4607, E=0.8246,
KL=0.2483, wKL=0.4500]
Train E18: 84%|████████▍ | 21/25 [00:29<00:05, 1.43s/batch, N=1.4607, E=0.8246,
KL=0.2483, wKL=0.4500]
Train E18: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4392, E=0.8265,
KL=0.2488, wKL=0.4500]
Train E18: 88%|████████▊ | 22/25 [00:30<00:04, 1.42s/batch, N=1.4392, E=0.8265,
KL=0.2488, wKL=0.4500]
Train E18: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5339, E=0.8325,
KL=0.2493, wKL=0.4500]
Train E18: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.5339, E=0.8325,
KL=0.2493, wKL=0.4500]
Train E18: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.5196, E=0.8241,
KL=0.2475, wKL=0.4500]
Train E18: 96%|█████████▌| 24/25 [00:33<00:01, 1.41s/batch, N=1.5196, E=0.8241,
KL=0.2475, wKL=0.4500]
Train E18: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5215, E=0.8312,
KL=0.2472, wKL=0.4500]
Train E18: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5215, E=0.8312,
KL=0.2472, wKL=0.4500]
Train E18: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.5215, E=0.8312,
KL=0.2472, wKL=0.4500]
685.8s 108 [Epoch 018] Total: 2.3764 | N: 1.4919 | E: 0.8271 | KL(0.45×0.5): 0.2550
721.0s 109 Train E19: 0%| | 0/25 [00:00<?, ?batch/s]
Train E19: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5769, E=0.8271, KL=0.2476,
wKL=0.4750]
Train E19: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5769, E=0.8271,
KL=0.2476, wKL=0.4750]
Train E19: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4629, E=0.8268,
KL=0.2465, wKL=0.4750]
Train E19: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4629, E=0.8268,
KL=0.2465, wKL=0.4750]
Train E19: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4440, E=0.8297,
KL=0.2440, wKL=0.4750]
Train E19: 12%|█▏ | 3/25 [00:04<00:37, 1.72s/batch, N=1.4440, E=0.8297,
KL=0.2440, wKL=0.4750]
Train E19: 12%|█▏ | 3/25 [00:06<00:37, 1.72s/batch, N=1.4732, E=0.8238,
KL=0.2456, wKL=0.4750]
Train E19: 16%|█▌ | 4/25 [00:06<00:33, 1.58s/batch, N=1.4732, E=0.8238,
KL=0.2456, wKL=0.4750]
Train E19: 16%|█▌ | 4/25 [00:07<00:33, 1.58s/batch, N=1.4542, E=0.8244,
KL=0.2437, wKL=0.4750]
Train E19: 20%|██ | 5/25 [00:07<00:30, 1.50s/batch, N=1.4542, E=0.8244,
KL=0.2437, wKL=0.4750]
Train E19: 20%|██ | 5/25 [00:09<00:30, 1.50s/batch, N=1.5483, E=0.8246,
KL=0.2437, wKL=0.4750]
Train E19: 24%|██▍ | 6/25 [00:09<00:27, 1.47s/batch, N=1.5483, E=0.8246,
KL=0.2437, wKL=0.4750]
Train E19: 24%|██▍ | 6/25 [00:10<00:27, 1.47s/batch, N=1.4934, E=0.8268,
KL=0.2413, wKL=0.4750]
Train E19: 28%|██▊ | 7/25 [00:10<00:25, 1.44s/batch, N=1.4934, E=0.8268,
KL=0.2413, wKL=0.4750]
Train E19: 28%|██▊ | 7/25 [00:11<00:25, 1.44s/batch, N=1.5205, E=0.8303,
KL=0.2385, wKL=0.4750]
Train E19: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5205, E=0.8303,
KL=0.2385, wKL=0.4750]
Train E19: 32%|███▏ | 8/25 [00:13<00:24, 1.42s/batch, N=1.5378, E=0.8236,
KL=0.2394, wKL=0.4750]
Train E19: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.5378, E=0.8236,
KL=0.2394, wKL=0.4750]
Train E19: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.3987, E=0.8264,
KL=0.2364, wKL=0.4750]
Train E19: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.3987, E=0.8264,
KL=0.2364, wKL=0.4750]
Train E19: 40%|████ | 10/25 [00:16<00:21, 1.41s/batch, N=1.4580, E=0.8238,
KL=0.2371, wKL=0.4750]
Train E19: 44%|████▍ | 11/25 [00:16<00:20, 1.49s/batch, N=1.4580, E=0.8238,
KL=0.2371, wKL=0.4750]
Train E19: 44%|████▍ | 11/25 [00:17<00:20, 1.49s/batch, N=1.4793, E=0.8267,
KL=0.2371, wKL=0.4750]
Train E19: 48%|████▊ | 12/25 [00:17<00:19, 1.47s/batch, N=1.4793, E=0.8267,
KL=0.2371, wKL=0.4750]
Train E19: 48%|████▊ | 12/25 [00:19<00:19, 1.47s/batch, N=1.5524, E=0.8266,
KL=0.2352, wKL=0.4750]
Train E19: 52%|█████▏ | 13/25 [00:19<00:17, 1.44s/batch, N=1.5524, E=0.8266,
KL=0.2352, wKL=0.4750]
Train E19: 52%|█████▏ | 13/25 [00:20<00:17, 1.44s/batch, N=1.4741, E=0.8267,
KL=0.2346, wKL=0.4750]
Train E19: 56%|█████▌ | 14/25 [00:20<00:15, 1.44s/batch, N=1.4741, E=0.8267,
KL=0.2346, wKL=0.4750]
Train E19: 56%|█████▌ | 14/25 [00:21<00:15, 1.44s/batch, N=1.3984, E=0.8248,
KL=0.2339, wKL=0.4750]
Train E19: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.3984, E=0.8248,
KL=0.2339, wKL=0.4750]
Train E19: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.4899, E=0.8275,
KL=0.2346, wKL=0.4750]
Train E19: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.4899, E=0.8275,
KL=0.2346, wKL=0.4750]
Train E19: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.4708, E=0.8295,
KL=0.2338, wKL=0.4750]
Train E19: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.4708, E=0.8295,
KL=0.2338, wKL=0.4750]
Train E19: 68%|██████▊ | 17/25 [00:26<00:11, 1.43s/batch, N=1.5463, E=0.8269,
KL=0.2338, wKL=0.4750]
Train E19: 72%|███████▏ | 18/25 [00:26<00:09, 1.42s/batch, N=1.5463, E=0.8269,
KL=0.2338, wKL=0.4750]
Train E19: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5078, E=0.8283,
KL=0.2335, wKL=0.4750]
Train E19: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5078, E=0.8283,
KL=0.2335, wKL=0.4750]
Train E19: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5204, E=0.8328,
KL=0.2330, wKL=0.4750]
Train E19: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5204, E=0.8328,
KL=0.2330, wKL=0.4750]
Train E19: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.4661, E=0.8239,
KL=0.2334, wKL=0.4750]
Train E19: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.4661, E=0.8239,
KL=0.2334, wKL=0.4750]
Train E19: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5584, E=0.8301,
KL=0.2326, wKL=0.4750]
Train E19: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.5584, E=0.8301,
KL=0.2326, wKL=0.4750]
Train E19: 88%|████████▊ | 22/25 [00:33<00:04, 1.40s/batch, N=1.4515, E=0.8221,
KL=0.2326, wKL=0.4750]
Train E19: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4515, E=0.8221,
KL=0.2326, wKL=0.4750]
Train E19: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5171, E=0.8242,
KL=0.2332, wKL=0.4750]
Train E19: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5171, E=0.8242,
KL=0.2332, wKL=0.4750]
Train E19: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.4290, E=0.8331,
KL=0.2307, wKL=0.4750]
Train E19: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.4290, E=0.8331,
KL=0.2307, wKL=0.4750]
Train E19: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4290, E=0.8331,
KL=0.2307, wKL=0.4750]
721.0s 110 [Epoch 019] Total: 2.3737 | N: 1.4906 | E: 0.8267 | KL(0.47×0.5): 0.2376
755.9s 111 Train E20: 0%| | 0/25 [00:00<?, ?batch/s]
Train E20: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4677, E=0.8248, KL=0.2314,
wKL=0.5000]
Train E20: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4677, E=0.8248,
KL=0.2314, wKL=0.5000]
Train E20: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5378, E=0.8258,
KL=0.2315, wKL=0.5000]
Train E20: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5378, E=0.8258,
KL=0.2315, wKL=0.5000]
Train E20: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4674, E=0.8247,
KL=0.2297, wKL=0.5000]
Train E20: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4674, E=0.8247,
KL=0.2297, wKL=0.5000]
Train E20: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5557, E=0.8224,
KL=0.2296, wKL=0.5000]
Train E20: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5557, E=0.8224,
KL=0.2296, wKL=0.5000]
Train E20: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4961, E=0.8286,
KL=0.2284, wKL=0.5000]
Train E20: 20%|██ | 5/25 [00:06<00:27, 1.36s/batch, N=1.4961, E=0.8286,
KL=0.2284, wKL=0.5000]
Train E20: 20%|██ | 5/25 [00:08<00:27, 1.36s/batch, N=1.5197, E=0.8290,
KL=0.2270, wKL=0.5000]
Train E20: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.5197, E=0.8290,
KL=0.2270, wKL=0.5000]
Train E20: 24%|██▍ | 6/25 [00:10<00:25, 1.37s/batch, N=1.6077, E=0.8287,
KL=0.2248, wKL=0.5000]
Train E20: 28%|██▊ | 7/25 [00:10<00:28, 1.58s/batch, N=1.6077, E=0.8287,
KL=0.2248, wKL=0.5000]
Train E20: 28%|██▊ | 7/25 [00:11<00:28, 1.58s/batch, N=1.3944, E=0.8268,
KL=0.2233, wKL=0.5000]
Train E20: 32%|███▏ | 8/25 [00:11<00:25, 1.52s/batch, N=1.3944, E=0.8268,
KL=0.2233, wKL=0.5000]
Train E20: 32%|███▏ | 8/25 [00:13<00:25, 1.52s/batch, N=1.5787, E=0.8268,
KL=0.2245, wKL=0.5000]
Train E20: 36%|███▌ | 9/25 [00:13<00:25, 1.58s/batch, N=1.5787, E=0.8268,
KL=0.2245, wKL=0.5000]
Train E20: 36%|███▌ | 9/25 [00:14<00:25, 1.58s/batch, N=1.5136, E=0.8272,
KL=0.2224, wKL=0.5000]
Train E20: 40%|████ | 10/25 [00:14<00:22, 1.52s/batch, N=1.5136, E=0.8272,
KL=0.2224, wKL=0.5000]
Train E20: 40%|████ | 10/25 [00:16<00:22, 1.52s/batch, N=1.5782, E=0.8290,
KL=0.2225, wKL=0.5000]
Train E20: 44%|████▍ | 11/25 [00:16<00:20, 1.48s/batch, N=1.5782, E=0.8290,
KL=0.2225, wKL=0.5000]
Train E20: 44%|████▍ | 11/25 [00:17<00:20, 1.48s/batch, N=1.4295, E=0.8279,
KL=0.2208, wKL=0.5000]
Train E20: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.4295, E=0.8279,
KL=0.2208, wKL=0.5000]
Train E20: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.4425, E=0.8260,
KL=0.2219, wKL=0.5000]
Train E20: 52%|█████▏ | 13/25 [00:18<00:17, 1.44s/batch, N=1.4425, E=0.8260,
KL=0.2219, wKL=0.5000]
Train E20: 52%|█████▏ | 13/25 [00:20<00:17, 1.44s/batch, N=1.5109, E=0.8263,
KL=0.2234, wKL=0.5000]
Train E20: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.5109, E=0.8263,
KL=0.2234, wKL=0.5000]
Train E20: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.4230, E=0.8272,
KL=0.2203, wKL=0.5000]
Train E20: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4230, E=0.8272,
KL=0.2203, wKL=0.5000]
Train E20: 60%|██████ | 15/25 [00:23<00:14, 1.42s/batch, N=1.4053, E=0.8277,
KL=0.2223, wKL=0.5000]
Train E20: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.4053, E=0.8277,
KL=0.2223, wKL=0.5000]
Train E20: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.5393, E=0.8283,
KL=0.2215, wKL=0.5000]
Train E20: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5393, E=0.8283,
KL=0.2215, wKL=0.5000]
Train E20: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5004, E=0.8275,
KL=0.2212, wKL=0.5000]
Train E20: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5004, E=0.8275,
KL=0.2212, wKL=0.5000]
Train E20: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4836, E=0.8266,
KL=0.2210, wKL=0.5000]
Train E20: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.4836, E=0.8266,
KL=0.2210, wKL=0.5000]
Train E20: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4796, E=0.8254,
KL=0.2208, wKL=0.5000]
Train E20: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4796, E=0.8254,
KL=0.2208, wKL=0.5000]
Train E20: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.4548, E=0.8230,
KL=0.2215, wKL=0.5000]
Train E20: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4548, E=0.8230,
KL=0.2215, wKL=0.5000]
Train E20: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4718, E=0.8208,
KL=0.2196, wKL=0.5000]
Train E20: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4718, E=0.8208,
KL=0.2196, wKL=0.5000]
Train E20: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4813, E=0.8267,
KL=0.2173, wKL=0.5000]
Train E20: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4813, E=0.8267,
KL=0.2173, wKL=0.5000]
Train E20: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4811, E=0.8236,
KL=0.2188, wKL=0.5000]
Train E20: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4811, E=0.8236,
KL=0.2188, wKL=0.5000]
Train E20: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.3494, E=0.8218,
KL=0.2192, wKL=0.5000]
Train E20: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.3494, E=0.8218,
KL=0.2192, wKL=0.5000]
Train E20: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.3494, E=0.8218,
KL=0.2192, wKL=0.5000]
755.9s 112 [Epoch 020] Total: 2.3722 | N: 1.4901 | E: 0.8262 | KL(0.50×0.5): 0.2235
755.9s 113 Saved checkpoint: /kaggle/working/checkpoints/gvae_20_epoch020.pt
790.7s 114 Train E21: 0%| | 0/25 [00:00<?, ?batch/s]
Train E21: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5098, E=0.8210, KL=0.2180,
wKL=0.5250]
Train E21: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5098, E=0.8210,
KL=0.2180, wKL=0.5250]
Train E21: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5177, E=0.8287,
KL=0.2175, wKL=0.5250]
Train E21: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.5177, E=0.8287,
KL=0.2175, wKL=0.5250]
Train E21: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.4634, E=0.8311,
KL=0.2162, wKL=0.5250]
Train E21: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4634, E=0.8311,
KL=0.2162, wKL=0.5250]
Train E21: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5852, E=0.8255,
KL=0.2175, wKL=0.5250]
Train E21: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5852, E=0.8255,
KL=0.2175, wKL=0.5250]
Train E21: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4773, E=0.8272,
KL=0.2139, wKL=0.5250]
Train E21: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4773, E=0.8272,
KL=0.2139, wKL=0.5250]
Train E21: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5737, E=0.8219,
KL=0.2151, wKL=0.5250]
Train E21: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5737, E=0.8219,
KL=0.2151, wKL=0.5250]
Train E21: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5300, E=0.8267,
KL=0.2153, wKL=0.5250]
Train E21: 28%|██▊ | 7/25 [00:09<00:26, 1.47s/batch, N=1.5300, E=0.8267,
KL=0.2153, wKL=0.5250]
Train E21: 28%|██▊ | 7/25 [00:11<00:26, 1.47s/batch, N=1.5196, E=0.8270,
KL=0.2132, wKL=0.5250]
Train E21: 32%|███▏ | 8/25 [00:11<00:24, 1.44s/batch, N=1.5196, E=0.8270,
KL=0.2132, wKL=0.5250]
Train E21: 32%|███▏ | 8/25 [00:13<00:24, 1.44s/batch, N=1.4657, E=0.8232,
KL=0.2128, wKL=0.5250]
Train E21: 36%|███▌ | 9/25 [00:13<00:25, 1.62s/batch, N=1.4657, E=0.8232,
KL=0.2128, wKL=0.5250]
Train E21: 36%|███▌ | 9/25 [00:14<00:25, 1.62s/batch, N=1.4895, E=0.8277,
KL=0.2118, wKL=0.5250]
Train E21: 40%|████ | 10/25 [00:14<00:23, 1.55s/batch, N=1.4895, E=0.8277,
KL=0.2118, wKL=0.5250]
Train E21: 40%|████ | 10/25 [00:16<00:23, 1.55s/batch, N=1.4332, E=0.8272,
KL=0.2106, wKL=0.5250]
Train E21: 44%|████▍ | 11/25 [00:16<00:21, 1.51s/batch, N=1.4332, E=0.8272,
KL=0.2106, wKL=0.5250]
Train E21: 44%|████▍ | 11/25 [00:17<00:21, 1.51s/batch, N=1.5168, E=0.8265,
KL=0.2120, wKL=0.5250]
Train E21: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.5168, E=0.8265,
KL=0.2120, wKL=0.5250]
Train E21: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.5849, E=0.8252,
KL=0.2115, wKL=0.5250]
Train E21: 52%|█████▏ | 13/25 [00:18<00:17, 1.45s/batch, N=1.5849, E=0.8252,
KL=0.2115, wKL=0.5250]
Train E21: 52%|█████▏ | 13/25 [00:20<00:17, 1.45s/batch, N=1.5195, E=0.8266,
KL=0.2101, wKL=0.5250]
Train E21: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.5195, E=0.8266,
KL=0.2101, wKL=0.5250]
Train E21: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.4705, E=0.8262,
KL=0.2129, wKL=0.5250]
Train E21: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4705, E=0.8262,
KL=0.2129, wKL=0.5250]
Train E21: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.4954, E=0.8276,
KL=0.2114, wKL=0.5250]
Train E21: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4954, E=0.8276,
KL=0.2114, wKL=0.5250]
Train E21: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4817, E=0.8210,
KL=0.2115, wKL=0.5250]
Train E21: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4817, E=0.8210,
KL=0.2115, wKL=0.5250]
Train E21: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.4048, E=0.8283,
KL=0.2103, wKL=0.5250]
Train E21: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4048, E=0.8283,
KL=0.2103, wKL=0.5250]
Train E21: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.3990, E=0.8223,
KL=0.2099, wKL=0.5250]
Train E21: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.3990, E=0.8223,
KL=0.2099, wKL=0.5250]
Train E21: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4736, E=0.8249,
KL=0.2106, wKL=0.5250]
Train E21: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4736, E=0.8249,
KL=0.2106, wKL=0.5250]
Train E21: 80%|████████ | 20/25 [00:29<00:06, 1.40s/batch, N=1.5337, E=0.8247,
KL=0.2112, wKL=0.5250]
Train E21: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.5337, E=0.8247,
KL=0.2112, wKL=0.5250]
Train E21: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.4584, E=0.8301,
KL=0.2099, wKL=0.5250]
Train E21: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4584, E=0.8301,
KL=0.2099, wKL=0.5250]
Train E21: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4004, E=0.8241,
KL=0.2091, wKL=0.5250]
Train E21: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.4004, E=0.8241,
KL=0.2091, wKL=0.5250]
Train E21: 92%|█████████▏| 23/25 [00:34<00:02, 1.38s/batch, N=1.4414, E=0.8279,
KL=0.2077, wKL=0.5250]
Train E21: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4414, E=0.8279,
KL=0.2077, wKL=0.5250]
Train E21: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5041, E=0.8297,
KL=0.2090, wKL=0.5250]
Train E21: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5041, E=0.8297,
KL=0.2090, wKL=0.5250]
Train E21: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5041, E=0.8297,
KL=0.2090, wKL=0.5250]
790.7s 115 [Epoch 021] Total: 2.3714 | N: 1.4896 | E: 0.8260 | KL(0.53×0.5): 0.2124
825.7s 116 Train E22: 0%| | 0/25 [00:00<?, ?batch/s]
Train E22: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4450, E=0.8256, KL=0.2083,
wKL=0.5500]
Train E22: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4450, E=0.8256,
KL=0.2083, wKL=0.5500]
Train E22: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5185, E=0.8231,
KL=0.2093, wKL=0.5500]
Train E22: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5185, E=0.8231,
KL=0.2093, wKL=0.5500]
Train E22: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4773, E=0.8272,
KL=0.2076, wKL=0.5500]
Train E22: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4773, E=0.8272,
KL=0.2076, wKL=0.5500]
Train E22: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4963, E=0.8232,
KL=0.2067, wKL=0.5500]
Train E22: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4963, E=0.8232,
KL=0.2067, wKL=0.5500]
Train E22: 16%|█▌ | 4/25 [00:07<00:28, 1.37s/batch, N=1.4462, E=0.8232,
KL=0.2074, wKL=0.5500]
Train E22: 20%|██ | 5/25 [00:07<00:29, 1.46s/batch, N=1.4462, E=0.8232,
KL=0.2074, wKL=0.5500]
Train E22: 20%|██ | 5/25 [00:08<00:29, 1.46s/batch, N=1.5894, E=0.8296,
KL=0.2035, wKL=0.5500]
Train E22: 24%|██▍ | 6/25 [00:08<00:27, 1.43s/batch, N=1.5894, E=0.8296,
KL=0.2035, wKL=0.5500]
Train E22: 24%|██▍ | 6/25 [00:09<00:27, 1.43s/batch, N=1.5221, E=0.8278,
KL=0.2037, wKL=0.5500]
Train E22: 28%|██▊ | 7/25 [00:09<00:25, 1.42s/batch, N=1.5221, E=0.8278,
KL=0.2037, wKL=0.5500]
Train E22: 28%|██▊ | 7/25 [00:11<00:25, 1.42s/batch, N=1.4845, E=0.8295,
KL=0.2022, wKL=0.5500]
Train E22: 32%|███▏ | 8/25 [00:11<00:24, 1.41s/batch, N=1.4845, E=0.8295,
KL=0.2022, wKL=0.5500]
Train E22: 32%|███▏ | 8/25 [00:12<00:24, 1.41s/batch, N=1.3684, E=0.8345,
KL=0.2019, wKL=0.5500]
Train E22: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.3684, E=0.8345,
KL=0.2019, wKL=0.5500]
Train E22: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.5234, E=0.8283,
KL=0.2015, wKL=0.5500]
Train E22: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.5234, E=0.8283,
KL=0.2015, wKL=0.5500]
Train E22: 40%|████ | 10/25 [00:15<00:21, 1.42s/batch, N=1.4563, E=0.8228,
KL=0.2007, wKL=0.5500]
Train E22: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4563, E=0.8228,
KL=0.2007, wKL=0.5500]
Train E22: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.6448, E=0.8277,
KL=0.2030, wKL=0.5500]
Train E22: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.6448, E=0.8277,
KL=0.2030, wKL=0.5500]
Train E22: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4910, E=0.8219,
KL=0.2035, wKL=0.5500]
Train E22: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4910, E=0.8219,
KL=0.2035, wKL=0.5500]
Train E22: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.4948, E=0.8256,
KL=0.2007, wKL=0.5500]
Train E22: 56%|█████▌ | 14/25 [00:20<00:17, 1.61s/batch, N=1.4948, E=0.8256,
KL=0.2007, wKL=0.5500]
Train E22: 56%|█████▌ | 14/25 [00:21<00:17, 1.61s/batch, N=1.4372, E=0.8277,
KL=0.2027, wKL=0.5500]
Train E22: 60%|██████ | 15/25 [00:21<00:15, 1.54s/batch, N=1.4372, E=0.8277,
KL=0.2027, wKL=0.5500]
Train E22: 60%|██████ | 15/25 [00:23<00:15, 1.54s/batch, N=1.4764, E=0.8237,
KL=0.2005, wKL=0.5500]
Train E22: 64%|██████▍ | 16/25 [00:23<00:13, 1.50s/batch, N=1.4764, E=0.8237,
KL=0.2005, wKL=0.5500]
Train E22: 64%|██████▍ | 16/25 [00:24<00:13, 1.50s/batch, N=1.5053, E=0.8242,
KL=0.2018, wKL=0.5500]
Train E22: 68%|██████▊ | 17/25 [00:24<00:11, 1.47s/batch, N=1.5053, E=0.8242,
KL=0.2018, wKL=0.5500]
Train E22: 68%|██████▊ | 17/25 [00:25<00:11, 1.47s/batch, N=1.4938, E=0.8268,
KL=0.1999, wKL=0.5500]
Train E22: 72%|███████▏ | 18/25 [00:25<00:10, 1.45s/batch, N=1.4938, E=0.8268,
KL=0.1999, wKL=0.5500]
Train E22: 72%|███████▏ | 18/25 [00:27<00:10, 1.45s/batch, N=1.4765, E=0.8251,
KL=0.1995, wKL=0.5500]
Train E22: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.4765, E=0.8251,
KL=0.1995, wKL=0.5500]
Train E22: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.5090, E=0.8242,
KL=0.2015, wKL=0.5500]
Train E22: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5090, E=0.8242,
KL=0.2015, wKL=0.5500]
Train E22: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4856, E=0.8252,
KL=0.1976, wKL=0.5500]
Train E22: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4856, E=0.8252,
KL=0.1976, wKL=0.5500]
Train E22: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.4631, E=0.8223,
KL=0.2002, wKL=0.5500]
Train E22: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.4631, E=0.8223,
KL=0.2002, wKL=0.5500]
Train E22: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5016, E=0.8244,
KL=0.1999, wKL=0.5500]
Train E22: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5016, E=0.8244,
KL=0.1999, wKL=0.5500]
Train E22: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4187, E=0.8249,
KL=0.1979, wKL=0.5500]
Train E22: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4187, E=0.8249,
KL=0.1979, wKL=0.5500]
Train E22: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5344, E=0.8275,
KL=0.1993, wKL=0.5500]
Train E22: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5344, E=0.8275,
KL=0.1993, wKL=0.5500]
Train E22: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5344, E=0.8275,
KL=0.1993, wKL=0.5500]
825.7s 117 [Epoch 022] Total: 2.3708 | N: 1.4893 | E: 0.8258 | KL(0.55×0.5): 0.2025
860.9s 118 Train E23: 0%| | 0/25 [00:00<?, ?batch/s]
Train E23: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5300, E=0.8231, KL=0.1997,
wKL=0.5750]
Train E23: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5300, E=0.8231,
KL=0.1997, wKL=0.5750]
Train E23: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5106, E=0.8261,
KL=0.2004, wKL=0.5750]
Train E23: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.5106, E=0.8261,
KL=0.2004, wKL=0.5750]
Train E23: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.4288, E=0.8297,
KL=0.1955, wKL=0.5750]
Train E23: 12%|█▏ | 3/25 [00:04<00:32, 1.47s/batch, N=1.4288, E=0.8297,
KL=0.1955, wKL=0.5750]
Train E23: 12%|█▏ | 3/25 [00:05<00:32, 1.47s/batch, N=1.4177, E=0.8252,
KL=0.1966, wKL=0.5750]
Train E23: 16%|█▌ | 4/25 [00:05<00:30, 1.44s/batch, N=1.4177, E=0.8252,
KL=0.1966, wKL=0.5750]
Train E23: 16%|█▌ | 4/25 [00:07<00:30, 1.44s/batch, N=1.4072, E=0.8228,
KL=0.1960, wKL=0.5750]
Train E23: 20%|██ | 5/25 [00:07<00:28, 1.42s/batch, N=1.4072, E=0.8228,
KL=0.1960, wKL=0.5750]
Train E23: 20%|██ | 5/25 [00:08<00:28, 1.42s/batch, N=1.3849, E=0.8260,
KL=0.1940, wKL=0.5750]
Train E23: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.3849, E=0.8260,
KL=0.1940, wKL=0.5750]
Train E23: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5069, E=0.8195,
KL=0.1950, wKL=0.5750]
Train E23: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5069, E=0.8195,
KL=0.1950, wKL=0.5750]
Train E23: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4894, E=0.8278,
KL=0.1940, wKL=0.5750]
Train E23: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4894, E=0.8278,
KL=0.1940, wKL=0.5750]
Train E23: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5203, E=0.8227,
KL=0.1948, wKL=0.5750]
Train E23: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5203, E=0.8227,
KL=0.1948, wKL=0.5750]
Train E23: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.5059, E=0.8244,
KL=0.1938, wKL=0.5750]
Train E23: 40%|████ | 10/25 [00:14<00:20, 1.40s/batch, N=1.5059, E=0.8244,
KL=0.1938, wKL=0.5750]
Train E23: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4490, E=0.8256,
KL=0.1900, wKL=0.5750]
Train E23: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4490, E=0.8256,
KL=0.1900, wKL=0.5750]
Train E23: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4577, E=0.8250,
KL=0.1937, wKL=0.5750]
Train E23: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4577, E=0.8250,
KL=0.1937, wKL=0.5750]
Train E23: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.5089, E=0.8263,
KL=0.1905, wKL=0.5750]
Train E23: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5089, E=0.8263,
KL=0.1905, wKL=0.5750]
Train E23: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5315, E=0.8321,
KL=0.1929, wKL=0.5750]
Train E23: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5315, E=0.8321,
KL=0.1929, wKL=0.5750]
Train E23: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5533, E=0.8261,
KL=0.1915, wKL=0.5750]
Train E23: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5533, E=0.8261,
KL=0.1915, wKL=0.5750]
Train E23: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4476, E=0.8238,
KL=0.1910, wKL=0.5750]
Train E23: 64%|██████▍ | 16/25 [00:23<00:14, 1.59s/batch, N=1.4476, E=0.8238,
KL=0.1910, wKL=0.5750]
Train E23: 64%|██████▍ | 16/25 [00:24<00:14, 1.59s/batch, N=1.5785, E=0.8269,
KL=0.1936, wKL=0.5750]
Train E23: 68%|██████▊ | 17/25 [00:24<00:12, 1.53s/batch, N=1.5785, E=0.8269,
KL=0.1936, wKL=0.5750]
Train E23: 68%|██████▊ | 17/25 [00:26<00:12, 1.53s/batch, N=1.5131, E=0.8269,
KL=0.1897, wKL=0.5750]
Train E23: 72%|███████▏ | 18/25 [00:26<00:10, 1.51s/batch, N=1.5131, E=0.8269,
KL=0.1897, wKL=0.5750]
Train E23: 72%|███████▏ | 18/25 [00:27<00:10, 1.51s/batch, N=1.4914, E=0.8276,
KL=0.1936, wKL=0.5750]
Train E23: 76%|███████▌ | 19/25 [00:27<00:08, 1.48s/batch, N=1.4914, E=0.8276,
KL=0.1936, wKL=0.5750]
Train E23: 76%|███████▌ | 19/25 [00:28<00:08, 1.48s/batch, N=1.5348, E=0.8265,
KL=0.1907, wKL=0.5750]
Train E23: 80%|████████ | 20/25 [00:28<00:07, 1.45s/batch, N=1.5348, E=0.8265,
KL=0.1907, wKL=0.5750]
Train E23: 80%|████████ | 20/25 [00:30<00:07, 1.45s/batch, N=1.5551, E=0.8236,
KL=0.1917, wKL=0.5750]
Train E23: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.5551, E=0.8236,
KL=0.1917, wKL=0.5750]
Train E23: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.4673, E=0.8266,
KL=0.1926, wKL=0.5750]
Train E23: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4673, E=0.8266,
KL=0.1926, wKL=0.5750]
Train E23: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.3957, E=0.8260,
KL=0.1887, wKL=0.5750]
Train E23: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.3957, E=0.8260,
KL=0.1887, wKL=0.5750]
Train E23: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5596, E=0.8263,
KL=0.1921, wKL=0.5750]
Train E23: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.5596, E=0.8263,
KL=0.1921, wKL=0.5750]
Train E23: 96%|█████████▌| 24/25 [00:35<00:01, 1.43s/batch, N=1.4980, E=0.8318,
KL=0.1911, wKL=0.5750]
Train E23: 100%|██████████| 25/25 [00:35<00:00, 1.24s/batch, N=1.4980, E=0.8318,
KL=0.1911, wKL=0.5750]
Train E23: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4980, E=0.8318,
KL=0.1911, wKL=0.5750]
860.9s 119 [Epoch 023] Total: 2.3709 | N: 1.4895 | E: 0.8258 | KL(0.57×0.5): 0.1934
895.8s 120 Train E24: 0%| | 0/25 [00:00<?, ?batch/s]
Train E24: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5214, E=0.8300, KL=0.1897,
wKL=0.6000]
Train E24: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5214, E=0.8300,
KL=0.1897, wKL=0.6000]
Train E24: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4791, E=0.8293,
KL=0.1908, wKL=0.6000]
Train E24: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4791, E=0.8293,
KL=0.1908, wKL=0.6000]
Train E24: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.5493, E=0.8232,
KL=0.1886, wKL=0.6000]
Train E24: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5493, E=0.8232,
KL=0.1886, wKL=0.6000]
Train E24: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.5567, E=0.8266,
KL=0.1891, wKL=0.6000]
Train E24: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.5567, E=0.8266,
KL=0.1891, wKL=0.6000]
Train E24: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4458, E=0.8216,
KL=0.1898, wKL=0.6000]
Train E24: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4458, E=0.8216,
KL=0.1898, wKL=0.6000]
Train E24: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4449, E=0.8214,
KL=0.1841, wKL=0.6000]
Train E24: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4449, E=0.8214,
KL=0.1841, wKL=0.6000]
Train E24: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4670, E=0.8253,
KL=0.1860, wKL=0.6000]
Train E24: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4670, E=0.8253,
KL=0.1860, wKL=0.6000]
Train E24: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4734, E=0.8256,
KL=0.1853, wKL=0.6000]
Train E24: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4734, E=0.8256,
KL=0.1853, wKL=0.6000]
Train E24: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5170, E=0.8210,
KL=0.1827, wKL=0.6000]
Train E24: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5170, E=0.8210,
KL=0.1827, wKL=0.6000]
Train E24: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4594, E=0.8295,
KL=0.1843, wKL=0.6000]
Train E24: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4594, E=0.8295,
KL=0.1843, wKL=0.6000]
Train E24: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.5517, E=0.8258,
KL=0.1840, wKL=0.6000]
Train E24: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.5517, E=0.8258,
KL=0.1840, wKL=0.6000]
Train E24: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4509, E=0.8292,
KL=0.1810, wKL=0.6000]
Train E24: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4509, E=0.8292,
KL=0.1810, wKL=0.6000]
Train E24: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4944, E=0.8252,
KL=0.1848, wKL=0.6000]
Train E24: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4944, E=0.8252,
KL=0.1848, wKL=0.6000]
Train E24: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5482, E=0.8180,
KL=0.1844, wKL=0.6000]
Train E24: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5482, E=0.8180,
KL=0.1844, wKL=0.6000]
Train E24: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.5802, E=0.8256,
KL=0.1864, wKL=0.6000]
Train E24: 60%|██████ | 15/25 [00:20<00:14, 1.41s/batch, N=1.5802, E=0.8256,
KL=0.1864, wKL=0.6000]
Train E24: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.4072, E=0.8256,
KL=0.1835, wKL=0.6000]
Train E24: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4072, E=0.8256,
KL=0.1835, wKL=0.6000]
Train E24: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.3967, E=0.8277,
KL=0.1825, wKL=0.6000]
Train E24: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.3967, E=0.8277,
KL=0.1825, wKL=0.6000]
Train E24: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4961, E=0.8282,
KL=0.1856, wKL=0.6000]
Train E24: 72%|███████▏ | 18/25 [00:24<00:09, 1.39s/batch, N=1.4961, E=0.8282,
KL=0.1856, wKL=0.6000]
Train E24: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4918, E=0.8258,
KL=0.1834, wKL=0.6000]
Train E24: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4918, E=0.8258,
KL=0.1834, wKL=0.6000]
Train E24: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5006, E=0.8242,
KL=0.1837, wKL=0.6000]
Train E24: 80%|████████ | 20/25 [00:28<00:07, 1.58s/batch, N=1.5006, E=0.8242,
KL=0.1837, wKL=0.6000]
Train E24: 80%|████████ | 20/25 [00:29<00:07, 1.58s/batch, N=1.4928, E=0.8284,
KL=0.1838, wKL=0.6000]
Train E24: 84%|████████▍ | 21/25 [00:29<00:06, 1.54s/batch, N=1.4928, E=0.8284,
KL=0.1838, wKL=0.6000]
Train E24: 84%|████████▍ | 21/25 [00:31<00:06, 1.54s/batch, N=1.4682, E=0.8226,
KL=0.1824, wKL=0.6000]
Train E24: 88%|████████▊ | 22/25 [00:31<00:04, 1.54s/batch, N=1.4682, E=0.8226,
KL=0.1824, wKL=0.6000]
Train E24: 88%|████████▊ | 22/25 [00:32<00:04, 1.54s/batch, N=1.4533, E=0.8305,
KL=0.1819, wKL=0.6000]
Train E24: 92%|█████████▏| 23/25 [00:32<00:03, 1.51s/batch, N=1.4533, E=0.8305,
KL=0.1819, wKL=0.6000]
Train E24: 92%|█████████▏| 23/25 [00:34<00:03, 1.51s/batch, N=1.5288, E=0.8235,
KL=0.1823, wKL=0.6000]
Train E24: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.5288, E=0.8235,
KL=0.1823, wKL=0.6000]
Train E24: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.4011, E=0.8268,
KL=0.1785, wKL=0.6000]
Train E24: 100%|██████████| 25/25 [00:34<00:00, 1.22s/batch, N=1.4011, E=0.8268,
KL=0.1785, wKL=0.6000]
Train E24: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4011, E=0.8268,
KL=0.1785, wKL=0.6000]
895.8s 121 [Epoch 024] Total: 2.3702 | N: 1.4891 | E: 0.8256 | KL(0.60×0.5): 0.1849
930.8s 122 Train E25: 0%| | 0/25 [00:00<?, ?batch/s]
Train E25: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5353, E=0.8301, KL=0.1801,
wKL=0.6250]
Train E25: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5353, E=0.8301,
KL=0.1801, wKL=0.6250]
Train E25: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5337, E=0.8260,
KL=0.1806, wKL=0.6250]
Train E25: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5337, E=0.8260,
KL=0.1806, wKL=0.6250]
Train E25: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4798, E=0.8257,
KL=0.1783, wKL=0.6250]
Train E25: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4798, E=0.8257,
KL=0.1783, wKL=0.6250]
Train E25: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5227, E=0.8266,
KL=0.1803, wKL=0.6250]
Train E25: 16%|█▌ | 4/25 [00:05<00:29, 1.42s/batch, N=1.5227, E=0.8266,
KL=0.1803, wKL=0.6250]
Train E25: 16%|█▌ | 4/25 [00:07<00:29, 1.42s/batch, N=1.4464, E=0.8266,
KL=0.1784, wKL=0.6250]
Train E25: 20%|██ | 5/25 [00:07<00:28, 1.41s/batch, N=1.4464, E=0.8266,
KL=0.1784, wKL=0.6250]
Train E25: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.4993, E=0.8181,
KL=0.1793, wKL=0.6250]
Train E25: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4993, E=0.8181,
KL=0.1793, wKL=0.6250]
Train E25: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.4512, E=0.8302,
KL=0.1785, wKL=0.6250]
Train E25: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4512, E=0.8302,
KL=0.1785, wKL=0.6250]
Train E25: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4525, E=0.8227,
KL=0.1783, wKL=0.6250]
Train E25: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4525, E=0.8227,
KL=0.1783, wKL=0.6250]
Train E25: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4172, E=0.8237,
KL=0.1772, wKL=0.6250]
Train E25: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4172, E=0.8237,
KL=0.1772, wKL=0.6250]
Train E25: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.4498, E=0.8272,
KL=0.1769, wKL=0.6250]
Train E25: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4498, E=0.8272,
KL=0.1769, wKL=0.6250]
Train E25: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5047, E=0.8297,
KL=0.1745, wKL=0.6250]
Train E25: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5047, E=0.8297,
KL=0.1745, wKL=0.6250]
Train E25: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5093, E=0.8220,
KL=0.1783, wKL=0.6250]
Train E25: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5093, E=0.8220,
KL=0.1783, wKL=0.6250]
Train E25: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4370, E=0.8274,
KL=0.1743, wKL=0.6250]
Train E25: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4370, E=0.8274,
KL=0.1743, wKL=0.6250]
Train E25: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.6212, E=0.8199,
KL=0.1776, wKL=0.6250]
Train E25: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.6212, E=0.8199,
KL=0.1776, wKL=0.6250]
Train E25: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5419, E=0.8209,
KL=0.1768, wKL=0.6250]
Train E25: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.5419, E=0.8209,
KL=0.1768, wKL=0.6250]
Train E25: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4988, E=0.8204,
KL=0.1756, wKL=0.6250]
Train E25: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4988, E=0.8204,
KL=0.1756, wKL=0.6250]
Train E25: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.5101, E=0.8248,
KL=0.1758, wKL=0.6250]
Train E25: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5101, E=0.8248,
KL=0.1758, wKL=0.6250]
Train E25: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4803, E=0.8256,
KL=0.1742, wKL=0.6250]
Train E25: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4803, E=0.8256,
KL=0.1742, wKL=0.6250]
Train E25: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4426, E=0.8228,
KL=0.1753, wKL=0.6250]
Train E25: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4426, E=0.8228,
KL=0.1753, wKL=0.6250]
Train E25: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4549, E=0.8227,
KL=0.1766, wKL=0.6250]
Train E25: 80%|████████ | 20/25 [00:28<00:07, 1.44s/batch, N=1.4549, E=0.8227,
KL=0.1766, wKL=0.6250]
Train E25: 80%|████████ | 20/25 [00:29<00:07, 1.44s/batch, N=1.5083, E=0.8313,
KL=0.1723, wKL=0.6250]
Train E25: 84%|████████▍ | 21/25 [00:29<00:05, 1.46s/batch, N=1.5083, E=0.8313,
KL=0.1723, wKL=0.6250]
Train E25: 84%|████████▍ | 21/25 [00:30<00:05, 1.46s/batch, N=1.4604, E=0.8243,
KL=0.1754, wKL=0.6250]
Train E25: 88%|████████▊ | 22/25 [00:30<00:04, 1.44s/batch, N=1.4604, E=0.8243,
KL=0.1754, wKL=0.6250]
Train E25: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.5875, E=0.8262,
KL=0.1762, wKL=0.6250]
Train E25: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.5875, E=0.8262,
KL=0.1762, wKL=0.6250]
Train E25: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.3709, E=0.8264,
KL=0.1742, wKL=0.6250]
Train E25: 96%|█████████▌| 24/25 [00:34<00:01, 1.61s/batch, N=1.3709, E=0.8264,
KL=0.1742, wKL=0.6250]
Train E25: 96%|█████████▌| 24/25 [00:35<00:01, 1.61s/batch, N=1.5278, E=0.8294,
KL=0.1759, wKL=0.6250]
Train E25: 100%|██████████| 25/25 [00:35<00:00, 1.32s/batch, N=1.5278, E=0.8294,
KL=0.1759, wKL=0.6250]
Train E25: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5278, E=0.8294,
KL=0.1759, wKL=0.6250]
930.8s 123 [Epoch 025] Total: 2.3692 | N: 1.4888 | E: 0.8251 | KL(0.62×0.5): 0.1769
965.3s 124 Train E26: 0%| | 0/25 [00:00<?, ?batch/s]
Train E26: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4241, E=0.8307, KL=0.1734,
wKL=0.6500]
Train E26: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4241, E=0.8307,
KL=0.1734, wKL=0.6500]
Train E26: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4363, E=0.8242,
KL=0.1743, wKL=0.6500]
Train E26: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4363, E=0.8242,
KL=0.1743, wKL=0.6500]
Train E26: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.5535, E=0.8239,
KL=0.1720, wKL=0.6500]
Train E26: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5535, E=0.8239,
KL=0.1720, wKL=0.6500]
Train E26: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4711, E=0.8247,
KL=0.1716, wKL=0.6500]
Train E26: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4711, E=0.8247,
KL=0.1716, wKL=0.6500]
Train E26: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.6623, E=0.8266,
KL=0.1748, wKL=0.6500]
Train E26: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.6623, E=0.8266,
KL=0.1748, wKL=0.6500]
Train E26: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4884, E=0.8230,
KL=0.1702, wKL=0.6500]
Train E26: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4884, E=0.8230,
KL=0.1702, wKL=0.6500]
Train E26: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4805, E=0.8266,
KL=0.1710, wKL=0.6500]
Train E26: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4805, E=0.8266,
KL=0.1710, wKL=0.6500]
Train E26: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4945, E=0.8230,
KL=0.1707, wKL=0.6500]
Train E26: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.4945, E=0.8230,
KL=0.1707, wKL=0.6500]
Train E26: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.4994, E=0.8272,
KL=0.1689, wKL=0.6500]
Train E26: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.4994, E=0.8272,
KL=0.1689, wKL=0.6500]
Train E26: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.4202, E=0.8284,
KL=0.1706, wKL=0.6500]
Train E26: 40%|████ | 10/25 [00:13<00:21, 1.41s/batch, N=1.4202, E=0.8284,
KL=0.1706, wKL=0.6500]
Train E26: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4906, E=0.8257,
KL=0.1703, wKL=0.6500]
Train E26: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4906, E=0.8257,
KL=0.1703, wKL=0.6500]
Train E26: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4588, E=0.8290,
KL=0.1689, wKL=0.6500]
Train E26: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4588, E=0.8290,
KL=0.1689, wKL=0.6500]
Train E26: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5497, E=0.8289,
KL=0.1709, wKL=0.6500]
Train E26: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5497, E=0.8289,
KL=0.1709, wKL=0.6500]
Train E26: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4802, E=0.8266,
KL=0.1679, wKL=0.6500]
Train E26: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4802, E=0.8266,
KL=0.1679, wKL=0.6500]
Train E26: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5013, E=0.8279,
KL=0.1706, wKL=0.6500]
Train E26: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.5013, E=0.8279,
KL=0.1706, wKL=0.6500]
Train E26: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5297, E=0.8234,
KL=0.1699, wKL=0.6500]
Train E26: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5297, E=0.8234,
KL=0.1699, wKL=0.6500]
Train E26: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4244, E=0.8228,
KL=0.1680, wKL=0.6500]
Train E26: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4244, E=0.8228,
KL=0.1680, wKL=0.6500]
Train E26: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5109, E=0.8195,
KL=0.1701, wKL=0.6500]
Train E26: 72%|███████▏ | 18/25 [00:25<00:10, 1.45s/batch, N=1.5109, E=0.8195,
KL=0.1701, wKL=0.6500]
Train E26: 72%|███████▏ | 18/25 [00:26<00:10, 1.45s/batch, N=1.4531, E=0.8178,
KL=0.1668, wKL=0.6500]
Train E26: 76%|███████▌ | 19/25 [00:26<00:08, 1.45s/batch, N=1.4531, E=0.8178,
KL=0.1668, wKL=0.6500]
Train E26: 76%|███████▌ | 19/25 [00:28<00:08, 1.45s/batch, N=1.4466, E=0.8198,
KL=0.1667, wKL=0.6500]
Train E26: 80%|████████ | 20/25 [00:28<00:07, 1.44s/batch, N=1.4466, E=0.8198,
KL=0.1667, wKL=0.6500]
Train E26: 80%|████████ | 20/25 [00:29<00:07, 1.44s/batch, N=1.4801, E=0.8170,
KL=0.1695, wKL=0.6500]
Train E26: 84%|████████▍ | 21/25 [00:29<00:05, 1.43s/batch, N=1.4801, E=0.8170,
KL=0.1695, wKL=0.6500]
Train E26: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4348, E=0.8250,
KL=0.1663, wKL=0.6500]
Train E26: 88%|████████▊ | 22/25 [00:30<00:04, 1.44s/batch, N=1.4348, E=0.8250,
KL=0.1663, wKL=0.6500]
Train E26: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.6050, E=0.8220,
KL=0.1687, wKL=0.6500]
Train E26: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.6050, E=0.8220,
KL=0.1687, wKL=0.6500]
Train E26: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4663, E=0.8265,
KL=0.1667, wKL=0.6500]
Train E26: 96%|█████████▌| 24/25 [00:33<00:01, 1.42s/batch, N=1.4663, E=0.8265,
KL=0.1667, wKL=0.6500]
Train E26: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4224, E=0.8277,
KL=0.1662, wKL=0.6500]
Train E26: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.4224, E=0.8277,
KL=0.1662, wKL=0.6500]
Train E26: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4224, E=0.8277,
KL=0.1662, wKL=0.6500]
965.3s 125 [Epoch 026] Total: 2.3688 | N: 1.4889 | E: 0.8246 | KL(0.65×0.5): 0.1699
1000.4s 126 Train E27: 0%| | 0/25 [00:00<?, ?batch/s]
Train E27: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5394, E=0.8218, KL=0.1679,
wKL=0.6750]
Train E27: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5394, E=0.8218,
KL=0.1679, wKL=0.6750]
Train E27: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4821, E=0.8224,
KL=0.1674, wKL=0.6750]
Train E27: 8%|▊ | 2/25 [00:02<00:32, 1.39s/batch, N=1.4821, E=0.8224,
KL=0.1674, wKL=0.6750]
Train E27: 8%|▊ | 2/25 [00:04<00:32, 1.39s/batch, N=1.5124, E=0.8247,
KL=0.1666, wKL=0.6750]
Train E27: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5124, E=0.8247,
KL=0.1666, wKL=0.6750]
Train E27: 12%|█▏ | 3/25 [00:06<00:30, 1.39s/batch, N=1.5776, E=0.8220,
KL=0.1668, wKL=0.6750]
Train E27: 16%|█▌ | 4/25 [00:06<00:35, 1.68s/batch, N=1.5776, E=0.8220,
KL=0.1668, wKL=0.6750]
Train E27: 16%|█▌ | 4/25 [00:07<00:35, 1.68s/batch, N=1.4154, E=0.8297,
KL=0.1645, wKL=0.6750]
Train E27: 20%|██ | 5/25 [00:07<00:31, 1.59s/batch, N=1.4154, E=0.8297,
KL=0.1645, wKL=0.6750]
Train E27: 20%|██ | 5/25 [00:09<00:31, 1.59s/batch, N=1.4853, E=0.8218,
KL=0.1644, wKL=0.6750]
Train E27: 24%|██▍ | 6/25 [00:09<00:28, 1.52s/batch, N=1.4853, E=0.8218,
KL=0.1644, wKL=0.6750]
Train E27: 24%|██▍ | 6/25 [00:10<00:28, 1.52s/batch, N=1.4476, E=0.8222,
KL=0.1640, wKL=0.6750]
Train E27: 28%|██▊ | 7/25 [00:10<00:26, 1.48s/batch, N=1.4476, E=0.8222,
KL=0.1640, wKL=0.6750]
Train E27: 28%|██▊ | 7/25 [00:11<00:26, 1.48s/batch, N=1.5324, E=0.8213,
KL=0.1652, wKL=0.6750]
Train E27: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.5324, E=0.8213,
KL=0.1652, wKL=0.6750]
Train E27: 32%|███▏ | 8/25 [00:13<00:24, 1.45s/batch, N=1.5175, E=0.8235,
KL=0.1642, wKL=0.6750]
Train E27: 36%|███▌ | 9/25 [00:13<00:22, 1.43s/batch, N=1.5175, E=0.8235,
KL=0.1642, wKL=0.6750]
Train E27: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4774, E=0.8272,
KL=0.1622, wKL=0.6750]
Train E27: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.4774, E=0.8272,
KL=0.1622, wKL=0.6750]
Train E27: 40%|████ | 10/25 [00:16<00:21, 1.42s/batch, N=1.4666, E=0.8253,
KL=0.1639, wKL=0.6750]
Train E27: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.4666, E=0.8253,
KL=0.1639, wKL=0.6750]
Train E27: 44%|████▍ | 11/25 [00:17<00:19, 1.42s/batch, N=1.4312, E=0.8232,
KL=0.1627, wKL=0.6750]
Train E27: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4312, E=0.8232,
KL=0.1627, wKL=0.6750]
Train E27: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4510, E=0.8269,
KL=0.1630, wKL=0.6750]
Train E27: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4510, E=0.8269,
KL=0.1630, wKL=0.6750]
Train E27: 52%|█████▏ | 13/25 [00:20<00:16, 1.41s/batch, N=1.4915, E=0.8250,
KL=0.1641, wKL=0.6750]
Train E27: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4915, E=0.8250,
KL=0.1641, wKL=0.6750]
Train E27: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4872, E=0.8192,
KL=0.1654, wKL=0.6750]
Train E27: 60%|██████ | 15/25 [00:21<00:14, 1.45s/batch, N=1.4872, E=0.8192,
KL=0.1654, wKL=0.6750]
Train E27: 60%|██████ | 15/25 [00:23<00:14, 1.45s/batch, N=1.5341, E=0.8243,
KL=0.1633, wKL=0.6750]
Train E27: 64%|██████▍ | 16/25 [00:23<00:13, 1.45s/batch, N=1.5341, E=0.8243,
KL=0.1633, wKL=0.6750]
Train E27: 64%|██████▍ | 16/25 [00:24<00:13, 1.45s/batch, N=1.4783, E=0.8250,
KL=0.1645, wKL=0.6750]
Train E27: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.4783, E=0.8250,
KL=0.1645, wKL=0.6750]
Train E27: 68%|██████▊ | 17/25 [00:26<00:11, 1.44s/batch, N=1.4938, E=0.8233,
KL=0.1624, wKL=0.6750]
Train E27: 72%|███████▏ | 18/25 [00:26<00:10, 1.44s/batch, N=1.4938, E=0.8233,
KL=0.1624, wKL=0.6750]
Train E27: 72%|███████▏ | 18/25 [00:27<00:10, 1.44s/batch, N=1.5440, E=0.8236,
KL=0.1649, wKL=0.6750]
Train E27: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.5440, E=0.8236,
KL=0.1649, wKL=0.6750]
Train E27: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.4384, E=0.8155,
KL=0.1632, wKL=0.6750]
Train E27: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4384, E=0.8155,
KL=0.1632, wKL=0.6750]
Train E27: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4082, E=0.8276,
KL=0.1638, wKL=0.6750]
Train E27: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4082, E=0.8276,
KL=0.1638, wKL=0.6750]
Train E27: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5292, E=0.8252,
KL=0.1646, wKL=0.6750]
Train E27: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5292, E=0.8252,
KL=0.1646, wKL=0.6750]
Train E27: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.5363, E=0.8221,
KL=0.1621, wKL=0.6750]
Train E27: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.5363, E=0.8221,
KL=0.1621, wKL=0.6750]
Train E27: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4207, E=0.8267,
KL=0.1626, wKL=0.6750]
Train E27: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4207, E=0.8267,
KL=0.1626, wKL=0.6750]
Train E27: 96%|█████████▌| 24/25 [00:35<00:01, 1.41s/batch, N=1.5373, E=0.8203,
KL=0.1621, wKL=0.6750]
Train E27: 100%|██████████| 25/25 [00:35<00:00, 1.17s/batch, N=1.5373, E=0.8203,
KL=0.1621, wKL=0.6750]
Train E27: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5373, E=0.8203,
KL=0.1621, wKL=0.6750]
1000.4s 127 [Epoch 027] Total: 2.3674 | N: 1.4882 | E: 0.8237 | KL(0.68×0.5):
0.1643
1035.2s 128 Train E28: 0%| | 0/25 [00:00<?, ?batch/s]
Train E28: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4465, E=0.8288, KL=0.1615,
wKL=0.7000]
Train E28: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4465, E=0.8288,
KL=0.1615, wKL=0.7000]
Train E28: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4221, E=0.8226,
KL=0.1611, wKL=0.7000]
Train E28: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4221, E=0.8226,
KL=0.1611, wKL=0.7000]
Train E28: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4262, E=0.8246,
KL=0.1607, wKL=0.7000]
Train E28: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4262, E=0.8246,
KL=0.1607, wKL=0.7000]
Train E28: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4637, E=0.8223,
KL=0.1621, wKL=0.7000]
Train E28: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4637, E=0.8223,
KL=0.1621, wKL=0.7000]
Train E28: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5387, E=0.8274,
KL=0.1594, wKL=0.7000]
Train E28: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5387, E=0.8274,
KL=0.1594, wKL=0.7000]
Train E28: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4756, E=0.8190,
KL=0.1613, wKL=0.7000]
Train E28: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.4756, E=0.8190,
KL=0.1613, wKL=0.7000]
Train E28: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.5217, E=0.8216,
KL=0.1604, wKL=0.7000]
Train E28: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.5217, E=0.8216,
KL=0.1604, wKL=0.7000]
Train E28: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4858, E=0.8216,
KL=0.1607, wKL=0.7000]
Train E28: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4858, E=0.8216,
KL=0.1607, wKL=0.7000]
Train E28: 32%|███▏ | 8/25 [00:13<00:23, 1.40s/batch, N=1.4428, E=0.8249,
KL=0.1578, wKL=0.7000]
Train E28: 36%|███▌ | 9/25 [00:13<00:25, 1.59s/batch, N=1.4428, E=0.8249,
KL=0.1578, wKL=0.7000]
Train E28: 36%|███▌ | 9/25 [00:14<00:25, 1.59s/batch, N=1.5075, E=0.8192,
KL=0.1592, wKL=0.7000]
Train E28: 40%|████ | 10/25 [00:14<00:22, 1.53s/batch, N=1.5075, E=0.8192,
KL=0.1592, wKL=0.7000]
Train E28: 40%|████ | 10/25 [00:15<00:22, 1.53s/batch, N=1.4390, E=0.8229,
KL=0.1590, wKL=0.7000]
Train E28: 44%|████▍ | 11/25 [00:15<00:20, 1.48s/batch, N=1.4390, E=0.8229,
KL=0.1590, wKL=0.7000]
Train E28: 44%|████▍ | 11/25 [00:17<00:20, 1.48s/batch, N=1.4187, E=0.8218,
KL=0.1563, wKL=0.7000]
Train E28: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.4187, E=0.8218,
KL=0.1563, wKL=0.7000]
Train E28: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.5216, E=0.8252,
KL=0.1581, wKL=0.7000]
Train E28: 52%|█████▏ | 13/25 [00:18<00:17, 1.49s/batch, N=1.5216, E=0.8252,
KL=0.1581, wKL=0.7000]
Train E28: 52%|█████▏ | 13/25 [00:20<00:17, 1.49s/batch, N=1.4731, E=0.8204,
KL=0.1575, wKL=0.7000]
Train E28: 56%|█████▌ | 14/25 [00:20<00:16, 1.49s/batch, N=1.4731, E=0.8204,
KL=0.1575, wKL=0.7000]
Train E28: 56%|█████▌ | 14/25 [00:21<00:16, 1.49s/batch, N=1.5030, E=0.8283,
KL=0.1587, wKL=0.7000]
Train E28: 60%|██████ | 15/25 [00:21<00:14, 1.46s/batch, N=1.5030, E=0.8283,
KL=0.1587, wKL=0.7000]
Train E28: 60%|██████ | 15/25 [00:23<00:14, 1.46s/batch, N=1.5571, E=0.8262,
KL=0.1585, wKL=0.7000]
Train E28: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.5571, E=0.8262,
KL=0.1585, wKL=0.7000]
Train E28: 64%|██████▍ | 16/25 [00:24<00:12, 1.44s/batch, N=1.5183, E=0.8212,
KL=0.1586, wKL=0.7000]
Train E28: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.5183, E=0.8212,
KL=0.1586, wKL=0.7000]
Train E28: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.6235, E=0.8213,
KL=0.1600, wKL=0.7000]
Train E28: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.6235, E=0.8213,
KL=0.1600, wKL=0.7000]
Train E28: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4393, E=0.8235,
KL=0.1575, wKL=0.7000]
Train E28: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.4393, E=0.8235,
KL=0.1575, wKL=0.7000]
Train E28: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4900, E=0.8218,
KL=0.1594, wKL=0.7000]
Train E28: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4900, E=0.8218,
KL=0.1594, wKL=0.7000]
Train E28: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.5360, E=0.8193,
KL=0.1586, wKL=0.7000]
Train E28: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5360, E=0.8193,
KL=0.1586, wKL=0.7000]
Train E28: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5623, E=0.8235,
KL=0.1598, wKL=0.7000]
Train E28: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5623, E=0.8235,
KL=0.1598, wKL=0.7000]
Train E28: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.3578, E=0.8217,
KL=0.1563, wKL=0.7000]
Train E28: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.3578, E=0.8217,
KL=0.1563, wKL=0.7000]
Train E28: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5508, E=0.8266,
KL=0.1567, wKL=0.7000]
Train E28: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5508, E=0.8266,
KL=0.1567, wKL=0.7000]
Train E28: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4833, E=0.8317,
KL=0.1573, wKL=0.7000]
Train E28: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4833, E=0.8317,
KL=0.1573, wKL=0.7000]
Train E28: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4833, E=0.8317,
KL=0.1573, wKL=0.7000]
1035.2s 129 [Epoch 028] Total: 2.3673 | N: 1.4883 | E: 0.8233 | KL(0.70×0.5):
0.1591
1070.0s 130 Train E29: 0%| | 0/25 [00:00<?, ?batch/s]
Train E29: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4347, E=0.8272, KL=0.1542,
wKL=0.7250]
Train E29: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4347, E=0.8272,
KL=0.1542, wKL=0.7250]
Train E29: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5497, E=0.8231,
KL=0.1580, wKL=0.7250]
Train E29: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.5497, E=0.8231,
KL=0.1580, wKL=0.7250]
Train E29: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4634, E=0.8214,
KL=0.1549, wKL=0.7250]
Train E29: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4634, E=0.8214,
KL=0.1549, wKL=0.7250]
Train E29: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5020, E=0.8177,
KL=0.1548, wKL=0.7250]
Train E29: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5020, E=0.8177,
KL=0.1548, wKL=0.7250]
Train E29: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4169, E=0.8187,
KL=0.1554, wKL=0.7250]
Train E29: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4169, E=0.8187,
KL=0.1554, wKL=0.7250]
Train E29: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4167, E=0.8264,
KL=0.1530, wKL=0.7250]
Train E29: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4167, E=0.8264,
KL=0.1530, wKL=0.7250]
Train E29: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5830, E=0.8255,
KL=0.1540, wKL=0.7250]
Train E29: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.5830, E=0.8255,
KL=0.1540, wKL=0.7250]
Train E29: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5142, E=0.8186,
KL=0.1543, wKL=0.7250]
Train E29: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.5142, E=0.8186,
KL=0.1543, wKL=0.7250]
Train E29: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4767, E=0.8210,
KL=0.1531, wKL=0.7250]
Train E29: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4767, E=0.8210,
KL=0.1531, wKL=0.7250]
Train E29: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.5523, E=0.8236,
KL=0.1538, wKL=0.7250]
Train E29: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5523, E=0.8236,
KL=0.1538, wKL=0.7250]
Train E29: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.3959, E=0.8267,
KL=0.1541, wKL=0.7250]
Train E29: 44%|████▍ | 11/25 [00:15<00:19, 1.42s/batch, N=1.3959, E=0.8267,
KL=0.1541, wKL=0.7250]
Train E29: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.5864, E=0.8208,
KL=0.1568, wKL=0.7250]
Train E29: 48%|████▊ | 12/25 [00:16<00:18, 1.44s/batch, N=1.5864, E=0.8208,
KL=0.1568, wKL=0.7250]
Train E29: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.4578, E=0.8226,
KL=0.1543, wKL=0.7250]
Train E29: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4578, E=0.8226,
KL=0.1543, wKL=0.7250]
Train E29: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.4437, E=0.8224,
KL=0.1547, wKL=0.7250]
Train E29: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.4437, E=0.8224,
KL=0.1547, wKL=0.7250]
Train E29: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4298, E=0.8221,
KL=0.1549, wKL=0.7250]
Train E29: 60%|██████ | 15/25 [00:21<00:15, 1.58s/batch, N=1.4298, E=0.8221,
KL=0.1549, wKL=0.7250]
Train E29: 60%|██████ | 15/25 [00:22<00:15, 1.58s/batch, N=1.5296, E=0.8273,
KL=0.1540, wKL=0.7250]
Train E29: 64%|██████▍ | 16/25 [00:22<00:13, 1.52s/batch, N=1.5296, E=0.8273,
KL=0.1540, wKL=0.7250]
Train E29: 64%|██████▍ | 16/25 [00:24<00:13, 1.52s/batch, N=1.5725, E=0.8252,
KL=0.1565, wKL=0.7250]
Train E29: 68%|██████▊ | 17/25 [00:24<00:11, 1.49s/batch, N=1.5725, E=0.8252,
KL=0.1565, wKL=0.7250]
Train E29: 68%|██████▊ | 17/25 [00:25<00:11, 1.49s/batch, N=1.5650, E=0.8186,
KL=0.1532, wKL=0.7250]
Train E29: 72%|███████▏ | 18/25 [00:25<00:10, 1.47s/batch, N=1.5650, E=0.8186,
KL=0.1532, wKL=0.7250]
Train E29: 72%|███████▏ | 18/25 [00:27<00:10, 1.47s/batch, N=1.4760, E=0.8249,
KL=0.1528, wKL=0.7250]
Train E29: 76%|███████▌ | 19/25 [00:27<00:08, 1.45s/batch, N=1.4760, E=0.8249,
KL=0.1528, wKL=0.7250]
Train E29: 76%|███████▌ | 19/25 [00:28<00:08, 1.45s/batch, N=1.4987, E=0.8262,
KL=0.1540, wKL=0.7250]
Train E29: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4987, E=0.8262,
KL=0.1540, wKL=0.7250]
Train E29: 80%|████████ | 20/25 [00:29<00:07, 1.42s/batch, N=1.4728, E=0.8222,
KL=0.1514, wKL=0.7250]
Train E29: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4728, E=0.8222,
KL=0.1514, wKL=0.7250]
Train E29: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5235, E=0.8316,
KL=0.1531, wKL=0.7250]
Train E29: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5235, E=0.8316,
KL=0.1531, wKL=0.7250]
Train E29: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.3917, E=0.8257,
KL=0.1507, wKL=0.7250]
Train E29: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.3917, E=0.8257,
KL=0.1507, wKL=0.7250]
Train E29: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4670, E=0.8277,
KL=0.1517, wKL=0.7250]
Train E29: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4670, E=0.8277,
KL=0.1517, wKL=0.7250]
Train E29: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5261, E=0.8165,
KL=0.1537, wKL=0.7250]
Train E29: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5261, E=0.8165,
KL=0.1537, wKL=0.7250]
Train E29: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5261, E=0.8165,
KL=0.1537, wKL=0.7250]
1070.0s 131 [Epoch 029] Total: 2.3683 | N: 1.4890 | E: 0.8235 | KL(0.72×0.5):
0.1541
1104.9s 132 Train E30: 0%| | 0/25 [00:00<?, ?batch/s]
Train E30: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4621, E=0.8241, KL=0.1524,
wKL=0.7500]
Train E30: 4%|▍ | 1/25 [00:01<00:33, 1.39s/batch, N=1.4621, E=0.8241,
KL=0.1524, wKL=0.7500]
Train E30: 4%|▍ | 1/25 [00:02<00:33, 1.39s/batch, N=1.4448, E=0.8217,
KL=0.1525, wKL=0.7500]
Train E30: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4448, E=0.8217,
KL=0.1525, wKL=0.7500]
Train E30: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.6167, E=0.8271,
KL=0.1520, wKL=0.7500]
Train E30: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.6167, E=0.8271,
KL=0.1520, wKL=0.7500]
Train E30: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.4834, E=0.8184,
KL=0.1531, wKL=0.7500]
Train E30: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4834, E=0.8184,
KL=0.1531, wKL=0.7500]
Train E30: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4240, E=0.8199,
KL=0.1509, wKL=0.7500]
Train E30: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4240, E=0.8199,
KL=0.1509, wKL=0.7500]
Train E30: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4628, E=0.8240,
KL=0.1518, wKL=0.7500]
Train E30: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4628, E=0.8240,
KL=0.1518, wKL=0.7500]
Train E30: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5116, E=0.8237,
KL=0.1500, wKL=0.7500]
Train E30: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5116, E=0.8237,
KL=0.1500, wKL=0.7500]
Train E30: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.5467, E=0.8247,
KL=0.1492, wKL=0.7500]
Train E30: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5467, E=0.8247,
KL=0.1492, wKL=0.7500]
Train E30: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4071, E=0.8291,
KL=0.1479, wKL=0.7500]
Train E30: 36%|███▌ | 9/25 [00:12<00:23, 1.46s/batch, N=1.4071, E=0.8291,
KL=0.1479, wKL=0.7500]
Train E30: 36%|███▌ | 9/25 [00:14<00:23, 1.46s/batch, N=1.4695, E=0.8215,
KL=0.1489, wKL=0.7500]
Train E30: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.4695, E=0.8215,
KL=0.1489, wKL=0.7500]
Train E30: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4986, E=0.8220,
KL=0.1488, wKL=0.7500]
Train E30: 44%|████▍ | 11/25 [00:15<00:20, 1.43s/batch, N=1.4986, E=0.8220,
KL=0.1488, wKL=0.7500]
Train E30: 44%|████▍ | 11/25 [00:16<00:20, 1.43s/batch, N=1.4979, E=0.8262,
KL=0.1494, wKL=0.7500]
Train E30: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.4979, E=0.8262,
KL=0.1494, wKL=0.7500]
Train E30: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.4749, E=0.8277,
KL=0.1483, wKL=0.7500]
Train E30: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4749, E=0.8277,
KL=0.1483, wKL=0.7500]
Train E30: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4977, E=0.8231,
KL=0.1496, wKL=0.7500]
Train E30: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4977, E=0.8231,
KL=0.1496, wKL=0.7500]
Train E30: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4391, E=0.8221,
KL=0.1491, wKL=0.7500]
Train E30: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4391, E=0.8221,
KL=0.1491, wKL=0.7500]
Train E30: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.5263, E=0.8217,
KL=0.1499, wKL=0.7500]
Train E30: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5263, E=0.8217,
KL=0.1499, wKL=0.7500]
Train E30: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.5557, E=0.8228,
KL=0.1508, wKL=0.7500]
Train E30: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5557, E=0.8228,
KL=0.1508, wKL=0.7500]
Train E30: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5575, E=0.8170,
KL=0.1476, wKL=0.7500]
Train E30: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5575, E=0.8170,
KL=0.1476, wKL=0.7500]
Train E30: 72%|███████▏ | 18/25 [00:27<00:09, 1.39s/batch, N=1.5374, E=0.8224,
KL=0.1490, wKL=0.7500]
Train E30: 76%|███████▌ | 19/25 [00:27<00:09, 1.58s/batch, N=1.5374, E=0.8224,
KL=0.1490, wKL=0.7500]
Train E30: 76%|███████▌ | 19/25 [00:28<00:09, 1.58s/batch, N=1.5458, E=0.8247,
KL=0.1476, wKL=0.7500]
Train E30: 80%|████████ | 20/25 [00:28<00:07, 1.52s/batch, N=1.5458, E=0.8247,
KL=0.1476, wKL=0.7500]
Train E30: 80%|████████ | 20/25 [00:30<00:07, 1.52s/batch, N=1.4827, E=0.8263,
KL=0.1478, wKL=0.7500]
Train E30: 84%|████████▍ | 21/25 [00:30<00:05, 1.48s/batch, N=1.4827, E=0.8263,
KL=0.1478, wKL=0.7500]
Train E30: 84%|████████▍ | 21/25 [00:31<00:05, 1.48s/batch, N=1.3661, E=0.8217,
KL=0.1486, wKL=0.7500]
Train E30: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.3661, E=0.8217,
KL=0.1486, wKL=0.7500]
Train E30: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.4920, E=0.8242,
KL=0.1460, wKL=0.7500]
Train E30: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.4920, E=0.8242,
KL=0.1460, wKL=0.7500]
Train E30: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.4350, E=0.8216,
KL=0.1485, wKL=0.7500]
Train E30: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.4350, E=0.8216,
KL=0.1485, wKL=0.7500]
Train E30: 96%|█████████▌| 24/25 [00:34<00:01, 1.44s/batch, N=1.4819, E=0.8261,
KL=0.1479, wKL=0.7500]
Train E30: 100%|██████████| 25/25 [00:34<00:00, 1.19s/batch, N=1.4819, E=0.8261,
KL=0.1479, wKL=0.7500]
Train E30: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4819, E=0.8261,
KL=0.1479, wKL=0.7500]
1104.9s 133 [Epoch 030] Total: 2.3682 | N: 1.4889 | E: 0.8233 | KL(0.75×0.5):
0.1495
1104.9s 134 Saved checkpoint: /kaggle/working/checkpoints/gvae_30_epoch030.pt
1139.7s 135 Train E31: 0%| | 0/25 [00:00<?, ?batch/s]
Train E31: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5405, E=0.8219, KL=0.1472,
wKL=0.7750]
Train E31: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5405, E=0.8219,
KL=0.1472, wKL=0.7750]
Train E31: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.5836, E=0.8256,
KL=0.1474, wKL=0.7750]
Train E31: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5836, E=0.8256,
KL=0.1474, wKL=0.7750]
Train E31: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4920, E=0.8246,
KL=0.1458, wKL=0.7750]
Train E31: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4920, E=0.8246,
KL=0.1458, wKL=0.7750]
Train E31: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4810, E=0.8253,
KL=0.1452, wKL=0.7750]
Train E31: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4810, E=0.8253,
KL=0.1452, wKL=0.7750]
Train E31: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4772, E=0.8285,
KL=0.1448, wKL=0.7750]
Train E31: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4772, E=0.8285,
KL=0.1448, wKL=0.7750]
Train E31: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4420, E=0.8184,
KL=0.1446, wKL=0.7750]
Train E31: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4420, E=0.8184,
KL=0.1446, wKL=0.7750]
Train E31: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5350, E=0.8218,
KL=0.1454, wKL=0.7750]
Train E31: 28%|██▊ | 7/25 [00:09<00:26, 1.49s/batch, N=1.5350, E=0.8218,
KL=0.1454, wKL=0.7750]
Train E31: 28%|██▊ | 7/25 [00:11<00:26, 1.49s/batch, N=1.5586, E=0.8255,
KL=0.1449, wKL=0.7750]
Train E31: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.5586, E=0.8255,
KL=0.1449, wKL=0.7750]
Train E31: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.4219, E=0.8225,
KL=0.1441, wKL=0.7750]
Train E31: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.4219, E=0.8225,
KL=0.1441, wKL=0.7750]
Train E31: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.4970, E=0.8234,
KL=0.1458, wKL=0.7750]
Train E31: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4970, E=0.8234,
KL=0.1458, wKL=0.7750]
Train E31: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4820, E=0.8245,
KL=0.1445, wKL=0.7750]
Train E31: 44%|████▍ | 11/25 [00:15<00:19, 1.42s/batch, N=1.4820, E=0.8245,
KL=0.1445, wKL=0.7750]
Train E31: 44%|████▍ | 11/25 [00:16<00:19, 1.42s/batch, N=1.4467, E=0.8221,
KL=0.1452, wKL=0.7750]
Train E31: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4467, E=0.8221,
KL=0.1452, wKL=0.7750]
Train E31: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.3781, E=0.8219,
KL=0.1446, wKL=0.7750]
Train E31: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.3781, E=0.8219,
KL=0.1446, wKL=0.7750]
Train E31: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5280, E=0.8291,
KL=0.1446, wKL=0.7750]
Train E31: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5280, E=0.8291,
KL=0.1446, wKL=0.7750]
Train E31: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4462, E=0.8217,
KL=0.1455, wKL=0.7750]
Train E31: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.4462, E=0.8217,
KL=0.1455, wKL=0.7750]
Train E31: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.4947, E=0.8237,
KL=0.1445, wKL=0.7750]
Train E31: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4947, E=0.8237,
KL=0.1445, wKL=0.7750]
Train E31: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5019, E=0.8227,
KL=0.1449, wKL=0.7750]
Train E31: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5019, E=0.8227,
KL=0.1449, wKL=0.7750]
Train E31: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.5066, E=0.8258,
KL=0.1443, wKL=0.7750]
Train E31: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5066, E=0.8258,
KL=0.1443, wKL=0.7750]
Train E31: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4949, E=0.8243,
KL=0.1441, wKL=0.7750]
Train E31: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4949, E=0.8243,
KL=0.1441, wKL=0.7750]
Train E31: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5473, E=0.8259,
KL=0.1454, wKL=0.7750]
Train E31: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.5473, E=0.8259,
KL=0.1454, wKL=0.7750]
Train E31: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5141, E=0.8215,
KL=0.1445, wKL=0.7750]
Train E31: 84%|████████▍ | 21/25 [00:29<00:05, 1.38s/batch, N=1.5141, E=0.8215,
KL=0.1445, wKL=0.7750]
Train E31: 84%|████████▍ | 21/25 [00:30<00:05, 1.38s/batch, N=1.3829, E=0.8229,
KL=0.1457, wKL=0.7750]
Train E31: 88%|████████▊ | 22/25 [00:30<00:04, 1.38s/batch, N=1.3829, E=0.8229,
KL=0.1457, wKL=0.7750]
Train E31: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.4962, E=0.8160,
KL=0.1466, wKL=0.7750]
Train E31: 92%|█████████▏| 23/25 [00:32<00:03, 1.57s/batch, N=1.4962, E=0.8160,
KL=0.1466, wKL=0.7750]
Train E31: 92%|█████████▏| 23/25 [00:34<00:03, 1.57s/batch, N=1.5284, E=0.8235,
KL=0.1466, wKL=0.7750]
Train E31: 96%|█████████▌| 24/25 [00:34<00:01, 1.51s/batch, N=1.5284, E=0.8235,
KL=0.1466, wKL=0.7750]
Train E31: 96%|█████████▌| 24/25 [00:34<00:01, 1.51s/batch, N=1.3842, E=0.8176,
KL=0.1451, wKL=0.7750]
Train E31: 100%|██████████| 25/25 [00:34<00:00, 1.24s/batch, N=1.3842, E=0.8176,
KL=0.1451, wKL=0.7750]
Train E31: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.3842, E=0.8176,
KL=0.1451, wKL=0.7750]
1139.7s 136 [Epoch 031] Total: 2.3685 | N: 1.4889 | E: 0.8234 | KL(0.78×0.5):
0.1453
1173.9s 137 Train E32: 0%| | 0/25 [00:00<?, ?batch/s]
Train E32: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5335, E=0.8253, KL=0.1451,
wKL=0.8000]
Train E32: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5335, E=0.8253,
KL=0.1451, wKL=0.8000]
Train E32: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.5083, E=0.8255,
KL=0.1444, wKL=0.8000]
Train E32: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5083, E=0.8255,
KL=0.1444, wKL=0.8000]
Train E32: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.5043, E=0.8250,
KL=0.1433, wKL=0.8000]
Train E32: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5043, E=0.8250,
KL=0.1433, wKL=0.8000]
Train E32: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4564, E=0.8267,
KL=0.1401, wKL=0.8000]
Train E32: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.4564, E=0.8267,
KL=0.1401, wKL=0.8000]
Train E32: 16%|█▌ | 4/25 [00:07<00:29, 1.38s/batch, N=1.4805, E=0.8281,
KL=0.1402, wKL=0.8000]
Train E32: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.4805, E=0.8281,
KL=0.1402, wKL=0.8000]
Train E32: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.5015, E=0.8239,
KL=0.1380, wKL=0.8000]
Train E32: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.5015, E=0.8239,
KL=0.1380, wKL=0.8000]
Train E32: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5169, E=0.8256,
KL=0.1377, wKL=0.8000]
Train E32: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5169, E=0.8256,
KL=0.1377, wKL=0.8000]
Train E32: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4322, E=0.8234,
KL=0.1377, wKL=0.8000]
Train E32: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.4322, E=0.8234,
KL=0.1377, wKL=0.8000]
Train E32: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.5228, E=0.8248,
KL=0.1396, wKL=0.8000]
Train E32: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5228, E=0.8248,
KL=0.1396, wKL=0.8000]
Train E32: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4262, E=0.8217,
KL=0.1389, wKL=0.8000]
Train E32: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4262, E=0.8217,
KL=0.1389, wKL=0.8000]
Train E32: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.3619, E=0.8251,
KL=0.1399, wKL=0.8000]
Train E32: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.3619, E=0.8251,
KL=0.1399, wKL=0.8000]
Train E32: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5627, E=0.8233,
KL=0.1428, wKL=0.8000]
Train E32: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5627, E=0.8233,
KL=0.1428, wKL=0.8000]
Train E32: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5259, E=0.8187,
KL=0.1431, wKL=0.8000]
Train E32: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5259, E=0.8187,
KL=0.1431, wKL=0.8000]
Train E32: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5005, E=0.8238,
KL=0.1429, wKL=0.8000]
Train E32: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5005, E=0.8238,
KL=0.1429, wKL=0.8000]
Train E32: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4982, E=0.8253,
KL=0.1437, wKL=0.8000]
Train E32: 60%|██████ | 15/25 [00:20<00:14, 1.40s/batch, N=1.4982, E=0.8253,
KL=0.1437, wKL=0.8000]
Train E32: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4044, E=0.8217,
KL=0.1415, wKL=0.8000]
Train E32: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4044, E=0.8217,
KL=0.1415, wKL=0.8000]
Train E32: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5273, E=0.8208,
KL=0.1418, wKL=0.8000]
Train E32: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5273, E=0.8208,
KL=0.1418, wKL=0.8000]
Train E32: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5201, E=0.8204,
KL=0.1425, wKL=0.8000]
Train E32: 72%|███████▏ | 18/25 [00:25<00:09, 1.38s/batch, N=1.5201, E=0.8204,
KL=0.1425, wKL=0.8000]
Train E32: 72%|███████▏ | 18/25 [00:26<00:09, 1.38s/batch, N=1.5206, E=0.8213,
KL=0.1406, wKL=0.8000]
Train E32: 76%|███████▌ | 19/25 [00:26<00:08, 1.38s/batch, N=1.5206, E=0.8213,
KL=0.1406, wKL=0.8000]
Train E32: 76%|███████▌ | 19/25 [00:27<00:08, 1.38s/batch, N=1.4744, E=0.8250,
KL=0.1391, wKL=0.8000]
Train E32: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.4744, E=0.8250,
KL=0.1391, wKL=0.8000]
Train E32: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5144, E=0.8198,
KL=0.1381, wKL=0.8000]
Train E32: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5144, E=0.8198,
KL=0.1381, wKL=0.8000]
Train E32: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.5535, E=0.8204,
KL=0.1367, wKL=0.8000]
Train E32: 88%|████████▊ | 22/25 [00:30<00:04, 1.39s/batch, N=1.5535, E=0.8204,
KL=0.1367, wKL=0.8000]
Train E32: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.3901, E=0.8202,
KL=0.1382, wKL=0.8000]
Train E32: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.3901, E=0.8202,
KL=0.1382, wKL=0.8000]
Train E32: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4896, E=0.8242,
KL=0.1377, wKL=0.8000]
Train E32: 96%|█████████▌| 24/25 [00:33<00:01, 1.42s/batch, N=1.4896, E=0.8242,
KL=0.1377, wKL=0.8000]
Train E32: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.5109, E=0.8219,
KL=0.1382, wKL=0.8000]
Train E32: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.5109, E=0.8219,
KL=0.1382, wKL=0.8000]
Train E32: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.5109, E=0.8219,
KL=0.1382, wKL=0.8000]
1173.9s 138 [Epoch 032] Total: 2.3685 | N: 1.4890 | E: 0.8233 | KL(0.80×0.5):
0.1405
1209.1s 139 Train E33: 0%| | 0/25 [00:00<?, ?batch/s]
Train E33: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5660, E=0.8236, KL=0.1380,
wKL=0.8250]
Train E33: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5660, E=0.8236,
KL=0.1380, wKL=0.8250]
Train E33: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.3950, E=0.8211,
KL=0.1381, wKL=0.8250]
Train E33: 8%|▊ | 2/25 [00:02<00:32, 1.40s/batch, N=1.3950, E=0.8211,
KL=0.1381, wKL=0.8250]
Train E33: 8%|▊ | 2/25 [00:04<00:32, 1.40s/batch, N=1.5396, E=0.8229,
KL=0.1400, wKL=0.8250]
Train E33: 12%|█▏ | 3/25 [00:04<00:32, 1.49s/batch, N=1.5396, E=0.8229,
KL=0.1400, wKL=0.8250]
Train E33: 12%|█▏ | 3/25 [00:05<00:32, 1.49s/batch, N=1.4796, E=0.8253,
KL=0.1382, wKL=0.8250]
Train E33: 16%|█▌ | 4/25 [00:05<00:30, 1.47s/batch, N=1.4796, E=0.8253,
KL=0.1382, wKL=0.8250]
Train E33: 16%|█▌ | 4/25 [00:07<00:30, 1.47s/batch, N=1.4762, E=0.8303,
KL=0.1379, wKL=0.8250]
Train E33: 20%|██ | 5/25 [00:07<00:29, 1.45s/batch, N=1.4762, E=0.8303,
KL=0.1379, wKL=0.8250]
Train E33: 20%|██ | 5/25 [00:08<00:29, 1.45s/batch, N=1.4490, E=0.8192,
KL=0.1388, wKL=0.8250]
Train E33: 24%|██▍ | 6/25 [00:08<00:27, 1.44s/batch, N=1.4490, E=0.8192,
KL=0.1388, wKL=0.8250]
Train E33: 24%|██▍ | 6/25 [00:10<00:27, 1.44s/batch, N=1.4558, E=0.8242,
KL=0.1362, wKL=0.8250]
Train E33: 28%|██▊ | 7/25 [00:10<00:29, 1.62s/batch, N=1.4558, E=0.8242,
KL=0.1362, wKL=0.8250]
Train E33: 28%|██▊ | 7/25 [00:12<00:29, 1.62s/batch, N=1.5475, E=0.8254,
KL=0.1370, wKL=0.8250]
Train E33: 32%|███▏ | 8/25 [00:12<00:26, 1.54s/batch, N=1.5475, E=0.8254,
KL=0.1370, wKL=0.8250]
Train E33: 32%|███▏ | 8/25 [00:13<00:26, 1.54s/batch, N=1.4838, E=0.8240,
KL=0.1365, wKL=0.8250]
Train E33: 36%|███▌ | 9/25 [00:13<00:23, 1.49s/batch, N=1.4838, E=0.8240,
KL=0.1365, wKL=0.8250]
Train E33: 36%|███▌ | 9/25 [00:14<00:23, 1.49s/batch, N=1.4609, E=0.8262,
KL=0.1357, wKL=0.8250]
Train E33: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.4609, E=0.8262,
KL=0.1357, wKL=0.8250]
Train E33: 40%|████ | 10/25 [00:16<00:21, 1.46s/batch, N=1.4215, E=0.8265,
KL=0.1349, wKL=0.8250]
Train E33: 44%|████▍ | 11/25 [00:16<00:20, 1.43s/batch, N=1.4215, E=0.8265,
KL=0.1349, wKL=0.8250]
Train E33: 44%|████▍ | 11/25 [00:17<00:20, 1.43s/batch, N=1.4652, E=0.8216,
KL=0.1357, wKL=0.8250]
Train E33: 48%|████▊ | 12/25 [00:17<00:18, 1.44s/batch, N=1.4652, E=0.8216,
KL=0.1357, wKL=0.8250]
Train E33: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.4590, E=0.8262,
KL=0.1341, wKL=0.8250]
Train E33: 52%|█████▏ | 13/25 [00:18<00:16, 1.42s/batch, N=1.4590, E=0.8262,
KL=0.1341, wKL=0.8250]
Train E33: 52%|█████▏ | 13/25 [00:20<00:16, 1.42s/batch, N=1.5590, E=0.8273,
KL=0.1348, wKL=0.8250]
Train E33: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.5590, E=0.8273,
KL=0.1348, wKL=0.8250]
Train E33: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.5220, E=0.8222,
KL=0.1342, wKL=0.8250]
Train E33: 60%|██████ | 15/25 [00:21<00:13, 1.39s/batch, N=1.5220, E=0.8222,
KL=0.1342, wKL=0.8250]
Train E33: 60%|██████ | 15/25 [00:23<00:13, 1.39s/batch, N=1.4700, E=0.8246,
KL=0.1346, wKL=0.8250]
Train E33: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4700, E=0.8246,
KL=0.1346, wKL=0.8250]
Train E33: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.5006, E=0.8253,
KL=0.1350, wKL=0.8250]
Train E33: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5006, E=0.8253,
KL=0.1350, wKL=0.8250]
Train E33: 68%|██████▊ | 17/25 [00:26<00:11, 1.40s/batch, N=1.4063, E=0.8239,
KL=0.1335, wKL=0.8250]
Train E33: 72%|███████▏ | 18/25 [00:26<00:10, 1.43s/batch, N=1.4063, E=0.8239,
KL=0.1335, wKL=0.8250]
Train E33: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.4765, E=0.8208,
KL=0.1358, wKL=0.8250]
Train E33: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.4765, E=0.8208,
KL=0.1358, wKL=0.8250]
Train E33: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.5272, E=0.8236,
KL=0.1348, wKL=0.8250]
Train E33: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5272, E=0.8236,
KL=0.1348, wKL=0.8250]
Train E33: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.5352, E=0.8216,
KL=0.1350, wKL=0.8250]
Train E33: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5352, E=0.8216,
KL=0.1350, wKL=0.8250]
Train E33: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4673, E=0.8185,
KL=0.1357, wKL=0.8250]
Train E33: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.4673, E=0.8185,
KL=0.1357, wKL=0.8250]
Train E33: 88%|████████▊ | 22/25 [00:33<00:04, 1.40s/batch, N=1.5042, E=0.8231,
KL=0.1336, wKL=0.8250]
Train E33: 92%|█████████▏| 23/25 [00:33<00:02, 1.39s/batch, N=1.5042, E=0.8231,
KL=0.1336, wKL=0.8250]
Train E33: 92%|█████████▏| 23/25 [00:34<00:02, 1.39s/batch, N=1.5281, E=0.8194,
KL=0.1355, wKL=0.8250]
Train E33: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5281, E=0.8194,
KL=0.1355, wKL=0.8250]
Train E33: 96%|█████████▌| 24/25 [00:35<00:01, 1.39s/batch, N=1.6140, E=0.8276,
KL=0.1353, wKL=0.8250]
Train E33: 100%|██████████| 25/25 [00:35<00:00, 1.19s/batch, N=1.6140, E=0.8276,
KL=0.1353, wKL=0.8250]
Train E33: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.6140, E=0.8276,
KL=0.1353, wKL=0.8250]
1209.1s 140 [Epoch 033] Total: 2.3692 | N: 1.4895 | E: 0.8237 | KL(0.82×0.5):
0.1360
1243.8s 141 Train E34: 0%| | 0/25 [00:00<?, ?batch/s]
Train E34: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5060, E=0.8251, KL=0.1327,
wKL=0.8500]
Train E34: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.5060, E=0.8251,
KL=0.1327, wKL=0.8500]
Train E34: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.4803, E=0.8254,
KL=0.1340, wKL=0.8500]
Train E34: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4803, E=0.8254,
KL=0.1340, wKL=0.8500]
Train E34: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4526, E=0.8243,
KL=0.1328, wKL=0.8500]
Train E34: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4526, E=0.8243,
KL=0.1328, wKL=0.8500]
Train E34: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5122, E=0.8203,
KL=0.1327, wKL=0.8500]
Train E34: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5122, E=0.8203,
KL=0.1327, wKL=0.8500]
Train E34: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5363, E=0.8263,
KL=0.1334, wKL=0.8500]
Train E34: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5363, E=0.8263,
KL=0.1334, wKL=0.8500]
Train E34: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4758, E=0.8253,
KL=0.1319, wKL=0.8500]
Train E34: 24%|██▍ | 6/25 [00:08<00:25, 1.36s/batch, N=1.4758, E=0.8253,
KL=0.1319, wKL=0.8500]
Train E34: 24%|██▍ | 6/25 [00:09<00:25, 1.36s/batch, N=1.4749, E=0.8255,
KL=0.1312, wKL=0.8500]
Train E34: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4749, E=0.8255,
KL=0.1312, wKL=0.8500]
Train E34: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4804, E=0.8176,
KL=0.1317, wKL=0.8500]
Train E34: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.4804, E=0.8176,
KL=0.1317, wKL=0.8500]
Train E34: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.4409, E=0.8231,
KL=0.1311, wKL=0.8500]
Train E34: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4409, E=0.8231,
KL=0.1311, wKL=0.8500]
Train E34: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.4400, E=0.8221,
KL=0.1311, wKL=0.8500]
Train E34: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4400, E=0.8221,
KL=0.1311, wKL=0.8500]
Train E34: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.4646, E=0.8242,
KL=0.1313, wKL=0.8500]
Train E34: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.4646, E=0.8242,
KL=0.1313, wKL=0.8500]
Train E34: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.5191, E=0.8253,
KL=0.1314, wKL=0.8500]
Train E34: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.5191, E=0.8253,
KL=0.1314, wKL=0.8500]
Train E34: 48%|████▊ | 12/25 [00:18<00:17, 1.38s/batch, N=1.5061, E=0.8197,
KL=0.1312, wKL=0.8500]
Train E34: 52%|█████▏ | 13/25 [00:18<00:18, 1.57s/batch, N=1.5061, E=0.8197,
KL=0.1312, wKL=0.8500]
Train E34: 52%|█████▏ | 13/25 [00:19<00:18, 1.57s/batch, N=1.4309, E=0.8260,
KL=0.1295, wKL=0.8500]
Train E34: 56%|█████▌ | 14/25 [00:19<00:16, 1.52s/batch, N=1.4309, E=0.8260,
KL=0.1295, wKL=0.8500]
Train E34: 56%|█████▌ | 14/25 [00:21<00:16, 1.52s/batch, N=1.5514, E=0.8220,
KL=0.1337, wKL=0.8500]
Train E34: 60%|██████ | 15/25 [00:21<00:14, 1.49s/batch, N=1.5514, E=0.8220,
KL=0.1337, wKL=0.8500]
Train E34: 60%|██████ | 15/25 [00:22<00:14, 1.49s/batch, N=1.5622, E=0.8240,
KL=0.1315, wKL=0.8500]
Train E34: 64%|██████▍ | 16/25 [00:22<00:13, 1.47s/batch, N=1.5622, E=0.8240,
KL=0.1315, wKL=0.8500]
Train E34: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5277, E=0.8278,
KL=0.1307, wKL=0.8500]
Train E34: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5277, E=0.8278,
KL=0.1307, wKL=0.8500]
Train E34: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4747, E=0.8235,
KL=0.1339, wKL=0.8500]
Train E34: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4747, E=0.8235,
KL=0.1339, wKL=0.8500]
Train E34: 72%|███████▏ | 18/25 [00:26<00:09, 1.42s/batch, N=1.5264, E=0.8245,
KL=0.1317, wKL=0.8500]
Train E34: 76%|███████▌ | 19/25 [00:26<00:08, 1.41s/batch, N=1.5264, E=0.8245,
KL=0.1317, wKL=0.8500]
Train E34: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4576, E=0.8191,
KL=0.1305, wKL=0.8500]
Train E34: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.4576, E=0.8191,
KL=0.1305, wKL=0.8500]
Train E34: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.4281, E=0.8215,
KL=0.1317, wKL=0.8500]
Train E34: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4281, E=0.8215,
KL=0.1317, wKL=0.8500]
Train E34: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.5200, E=0.8273,
KL=0.1291, wKL=0.8500]
Train E34: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.5200, E=0.8273,
KL=0.1291, wKL=0.8500]
Train E34: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5702, E=0.8251,
KL=0.1286, wKL=0.8500]
Train E34: 92%|█████████▏| 23/25 [00:32<00:02, 1.47s/batch, N=1.5702, E=0.8251,
KL=0.1286, wKL=0.8500]
Train E34: 92%|█████████▏| 23/25 [00:34<00:02, 1.47s/batch, N=1.3967, E=0.8190,
KL=0.1278, wKL=0.8500]
Train E34: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.3967, E=0.8190,
KL=0.1278, wKL=0.8500]
Train E34: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5163, E=0.8254,
KL=0.1268, wKL=0.8500]
Train E34: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.5163, E=0.8254,
KL=0.1268, wKL=0.8500]
Train E34: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5163, E=0.8254,
KL=0.1268, wKL=0.8500]
1243.8s 142 [Epoch 034] Total: 2.3688 | N: 1.4894 | E: 0.8235 | KL(0.85×0.5):
0.1314
1278.5s 143 Train E35: 0%| | 0/25 [00:00<?, ?batch/s]
Train E35: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4119, E=0.8222, KL=0.1271,
wKL=0.8750]
Train E35: 4%|▍ | 1/25 [00:01<00:32, 1.33s/batch, N=1.4119, E=0.8222,
KL=0.1271, wKL=0.8750]
Train E35: 4%|▍ | 1/25 [00:02<00:32, 1.33s/batch, N=1.3558, E=0.8200,
KL=0.1256, wKL=0.8750]
Train E35: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.3558, E=0.8200,
KL=0.1256, wKL=0.8750]
Train E35: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4430, E=0.8227,
KL=0.1269, wKL=0.8750]
Train E35: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4430, E=0.8227,
KL=0.1269, wKL=0.8750]
Train E35: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4491, E=0.8281,
KL=0.1276, wKL=0.8750]
Train E35: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4491, E=0.8281,
KL=0.1276, wKL=0.8750]
Train E35: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4885, E=0.8313,
KL=0.1279, wKL=0.8750]
Train E35: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4885, E=0.8313,
KL=0.1279, wKL=0.8750]
Train E35: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4725, E=0.8245,
KL=0.1279, wKL=0.8750]
Train E35: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.4725, E=0.8245,
KL=0.1279, wKL=0.8750]
Train E35: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.4273, E=0.8281,
KL=0.1280, wKL=0.8750]
Train E35: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4273, E=0.8281,
KL=0.1280, wKL=0.8750]
Train E35: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.5760, E=0.8268,
KL=0.1300, wKL=0.8750]
Train E35: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.5760, E=0.8268,
KL=0.1300, wKL=0.8750]
Train E35: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.5736, E=0.8220,
KL=0.1293, wKL=0.8750]
Train E35: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.5736, E=0.8220,
KL=0.1293, wKL=0.8750]
Train E35: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.5052, E=0.8220,
KL=0.1286, wKL=0.8750]
Train E35: 40%|████ | 10/25 [00:13<00:20, 1.37s/batch, N=1.5052, E=0.8220,
KL=0.1286, wKL=0.8750]
Train E35: 40%|████ | 10/25 [00:15<00:20, 1.37s/batch, N=1.5273, E=0.8229,
KL=0.1287, wKL=0.8750]
Train E35: 44%|████▍ | 11/25 [00:15<00:19, 1.37s/batch, N=1.5273, E=0.8229,
KL=0.1287, wKL=0.8750]
Train E35: 44%|████▍ | 11/25 [00:16<00:19, 1.37s/batch, N=1.4564, E=0.8216,
KL=0.1291, wKL=0.8750]
Train E35: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4564, E=0.8216,
KL=0.1291, wKL=0.8750]
Train E35: 48%|████▊ | 12/25 [00:17<00:17, 1.38s/batch, N=1.5736, E=0.8234,
KL=0.1274, wKL=0.8750]
Train E35: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.5736, E=0.8234,
KL=0.1274, wKL=0.8750]
Train E35: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4625, E=0.8197,
KL=0.1275, wKL=0.8750]
Train E35: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4625, E=0.8197,
KL=0.1275, wKL=0.8750]
Train E35: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5224, E=0.8275,
KL=0.1267, wKL=0.8750]
Train E35: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.5224, E=0.8275,
KL=0.1267, wKL=0.8750]
Train E35: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5614, E=0.8171,
KL=0.1269, wKL=0.8750]
Train E35: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5614, E=0.8171,
KL=0.1269, wKL=0.8750]
Train E35: 64%|██████▍ | 16/25 [00:24<00:12, 1.39s/batch, N=1.5558, E=0.8228,
KL=0.1270, wKL=0.8750]
Train E35: 68%|██████▊ | 17/25 [00:24<00:12, 1.57s/batch, N=1.5558, E=0.8228,
KL=0.1270, wKL=0.8750]
Train E35: 68%|██████▊ | 17/25 [00:25<00:12, 1.57s/batch, N=1.4441, E=0.8234,
KL=0.1248, wKL=0.8750]
Train E35: 72%|███████▏ | 18/25 [00:25<00:10, 1.52s/batch, N=1.4441, E=0.8234,
KL=0.1248, wKL=0.8750]
Train E35: 72%|███████▏ | 18/25 [00:26<00:10, 1.52s/batch, N=1.4841, E=0.8242,
KL=0.1254, wKL=0.8750]
Train E35: 76%|███████▌ | 19/25 [00:26<00:09, 1.52s/batch, N=1.4841, E=0.8242,
KL=0.1254, wKL=0.8750]
Train E35: 76%|███████▌ | 19/25 [00:28<00:09, 1.52s/batch, N=1.5467, E=0.8268,
KL=0.1266, wKL=0.8750]
Train E35: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.5467, E=0.8268,
KL=0.1266, wKL=0.8750]
Train E35: 80%|████████ | 20/25 [00:29<00:07, 1.50s/batch, N=1.5198, E=0.8200,
KL=0.1253, wKL=0.8750]
Train E35: 84%|████████▍ | 21/25 [00:29<00:06, 1.51s/batch, N=1.5198, E=0.8200,
KL=0.1253, wKL=0.8750]
Train E35: 84%|████████▍ | 21/25 [00:31<00:06, 1.51s/batch, N=1.4474, E=0.8262,
KL=0.1250, wKL=0.8750]
Train E35: 88%|████████▊ | 22/25 [00:31<00:04, 1.47s/batch, N=1.4474, E=0.8262,
KL=0.1250, wKL=0.8750]
Train E35: 88%|████████▊ | 22/25 [00:32<00:04, 1.47s/batch, N=1.4772, E=0.8219,
KL=0.1249, wKL=0.8750]
Train E35: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.4772, E=0.8219,
KL=0.1249, wKL=0.8750]
Train E35: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.4919, E=0.8213,
KL=0.1238, wKL=0.8750]
Train E35: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4919, E=0.8213,
KL=0.1238, wKL=0.8750]
Train E35: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4301, E=0.8243,
KL=0.1248, wKL=0.8750]
Train E35: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4301, E=0.8243,
KL=0.1248, wKL=0.8750]
Train E35: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4301, E=0.8243,
KL=0.1248, wKL=0.8750]
1278.5s 144 [Epoch 035] Total: 2.3687 | N: 1.4895 | E: 0.8236 | KL(0.88×0.5):
0.1270
1313.3s 145 Train E36: 0%| | 0/25 [00:00<?, ?batch/s]
Train E36: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5319, E=0.8262, KL=0.1229,
wKL=0.9000]
Train E36: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5319, E=0.8262,
KL=0.1229, wKL=0.9000]
Train E36: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4719, E=0.8225,
KL=0.1243, wKL=0.9000]
Train E36: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4719, E=0.8225,
KL=0.1243, wKL=0.9000]
Train E36: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.5257, E=0.8176,
KL=0.1259, wKL=0.9000]
Train E36: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5257, E=0.8176,
KL=0.1259, wKL=0.9000]
Train E36: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.3931, E=0.8285,
KL=0.1216, wKL=0.9000]
Train E36: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.3931, E=0.8285,
KL=0.1216, wKL=0.9000]
Train E36: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4468, E=0.8228,
KL=0.1233, wKL=0.9000]
Train E36: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4468, E=0.8228,
KL=0.1233, wKL=0.9000]
Train E36: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5588, E=0.8214,
KL=0.1244, wKL=0.9000]
Train E36: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.5588, E=0.8214,
KL=0.1244, wKL=0.9000]
Train E36: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.6293, E=0.8263,
KL=0.1227, wKL=0.9000]
Train E36: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.6293, E=0.8263,
KL=0.1227, wKL=0.9000]
Train E36: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4875, E=0.8223,
KL=0.1228, wKL=0.9000]
Train E36: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.4875, E=0.8223,
KL=0.1228, wKL=0.9000]
Train E36: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4666, E=0.8268,
KL=0.1216, wKL=0.9000]
Train E36: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4666, E=0.8268,
KL=0.1216, wKL=0.9000]
Train E36: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4493, E=0.8238,
KL=0.1203, wKL=0.9000]
Train E36: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4493, E=0.8238,
KL=0.1203, wKL=0.9000]
Train E36: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4521, E=0.8247,
KL=0.1207, wKL=0.9000]
Train E36: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4521, E=0.8247,
KL=0.1207, wKL=0.9000]
Train E36: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4667, E=0.8258,
KL=0.1210, wKL=0.9000]
Train E36: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4667, E=0.8258,
KL=0.1210, wKL=0.9000]
Train E36: 48%|████▊ | 12/25 [00:17<00:18, 1.40s/batch, N=1.5384, E=0.8258,
KL=0.1205, wKL=0.9000]
Train E36: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.5384, E=0.8258,
KL=0.1205, wKL=0.9000]
Train E36: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4458, E=0.8254,
KL=0.1208, wKL=0.9000]
Train E36: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4458, E=0.8254,
KL=0.1208, wKL=0.9000]
Train E36: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5352, E=0.8223,
KL=0.1206, wKL=0.9000]
Train E36: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.5352, E=0.8223,
KL=0.1206, wKL=0.9000]
Train E36: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4949, E=0.8235,
KL=0.1207, wKL=0.9000]
Train E36: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.4949, E=0.8235,
KL=0.1207, wKL=0.9000]
Train E36: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.5505, E=0.8257,
KL=0.1228, wKL=0.9000]
Train E36: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.5505, E=0.8257,
KL=0.1228, wKL=0.9000]
Train E36: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5315, E=0.8177,
KL=0.1215, wKL=0.9000]
Train E36: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5315, E=0.8177,
KL=0.1215, wKL=0.9000]
Train E36: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.4923, E=0.8238,
KL=0.1214, wKL=0.9000]
Train E36: 76%|███████▌ | 19/25 [00:27<00:10, 1.67s/batch, N=1.4923, E=0.8238,
KL=0.1214, wKL=0.9000]
Train E36: 76%|███████▌ | 19/25 [00:28<00:10, 1.67s/batch, N=1.4555, E=0.8272,
KL=0.1211, wKL=0.9000]
Train E36: 80%|████████ | 20/25 [00:28<00:07, 1.58s/batch, N=1.4555, E=0.8272,
KL=0.1211, wKL=0.9000]
Train E36: 80%|████████ | 20/25 [00:30<00:07, 1.58s/batch, N=1.4590, E=0.8239,
KL=0.1198, wKL=0.9000]
Train E36: 84%|████████▍ | 21/25 [00:30<00:06, 1.52s/batch, N=1.4590, E=0.8239,
KL=0.1198, wKL=0.9000]
Train E36: 84%|████████▍ | 21/25 [00:31<00:06, 1.52s/batch, N=1.4606, E=0.8261,
KL=0.1196, wKL=0.9000]
Train E36: 88%|████████▊ | 22/25 [00:31<00:04, 1.48s/batch, N=1.4606, E=0.8261,
KL=0.1196, wKL=0.9000]
Train E36: 88%|████████▊ | 22/25 [00:32<00:04, 1.48s/batch, N=1.4828, E=0.8198,
KL=0.1187, wKL=0.9000]
Train E36: 92%|█████████▏| 23/25 [00:32<00:02, 1.46s/batch, N=1.4828, E=0.8198,
KL=0.1187, wKL=0.9000]
Train E36: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.4979, E=0.8214,
KL=0.1177, wKL=0.9000]
Train E36: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4979, E=0.8214,
KL=0.1177, wKL=0.9000]
Train E36: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.3230, E=0.8255,
KL=0.1172, wKL=0.9000]
Train E36: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.3230, E=0.8255,
KL=0.1172, wKL=0.9000]
Train E36: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.3230, E=0.8255,
KL=0.1172, wKL=0.9000]
1313.3s 146 [Epoch 036] Total: 2.3683 | N: 1.4898 | E: 0.8238 | KL(0.90×0.5):
0.1214
1348.2s 147 Train E37: 0%| | 0/25 [00:00<?, ?batch/s]
Train E37: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5144, E=0.8198, KL=0.1178,
wKL=0.9250]
Train E37: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.5144, E=0.8198,
KL=0.1178, wKL=0.9250]
Train E37: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4818, E=0.8241,
KL=0.1157, wKL=0.9250]
Train E37: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4818, E=0.8241,
KL=0.1157, wKL=0.9250]
Train E37: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4888, E=0.8240,
KL=0.1154, wKL=0.9250]
Train E37: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4888, E=0.8240,
KL=0.1154, wKL=0.9250]
Train E37: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4117, E=0.8245,
KL=0.1166, wKL=0.9250]
Train E37: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4117, E=0.8245,
KL=0.1166, wKL=0.9250]
Train E37: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4826, E=0.8183,
KL=0.1159, wKL=0.9250]
Train E37: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4826, E=0.8183,
KL=0.1159, wKL=0.9250]
Train E37: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.5471, E=0.8263,
KL=0.1181, wKL=0.9250]
Train E37: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5471, E=0.8263,
KL=0.1181, wKL=0.9250]
Train E37: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4718, E=0.8259,
KL=0.1173, wKL=0.9250]
Train E37: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4718, E=0.8259,
KL=0.1173, wKL=0.9250]
Train E37: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4982, E=0.8239,
KL=0.1169, wKL=0.9250]
Train E37: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4982, E=0.8239,
KL=0.1169, wKL=0.9250]
Train E37: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5117, E=0.8258,
KL=0.1210, wKL=0.9250]
Train E37: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5117, E=0.8258,
KL=0.1210, wKL=0.9250]
Train E37: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5031, E=0.8210,
KL=0.1190, wKL=0.9250]
Train E37: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5031, E=0.8210,
KL=0.1190, wKL=0.9250]
Train E37: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5789, E=0.8203,
KL=0.1182, wKL=0.9250]
Train E37: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.5789, E=0.8203,
KL=0.1182, wKL=0.9250]
Train E37: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.4493, E=0.8230,
KL=0.1185, wKL=0.9250]
Train E37: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.4493, E=0.8230,
KL=0.1185, wKL=0.9250]
Train E37: 48%|████▊ | 12/25 [00:17<00:17, 1.38s/batch, N=1.4436, E=0.8216,
KL=0.1172, wKL=0.9250]
Train E37: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4436, E=0.8216,
KL=0.1172, wKL=0.9250]
Train E37: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4973, E=0.8286,
KL=0.1157, wKL=0.9250]
Train E37: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4973, E=0.8286,
KL=0.1157, wKL=0.9250]
Train E37: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4384, E=0.8237,
KL=0.1156, wKL=0.9250]
Train E37: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.4384, E=0.8237,
KL=0.1156, wKL=0.9250]
Train E37: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5674, E=0.8234,
KL=0.1145, wKL=0.9250]
Train E37: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5674, E=0.8234,
KL=0.1145, wKL=0.9250]
Train E37: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4754, E=0.8260,
KL=0.1123, wKL=0.9250]
Train E37: 68%|██████▊ | 17/25 [00:23<00:11, 1.48s/batch, N=1.4754, E=0.8260,
KL=0.1123, wKL=0.9250]
Train E37: 68%|██████▊ | 17/25 [00:25<00:11, 1.48s/batch, N=1.4090, E=0.8216,
KL=0.1117, wKL=0.9250]
Train E37: 72%|███████▏ | 18/25 [00:25<00:10, 1.46s/batch, N=1.4090, E=0.8216,
KL=0.1117, wKL=0.9250]
Train E37: 72%|███████▏ | 18/25 [00:26<00:10, 1.46s/batch, N=1.5696, E=0.8261,
KL=0.1121, wKL=0.9250]
Train E37: 76%|███████▌ | 19/25 [00:26<00:08, 1.44s/batch, N=1.5696, E=0.8261,
KL=0.1121, wKL=0.9250]
Train E37: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.4805, E=0.8253,
KL=0.1124, wKL=0.9250]
Train E37: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.4805, E=0.8253,
KL=0.1124, wKL=0.9250]
Train E37: 80%|████████ | 20/25 [00:29<00:07, 1.43s/batch, N=1.4649, E=0.8268,
KL=0.1114, wKL=0.9250]
Train E37: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4649, E=0.8268,
KL=0.1114, wKL=0.9250]
Train E37: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4756, E=0.8249,
KL=0.1117, wKL=0.9250]
Train E37: 88%|████████▊ | 22/25 [00:30<00:04, 1.41s/batch, N=1.4756, E=0.8249,
KL=0.1117, wKL=0.9250]
Train E37: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4898, E=0.8279,
KL=0.1139, wKL=0.9250]
Train E37: 92%|█████████▏| 23/25 [00:32<00:03, 1.60s/batch, N=1.4898, E=0.8279,
KL=0.1139, wKL=0.9250]
Train E37: 92%|█████████▏| 23/25 [00:34<00:03, 1.60s/batch, N=1.4877, E=0.8234,
KL=0.1141, wKL=0.9250]
Train E37: 96%|█████████▌| 24/25 [00:34<00:01, 1.53s/batch, N=1.4877, E=0.8234,
KL=0.1141, wKL=0.9250]
Train E37: 96%|█████████▌| 24/25 [00:34<00:01, 1.53s/batch, N=1.5251, E=0.8217,
KL=0.1137, wKL=0.9250]
Train E37: 100%|██████████| 25/25 [00:34<00:00, 1.25s/batch, N=1.5251, E=0.8217,
KL=0.1137, wKL=0.9250]
Train E37: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5251, E=0.8217,
KL=0.1137, wKL=0.9250]
1348.2s 148 [Epoch 037] Total: 2.3671 | N: 1.4897 | E: 0.8240 | KL(0.93×0.5):
0.1155
1382.9s 149 Train E38: 0%| | 0/25 [00:00<?, ?batch/s]
Train E38: 0%| | 0/25 [00:01<?, ?batch/s, N=1.3975, E=0.8231, KL=0.1160,
wKL=0.9500]
Train E38: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.3975, E=0.8231,
KL=0.1160, wKL=0.9500]
Train E38: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4900, E=0.8273,
KL=0.1127, wKL=0.9500]
Train E38: 8%|▊ | 2/25 [00:02<00:32, 1.40s/batch, N=1.4900, E=0.8273,
KL=0.1127, wKL=0.9500]
Train E38: 8%|▊ | 2/25 [00:04<00:32, 1.40s/batch, N=1.5265, E=0.8256,
KL=0.1116, wKL=0.9500]
Train E38: 12%|█▏ | 3/25 [00:04<00:31, 1.41s/batch, N=1.5265, E=0.8256,
KL=0.1116, wKL=0.9500]
Train E38: 12%|█▏ | 3/25 [00:05<00:31, 1.41s/batch, N=1.4461, E=0.8215,
KL=0.1112, wKL=0.9500]
Train E38: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4461, E=0.8215,
KL=0.1112, wKL=0.9500]
Train E38: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4482, E=0.8210,
KL=0.1090, wKL=0.9500]
Train E38: 20%|██ | 5/25 [00:07<00:28, 1.40s/batch, N=1.4482, E=0.8210,
KL=0.1090, wKL=0.9500]
Train E38: 20%|██ | 5/25 [00:08<00:28, 1.40s/batch, N=1.5409, E=0.8273,
KL=0.1097, wKL=0.9500]
Train E38: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.5409, E=0.8273,
KL=0.1097, wKL=0.9500]
Train E38: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.5331, E=0.8227,
KL=0.1090, wKL=0.9500]
Train E38: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.5331, E=0.8227,
KL=0.1090, wKL=0.9500]
Train E38: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4959, E=0.8238,
KL=0.1073, wKL=0.9500]
Train E38: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4959, E=0.8238,
KL=0.1073, wKL=0.9500]
Train E38: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4676, E=0.8257,
KL=0.1088, wKL=0.9500]
Train E38: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.4676, E=0.8257,
KL=0.1088, wKL=0.9500]
Train E38: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5022, E=0.8219,
KL=0.1098, wKL=0.9500]
Train E38: 40%|████ | 10/25 [00:14<00:20, 1.40s/batch, N=1.5022, E=0.8219,
KL=0.1098, wKL=0.9500]
Train E38: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4515, E=0.8239,
KL=0.1085, wKL=0.9500]
Train E38: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4515, E=0.8239,
KL=0.1085, wKL=0.9500]
Train E38: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.5876, E=0.8209,
KL=0.1102, wKL=0.9500]
Train E38: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5876, E=0.8209,
KL=0.1102, wKL=0.9500]
Train E38: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4640, E=0.8252,
KL=0.1093, wKL=0.9500]
Train E38: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.4640, E=0.8252,
KL=0.1093, wKL=0.9500]
Train E38: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4884, E=0.8259,
KL=0.1089, wKL=0.9500]
Train E38: 56%|█████▌ | 14/25 [00:19<00:16, 1.51s/batch, N=1.4884, E=0.8259,
KL=0.1089, wKL=0.9500]
Train E38: 56%|█████▌ | 14/25 [00:21<00:16, 1.51s/batch, N=1.4373, E=0.8257,
KL=0.1082, wKL=0.9500]
Train E38: 60%|██████ | 15/25 [00:21<00:15, 1.51s/batch, N=1.4373, E=0.8257,
KL=0.1082, wKL=0.9500]
Train E38: 60%|██████ | 15/25 [00:22<00:15, 1.51s/batch, N=1.5021, E=0.8243,
KL=0.1090, wKL=0.9500]
Train E38: 64%|██████▍ | 16/25 [00:22<00:13, 1.48s/batch, N=1.5021, E=0.8243,
KL=0.1090, wKL=0.9500]
Train E38: 64%|██████▍ | 16/25 [00:24<00:13, 1.48s/batch, N=1.5413, E=0.8266,
KL=0.1100, wKL=0.9500]
Train E38: 68%|██████▊ | 17/25 [00:24<00:11, 1.45s/batch, N=1.5413, E=0.8266,
KL=0.1100, wKL=0.9500]
Train E38: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.5677, E=0.8260,
KL=0.1085, wKL=0.9500]
Train E38: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.5677, E=0.8260,
KL=0.1085, wKL=0.9500]
Train E38: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.3979, E=0.8245,
KL=0.1075, wKL=0.9500]
Train E38: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.3979, E=0.8245,
KL=0.1075, wKL=0.9500]
Train E38: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.4382, E=0.8224,
KL=0.1079, wKL=0.9500]
Train E38: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4382, E=0.8224,
KL=0.1079, wKL=0.9500]
Train E38: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.4632, E=0.8234,
KL=0.1076, wKL=0.9500]
Train E38: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4632, E=0.8234,
KL=0.1076, wKL=0.9500]
Train E38: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4659, E=0.8185,
KL=0.1075, wKL=0.9500]
Train E38: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4659, E=0.8185,
KL=0.1075, wKL=0.9500]
Train E38: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5386, E=0.8241,
KL=0.1066, wKL=0.9500]
Train E38: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.5386, E=0.8241,
KL=0.1066, wKL=0.9500]
Train E38: 92%|█████████▏| 23/25 [00:34<00:02, 1.42s/batch, N=1.5154, E=0.8234,
KL=0.1064, wKL=0.9500]
Train E38: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5154, E=0.8234,
KL=0.1064, wKL=0.9500]
Train E38: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5601, E=0.8308,
KL=0.1049, wKL=0.9500]
Train E38: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.5601, E=0.8308,
KL=0.1049, wKL=0.9500]
Train E38: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5601, E=0.8308,
KL=0.1049, wKL=0.9500]
1382.9s 150 [Epoch 038] Total: 2.3649 | N: 1.4890 | E: 0.8241 | KL(0.95×0.5):
0.1091
1417.8s 151 Train E39: 0%| | 0/25 [00:00<?, ?batch/s]
Train E39: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5663, E=0.8283, KL=0.1038,
wKL=0.9750]
Train E39: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5663, E=0.8283,
KL=0.1038, wKL=0.9750]
Train E39: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4691, E=0.8250,
KL=0.1042, wKL=0.9750]
Train E39: 8%|▊ | 2/25 [00:02<00:30, 1.35s/batch, N=1.4691, E=0.8250,
KL=0.1042, wKL=0.9750]
Train E39: 8%|▊ | 2/25 [00:04<00:30, 1.35s/batch, N=1.5091, E=0.8242,
KL=0.1039, wKL=0.9750]
Train E39: 12%|█▏ | 3/25 [00:04<00:36, 1.64s/batch, N=1.5091, E=0.8242,
KL=0.1039, wKL=0.9750]
Train E39: 12%|█▏ | 3/25 [00:06<00:36, 1.64s/batch, N=1.5274, E=0.8254,
KL=0.1034, wKL=0.9750]
Train E39: 16%|█▌ | 4/25 [00:06<00:32, 1.53s/batch, N=1.5274, E=0.8254,
KL=0.1034, wKL=0.9750]
Train E39: 16%|█▌ | 4/25 [00:07<00:32, 1.53s/batch, N=1.4498, E=0.8186,
KL=0.1039, wKL=0.9750]
Train E39: 20%|██ | 5/25 [00:07<00:29, 1.49s/batch, N=1.4498, E=0.8186,
KL=0.1039, wKL=0.9750]
Train E39: 20%|██ | 5/25 [00:08<00:29, 1.49s/batch, N=1.4019, E=0.8267,
KL=0.1015, wKL=0.9750]
Train E39: 24%|██▍ | 6/25 [00:08<00:27, 1.45s/batch, N=1.4019, E=0.8267,
KL=0.1015, wKL=0.9750]
Train E39: 24%|██▍ | 6/25 [00:10<00:27, 1.45s/batch, N=1.4461, E=0.8224,
KL=0.1029, wKL=0.9750]
Train E39: 28%|██▊ | 7/25 [00:10<00:25, 1.43s/batch, N=1.4461, E=0.8224,
KL=0.1029, wKL=0.9750]
Train E39: 28%|██▊ | 7/25 [00:11<00:25, 1.43s/batch, N=1.5134, E=0.8244,
KL=0.1037, wKL=0.9750]
Train E39: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5134, E=0.8244,
KL=0.1037, wKL=0.9750]
Train E39: 32%|███▏ | 8/25 [00:13<00:24, 1.42s/batch, N=1.4426, E=0.8210,
KL=0.1026, wKL=0.9750]
Train E39: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.4426, E=0.8210,
KL=0.1026, wKL=0.9750]
Train E39: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5256, E=0.8261,
KL=0.1022, wKL=0.9750]
Train E39: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.5256, E=0.8261,
KL=0.1022, wKL=0.9750]
Train E39: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4434, E=0.8220,
KL=0.1027, wKL=0.9750]
Train E39: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4434, E=0.8220,
KL=0.1027, wKL=0.9750]
Train E39: 44%|████▍ | 11/25 [00:17<00:19, 1.41s/batch, N=1.5193, E=0.8200,
KL=0.1035, wKL=0.9750]
Train E39: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.5193, E=0.8200,
KL=0.1035, wKL=0.9750]
Train E39: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.4843, E=0.8254,
KL=0.1043, wKL=0.9750]
Train E39: 52%|█████▏ | 13/25 [00:18<00:17, 1.46s/batch, N=1.4843, E=0.8254,
KL=0.1043, wKL=0.9750]
Train E39: 52%|█████▏ | 13/25 [00:20<00:17, 1.46s/batch, N=1.4627, E=0.8220,
KL=0.1030, wKL=0.9750]
Train E39: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4627, E=0.8220,
KL=0.1030, wKL=0.9750]
Train E39: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.4044, E=0.8224,
KL=0.1012, wKL=0.9750]
Train E39: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4044, E=0.8224,
KL=0.1012, wKL=0.9750]
Train E39: 60%|██████ | 15/25 [00:23<00:14, 1.42s/batch, N=1.5735, E=0.8245,
KL=0.1029, wKL=0.9750]
Train E39: 64%|██████▍ | 16/25 [00:23<00:12, 1.42s/batch, N=1.5735, E=0.8245,
KL=0.1029, wKL=0.9750]
Train E39: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.4624, E=0.8236,
KL=0.1002, wKL=0.9750]
Train E39: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4624, E=0.8236,
KL=0.1002, wKL=0.9750]
Train E39: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.3989, E=0.8231,
KL=0.1024, wKL=0.9750]
Train E39: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.3989, E=0.8231,
KL=0.1024, wKL=0.9750]
Train E39: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5731, E=0.8240,
KL=0.1013, wKL=0.9750]
Train E39: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5731, E=0.8240,
KL=0.1013, wKL=0.9750]
Train E39: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5301, E=0.8249,
KL=0.0999, wKL=0.9750]
Train E39: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5301, E=0.8249,
KL=0.0999, wKL=0.9750]
Train E39: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.5093, E=0.8247,
KL=0.1007, wKL=0.9750]
Train E39: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5093, E=0.8247,
KL=0.1007, wKL=0.9750]
Train E39: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4519, E=0.8224,
KL=0.1003, wKL=0.9750]
Train E39: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4519, E=0.8224,
KL=0.1003, wKL=0.9750]
Train E39: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.5691, E=0.8293,
KL=0.0998, wKL=0.9750]
Train E39: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5691, E=0.8293,
KL=0.0998, wKL=0.9750]
Train E39: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4963, E=0.8251,
KL=0.1032, wKL=0.9750]
Train E39: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4963, E=0.8251,
KL=0.1032, wKL=0.9750]
Train E39: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4479, E=0.8215,
KL=0.1012, wKL=0.9750]
Train E39: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4479, E=0.8215,
KL=0.1012, wKL=0.9750]
Train E39: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4479, E=0.8215,
KL=0.1012, wKL=0.9750]
1417.8s 152 [Epoch 039] Total: 2.3619 | N: 1.4881 | E: 0.8239 | KL(0.97×0.5):
0.1024
1452.8s 153 Train E40: 0%| | 0/25 [00:00<?, ?batch/s]
Train E40: 0%| | 0/25 [00:01<?, ?batch/s, N=1.3696, E=0.8220, KL=0.0987,
wKL=1.0000]
Train E40: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.3696, E=0.8220,
KL=0.0987, wKL=1.0000]
Train E40: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4088, E=0.8241,
KL=0.1001, wKL=1.0000]
Train E40: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4088, E=0.8241,
KL=0.1001, wKL=1.0000]
Train E40: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.5651, E=0.8238,
KL=0.0999, wKL=1.0000]
Train E40: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5651, E=0.8238,
KL=0.0999, wKL=1.0000]
Train E40: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.6169, E=0.8232,
KL=0.0998, wKL=1.0000]
Train E40: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.6169, E=0.8232,
KL=0.0998, wKL=1.0000]
Train E40: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.5516, E=0.8270,
KL=0.0997, wKL=1.0000]
Train E40: 20%|██ | 5/25 [00:06<00:27, 1.40s/batch, N=1.5516, E=0.8270,
KL=0.0997, wKL=1.0000]
Train E40: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.4170, E=0.8236,
KL=0.0993, wKL=1.0000]
Train E40: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4170, E=0.8236,
KL=0.0993, wKL=1.0000]
Train E40: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.6072, E=0.8236,
KL=0.0992, wKL=1.0000]
Train E40: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.6072, E=0.8236,
KL=0.0992, wKL=1.0000]
Train E40: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5424, E=0.8221,
KL=0.0992, wKL=1.0000]
Train E40: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.5424, E=0.8221,
KL=0.0992, wKL=1.0000]
Train E40: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.5120, E=0.8222,
KL=0.0980, wKL=1.0000]
Train E40: 36%|███▌ | 9/25 [00:12<00:23, 1.47s/batch, N=1.5120, E=0.8222,
KL=0.0980, wKL=1.0000]
Train E40: 36%|███▌ | 9/25 [00:14<00:23, 1.47s/batch, N=1.4295, E=0.8260,
KL=0.0966, wKL=1.0000]
Train E40: 40%|████ | 10/25 [00:14<00:22, 1.48s/batch, N=1.4295, E=0.8260,
KL=0.0966, wKL=1.0000]
Train E40: 40%|████ | 10/25 [00:16<00:22, 1.48s/batch, N=1.5243, E=0.8246,
KL=0.0986, wKL=1.0000]
Train E40: 44%|████▍ | 11/25 [00:16<00:22, 1.64s/batch, N=1.5243, E=0.8246,
KL=0.0986, wKL=1.0000]
Train E40: 44%|████▍ | 11/25 [00:17<00:22, 1.64s/batch, N=1.4758, E=0.8238,
KL=0.0974, wKL=1.0000]
Train E40: 48%|████▊ | 12/25 [00:17<00:20, 1.56s/batch, N=1.4758, E=0.8238,
KL=0.0974, wKL=1.0000]
Train E40: 48%|████▊ | 12/25 [00:19<00:20, 1.56s/batch, N=1.5246, E=0.8238,
KL=0.0983, wKL=1.0000]
Train E40: 52%|█████▏ | 13/25 [00:19<00:18, 1.50s/batch, N=1.5246, E=0.8238,
KL=0.0983, wKL=1.0000]
Train E40: 52%|█████▏ | 13/25 [00:20<00:18, 1.50s/batch, N=1.5209, E=0.8293,
KL=0.0968, wKL=1.0000]
Train E40: 56%|█████▌ | 14/25 [00:20<00:16, 1.46s/batch, N=1.5209, E=0.8293,
KL=0.0968, wKL=1.0000]
Train E40: 56%|█████▌ | 14/25 [00:21<00:16, 1.46s/batch, N=1.4759, E=0.8231,
KL=0.0966, wKL=1.0000]
Train E40: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.4759, E=0.8231,
KL=0.0966, wKL=1.0000]
Train E40: 60%|██████ | 15/25 [00:23<00:14, 1.44s/batch, N=1.4312, E=0.8215,
KL=0.0975, wKL=1.0000]
Train E40: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.4312, E=0.8215,
KL=0.0975, wKL=1.0000]
Train E40: 64%|██████▍ | 16/25 [00:24<00:12, 1.44s/batch, N=1.4289, E=0.8247,
KL=0.0971, wKL=1.0000]
Train E40: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.4289, E=0.8247,
KL=0.0971, wKL=1.0000]
Train E40: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.5366, E=0.8221,
KL=0.0965, wKL=1.0000]
Train E40: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5366, E=0.8221,
KL=0.0965, wKL=1.0000]
Train E40: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5216, E=0.8250,
KL=0.0959, wKL=1.0000]
Train E40: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5216, E=0.8250,
KL=0.0959, wKL=1.0000]
Train E40: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5196, E=0.8227,
KL=0.0954, wKL=1.0000]
Train E40: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5196, E=0.8227,
KL=0.0954, wKL=1.0000]
Train E40: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.4418, E=0.8174,
KL=0.0963, wKL=1.0000]
Train E40: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.4418, E=0.8174,
KL=0.0963, wKL=1.0000]
Train E40: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.4178, E=0.8258,
KL=0.0947, wKL=1.0000]
Train E40: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4178, E=0.8258,
KL=0.0947, wKL=1.0000]
Train E40: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4469, E=0.8241,
KL=0.0962, wKL=1.0000]
Train E40: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4469, E=0.8241,
KL=0.0962, wKL=1.0000]
Train E40: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5050, E=0.8180,
KL=0.0967, wKL=1.0000]
Train E40: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5050, E=0.8180,
KL=0.0967, wKL=1.0000]
Train E40: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.2705, E=0.8167,
KL=0.0937, wKL=1.0000]
Train E40: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.2705, E=0.8167,
KL=0.0937, wKL=1.0000]
Train E40: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.2705, E=0.8167,
KL=0.0937, wKL=1.0000]
1452.8s 154 [Epoch 040] Total: 2.3597 | N: 1.4875 | E: 0.8234 | KL(1.00×0.5):
0.0976
1452.8s 155 Saved checkpoint: /kaggle/working/checkpoints/gvae_40_epoch040.pt
1487.8s 156 Train E41: 0%| | 0/25 [00:00<?, ?batch/s]
Train E41: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4295, E=0.8238, KL=0.0955,
wKL=1.0000]
Train E41: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.4295, E=0.8238,
KL=0.0955, wKL=1.0000]
Train E41: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.5074, E=0.8212,
KL=0.0939, wKL=1.0000]
Train E41: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.5074, E=0.8212,
KL=0.0939, wKL=1.0000]
Train E41: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4313, E=0.8225,
KL=0.0940, wKL=1.0000]
Train E41: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4313, E=0.8225,
KL=0.0940, wKL=1.0000]
Train E41: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.6103, E=0.8265,
KL=0.0960, wKL=1.0000]
Train E41: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.6103, E=0.8265,
KL=0.0960, wKL=1.0000]
Train E41: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.5094, E=0.8247,
KL=0.0928, wKL=1.0000]
Train E41: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5094, E=0.8247,
KL=0.0928, wKL=1.0000]
Train E41: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5119, E=0.8205,
KL=0.0950, wKL=1.0000]
Train E41: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.5119, E=0.8205,
KL=0.0950, wKL=1.0000]
Train E41: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4478, E=0.8201,
KL=0.0936, wKL=1.0000]
Train E41: 28%|██▊ | 7/25 [00:09<00:26, 1.45s/batch, N=1.4478, E=0.8201,
KL=0.0936, wKL=1.0000]
Train E41: 28%|██▊ | 7/25 [00:11<00:26, 1.45s/batch, N=1.4589, E=0.8244,
KL=0.0929, wKL=1.0000]
Train E41: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.4589, E=0.8244,
KL=0.0929, wKL=1.0000]
Train E41: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.5028, E=0.8208,
KL=0.0939, wKL=1.0000]
Train E41: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.5028, E=0.8208,
KL=0.0939, wKL=1.0000]
Train E41: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4986, E=0.8230,
KL=0.0933, wKL=1.0000]
Train E41: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4986, E=0.8230,
KL=0.0933, wKL=1.0000]
Train E41: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4687, E=0.8233,
KL=0.0919, wKL=1.0000]
Train E41: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4687, E=0.8233,
KL=0.0919, wKL=1.0000]
Train E41: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4581, E=0.8253,
KL=0.0944, wKL=1.0000]
Train E41: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4581, E=0.8253,
KL=0.0944, wKL=1.0000]
Train E41: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.5169, E=0.8264,
KL=0.0924, wKL=1.0000]
Train E41: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5169, E=0.8264,
KL=0.0924, wKL=1.0000]
Train E41: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.4955, E=0.8208,
KL=0.0941, wKL=1.0000]
Train E41: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4955, E=0.8208,
KL=0.0941, wKL=1.0000]
Train E41: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4991, E=0.8247,
KL=0.0930, wKL=1.0000]
Train E41: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.4991, E=0.8247,
KL=0.0930, wKL=1.0000]
Train E41: 60%|██████ | 15/25 [00:23<00:13, 1.40s/batch, N=1.5031, E=0.8200,
KL=0.0943, wKL=1.0000]
Train E41: 64%|██████▍ | 16/25 [00:23<00:14, 1.59s/batch, N=1.5031, E=0.8200,
KL=0.0943, wKL=1.0000]
Train E41: 64%|██████▍ | 16/25 [00:24<00:14, 1.59s/batch, N=1.5046, E=0.8189,
KL=0.0959, wKL=1.0000]
Train E41: 68%|██████▊ | 17/25 [00:24<00:12, 1.53s/batch, N=1.5046, E=0.8189,
KL=0.0959, wKL=1.0000]
Train E41: 68%|██████▊ | 17/25 [00:25<00:12, 1.53s/batch, N=1.5181, E=0.8237,
KL=0.0933, wKL=1.0000]
Train E41: 72%|███████▏ | 18/25 [00:25<00:10, 1.48s/batch, N=1.5181, E=0.8237,
KL=0.0933, wKL=1.0000]
Train E41: 72%|███████▏ | 18/25 [00:27<00:10, 1.48s/batch, N=1.4315, E=0.8232,
KL=0.0941, wKL=1.0000]
Train E41: 76%|███████▌ | 19/25 [00:27<00:08, 1.46s/batch, N=1.4315, E=0.8232,
KL=0.0941, wKL=1.0000]
Train E41: 76%|███████▌ | 19/25 [00:28<00:08, 1.46s/batch, N=1.4045, E=0.8238,
KL=0.0940, wKL=1.0000]
Train E41: 80%|████████ | 20/25 [00:28<00:07, 1.45s/batch, N=1.4045, E=0.8238,
KL=0.0940, wKL=1.0000]
Train E41: 80%|████████ | 20/25 [00:30<00:07, 1.45s/batch, N=1.5760, E=0.8215,
KL=0.0949, wKL=1.0000]
Train E41: 84%|████████▍ | 21/25 [00:30<00:05, 1.45s/batch, N=1.5760, E=0.8215,
KL=0.0949, wKL=1.0000]
Train E41: 84%|████████▍ | 21/25 [00:31<00:05, 1.45s/batch, N=1.4635, E=0.8189,
KL=0.0929, wKL=1.0000]
Train E41: 88%|████████▊ | 22/25 [00:31<00:04, 1.43s/batch, N=1.4635, E=0.8189,
KL=0.0929, wKL=1.0000]
Train E41: 88%|████████▊ | 22/25 [00:32<00:04, 1.43s/batch, N=1.4638, E=0.8204,
KL=0.0927, wKL=1.0000]
Train E41: 92%|█████████▏| 23/25 [00:32<00:02, 1.43s/batch, N=1.4638, E=0.8204,
KL=0.0927, wKL=1.0000]
Train E41: 92%|█████████▏| 23/25 [00:34<00:02, 1.43s/batch, N=1.5066, E=0.8214,
KL=0.0907, wKL=1.0000]
Train E41: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5066, E=0.8214,
KL=0.0907, wKL=1.0000]
Train E41: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4054, E=0.8254,
KL=0.0902, wKL=1.0000]
Train E41: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4054, E=0.8254,
KL=0.0902, wKL=1.0000]
Train E41: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4054, E=0.8254,
KL=0.0902, wKL=1.0000]
1487.8s 157 [Epoch 041] Total: 2.3562 | N: 1.4868 | E: 0.8225 | KL(1.00×0.5):
0.0937
1522.8s 158 Train E42: 0%| | 0/25 [00:00<?, ?batch/s]
Train E42: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5679, E=0.8247, KL=0.0900,
wKL=1.0000]
Train E42: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5679, E=0.8247,
KL=0.0900, wKL=1.0000]
Train E42: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4692, E=0.8231,
KL=0.0891, wKL=1.0000]
Train E42: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4692, E=0.8231,
KL=0.0891, wKL=1.0000]
Train E42: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4993, E=0.8243,
KL=0.0893, wKL=1.0000]
Train E42: 12%|█▏ | 3/25 [00:04<00:30, 1.40s/batch, N=1.4993, E=0.8243,
KL=0.0893, wKL=1.0000]
Train E42: 12%|█▏ | 3/25 [00:05<00:30, 1.40s/batch, N=1.5005, E=0.8239,
KL=0.0894, wKL=1.0000]
Train E42: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.5005, E=0.8239,
KL=0.0894, wKL=1.0000]
Train E42: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4276, E=0.8201,
KL=0.0894, wKL=1.0000]
Train E42: 20%|██ | 5/25 [00:07<00:30, 1.51s/batch, N=1.4276, E=0.8201,
KL=0.0894, wKL=1.0000]
Train E42: 20%|██ | 5/25 [00:08<00:30, 1.51s/batch, N=1.5631, E=0.8267,
KL=0.0908, wKL=1.0000]
Train E42: 24%|██▍ | 6/25 [00:08<00:28, 1.48s/batch, N=1.5631, E=0.8267,
KL=0.0908, wKL=1.0000]
Train E42: 24%|██▍ | 6/25 [00:10<00:28, 1.48s/batch, N=1.4921, E=0.8216,
KL=0.0902, wKL=1.0000]
Train E42: 28%|██▊ | 7/25 [00:10<00:26, 1.45s/batch, N=1.4921, E=0.8216,
KL=0.0902, wKL=1.0000]
Train E42: 28%|██▊ | 7/25 [00:11<00:26, 1.45s/batch, N=1.5528, E=0.8221,
KL=0.0926, wKL=1.0000]
Train E42: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.5528, E=0.8221,
KL=0.0926, wKL=1.0000]
Train E42: 32%|███▏ | 8/25 [00:12<00:24, 1.42s/batch, N=1.4827, E=0.8232,
KL=0.0912, wKL=1.0000]
Train E42: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.4827, E=0.8232,
KL=0.0912, wKL=1.0000]
Train E42: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.5228, E=0.8163,
KL=0.0920, wKL=1.0000]
Train E42: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.5228, E=0.8163,
KL=0.0920, wKL=1.0000]
Train E42: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.3665, E=0.8197,
KL=0.0921, wKL=1.0000]
Train E42: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.3665, E=0.8197,
KL=0.0921, wKL=1.0000]
Train E42: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.4547, E=0.8205,
KL=0.0900, wKL=1.0000]
Train E42: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4547, E=0.8205,
KL=0.0900, wKL=1.0000]
Train E42: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4777, E=0.8198,
KL=0.0913, wKL=1.0000]
Train E42: 52%|█████▏ | 13/25 [00:18<00:16, 1.38s/batch, N=1.4777, E=0.8198,
KL=0.0913, wKL=1.0000]
Train E42: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.4691, E=0.8234,
KL=0.0908, wKL=1.0000]
Train E42: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.4691, E=0.8234,
KL=0.0908, wKL=1.0000]
Train E42: 56%|█████▌ | 14/25 [00:21<00:15, 1.38s/batch, N=1.4833, E=0.8217,
KL=0.0901, wKL=1.0000]
Train E42: 60%|██████ | 15/25 [00:21<00:13, 1.38s/batch, N=1.4833, E=0.8217,
KL=0.0901, wKL=1.0000]
Train E42: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4593, E=0.8242,
KL=0.0894, wKL=1.0000]
Train E42: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4593, E=0.8242,
KL=0.0894, wKL=1.0000]
Train E42: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4372, E=0.8232,
KL=0.0893, wKL=1.0000]
Train E42: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.4372, E=0.8232,
KL=0.0893, wKL=1.0000]
Train E42: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.5919, E=0.8243,
KL=0.0901, wKL=1.0000]
Train E42: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.5919, E=0.8243,
KL=0.0901, wKL=1.0000]
Train E42: 72%|███████▏ | 18/25 [00:27<00:09, 1.39s/batch, N=1.5327, E=0.8221,
KL=0.0902, wKL=1.0000]
Train E42: 76%|███████▌ | 19/25 [00:27<00:09, 1.59s/batch, N=1.5327, E=0.8221,
KL=0.0902, wKL=1.0000]
Train E42: 76%|███████▌ | 19/25 [00:28<00:09, 1.59s/batch, N=1.4933, E=0.8172,
KL=0.0898, wKL=1.0000]
Train E42: 80%|████████ | 20/25 [00:28<00:07, 1.53s/batch, N=1.4933, E=0.8172,
KL=0.0898, wKL=1.0000]
Train E42: 80%|████████ | 20/25 [00:30<00:07, 1.53s/batch, N=1.5468, E=0.8256,
KL=0.0910, wKL=1.0000]
Train E42: 84%|████████▍ | 21/25 [00:30<00:05, 1.50s/batch, N=1.5468, E=0.8256,
KL=0.0910, wKL=1.0000]
Train E42: 84%|████████▍ | 21/25 [00:31<00:05, 1.50s/batch, N=1.4116, E=0.8284,
KL=0.0887, wKL=1.0000]
Train E42: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4116, E=0.8284,
KL=0.0887, wKL=1.0000]
Train E42: 88%|████████▊ | 22/25 [00:33<00:04, 1.46s/batch, N=1.4466, E=0.8237,
KL=0.0892, wKL=1.0000]
Train E42: 92%|█████████▏| 23/25 [00:33<00:02, 1.46s/batch, N=1.4466, E=0.8237,
KL=0.0892, wKL=1.0000]
Train E42: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.4603, E=0.8193,
KL=0.0892, wKL=1.0000]
Train E42: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4603, E=0.8193,
KL=0.0892, wKL=1.0000]
Train E42: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.4155, E=0.8167,
KL=0.0890, wKL=1.0000]
Train E42: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4155, E=0.8167,
KL=0.0890, wKL=1.0000]
Train E42: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4155, E=0.8167,
KL=0.0890, wKL=1.0000]
1522.8s 159 [Epoch 042] Total: 2.3541 | N: 1.4866 | E: 0.8224 | KL(1.00×0.5):
0.0902
1558.0s 160 Train E43: 0%| | 0/25 [00:00<?, ?batch/s]
Train E43: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4376, E=0.8203, KL=0.0887,
wKL=1.0000]
Train E43: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.4376, E=0.8203,
KL=0.0887, wKL=1.0000]
Train E43: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4334, E=0.8222,
KL=0.0875, wKL=1.0000]
Train E43: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.4334, E=0.8222,
KL=0.0875, wKL=1.0000]
Train E43: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.4085, E=0.8206,
KL=0.0869, wKL=1.0000]
Train E43: 12%|█▏ | 3/25 [00:04<00:31, 1.43s/batch, N=1.4085, E=0.8206,
KL=0.0869, wKL=1.0000]
Train E43: 12%|█▏ | 3/25 [00:05<00:31, 1.43s/batch, N=1.4554, E=0.8175,
KL=0.0886, wKL=1.0000]
Train E43: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4554, E=0.8175,
KL=0.0886, wKL=1.0000]
Train E43: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.4750, E=0.8217,
KL=0.0868, wKL=1.0000]
Train E43: 20%|██ | 5/25 [00:07<00:28, 1.40s/batch, N=1.4750, E=0.8217,
KL=0.0868, wKL=1.0000]
Train E43: 20%|██ | 5/25 [00:08<00:28, 1.40s/batch, N=1.5020, E=0.8252,
KL=0.0877, wKL=1.0000]
Train E43: 24%|██▍ | 6/25 [00:08<00:27, 1.43s/batch, N=1.5020, E=0.8252,
KL=0.0877, wKL=1.0000]
Train E43: 24%|██▍ | 6/25 [00:09<00:27, 1.43s/batch, N=1.5210, E=0.8181,
KL=0.0879, wKL=1.0000]
Train E43: 28%|██▊ | 7/25 [00:09<00:25, 1.44s/batch, N=1.5210, E=0.8181,
KL=0.0879, wKL=1.0000]
Train E43: 28%|██▊ | 7/25 [00:11<00:25, 1.44s/batch, N=1.4709, E=0.8267,
KL=0.0871, wKL=1.0000]
Train E43: 32%|███▏ | 8/25 [00:11<00:24, 1.43s/batch, N=1.4709, E=0.8267,
KL=0.0871, wKL=1.0000]
Train E43: 32%|███▏ | 8/25 [00:12<00:24, 1.43s/batch, N=1.5285, E=0.8248,
KL=0.0880, wKL=1.0000]
Train E43: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.5285, E=0.8248,
KL=0.0880, wKL=1.0000]
Train E43: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.4978, E=0.8221,
KL=0.0889, wKL=1.0000]
Train E43: 40%|████ | 10/25 [00:14<00:20, 1.40s/batch, N=1.4978, E=0.8221,
KL=0.0889, wKL=1.0000]
Train E43: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4292, E=0.8223,
KL=0.0887, wKL=1.0000]
Train E43: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4292, E=0.8223,
KL=0.0887, wKL=1.0000]
Train E43: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4865, E=0.8197,
KL=0.0885, wKL=1.0000]
Train E43: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4865, E=0.8197,
KL=0.0885, wKL=1.0000]
Train E43: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4476, E=0.8265,
KL=0.0875, wKL=1.0000]
Train E43: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4476, E=0.8265,
KL=0.0875, wKL=1.0000]
Train E43: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4860, E=0.8204,
KL=0.0866, wKL=1.0000]
Train E43: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.4860, E=0.8204,
KL=0.0866, wKL=1.0000]
Train E43: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5598, E=0.8239,
KL=0.0890, wKL=1.0000]
Train E43: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5598, E=0.8239,
KL=0.0890, wKL=1.0000]
Train E43: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.4854, E=0.8245,
KL=0.0862, wKL=1.0000]
Train E43: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4854, E=0.8245,
KL=0.0862, wKL=1.0000]
Train E43: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4742, E=0.8242,
KL=0.0865, wKL=1.0000]
Train E43: 68%|██████▊ | 17/25 [00:23<00:11, 1.41s/batch, N=1.4742, E=0.8242,
KL=0.0865, wKL=1.0000]
Train E43: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5693, E=0.8223,
KL=0.0868, wKL=1.0000]
Train E43: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5693, E=0.8223,
KL=0.0868, wKL=1.0000]
Train E43: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4116, E=0.8206,
KL=0.0861, wKL=1.0000]
Train E43: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4116, E=0.8206,
KL=0.0861, wKL=1.0000]
Train E43: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.5046, E=0.8238,
KL=0.0863, wKL=1.0000]
Train E43: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5046, E=0.8238,
KL=0.0863, wKL=1.0000]
Train E43: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.4901, E=0.8210,
KL=0.0879, wKL=1.0000]
Train E43: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4901, E=0.8210,
KL=0.0879, wKL=1.0000]
Train E43: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5365, E=0.8272,
KL=0.0863, wKL=1.0000]
Train E43: 88%|████████▊ | 22/25 [00:30<00:04, 1.40s/batch, N=1.5365, E=0.8272,
KL=0.0863, wKL=1.0000]
Train E43: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4894, E=0.8221,
KL=0.0868, wKL=1.0000]
Train E43: 92%|█████████▏| 23/25 [00:32<00:03, 1.58s/batch, N=1.4894, E=0.8221,
KL=0.0868, wKL=1.0000]
Train E43: 92%|█████████▏| 23/25 [00:34<00:03, 1.58s/batch, N=1.4767, E=0.8214,
KL=0.0859, wKL=1.0000]
Train E43: 96%|█████████▌| 24/25 [00:34<00:01, 1.57s/batch, N=1.4767, E=0.8214,
KL=0.0859, wKL=1.0000]
Train E43: 96%|█████████▌| 24/25 [00:35<00:01, 1.57s/batch, N=1.6989, E=0.8144,
KL=0.0906, wKL=1.0000]
Train E43: 100%|██████████| 25/25 [00:35<00:00, 1.30s/batch, N=1.6989, E=0.8144,
KL=0.0906, wKL=1.0000]
Train E43: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.6989, E=0.8144,
KL=0.0906, wKL=1.0000]
1558.0s 161 [Epoch 043] Total: 2.3521 | N: 1.4860 | E: 0.8223 | KL(1.00×0.5):
0.0874
1592.4s 162 Train E44: 0%| | 0/25 [00:00<?, ?batch/s]
Train E44: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5135, E=0.8233, KL=0.0859,
wKL=1.0000]
Train E44: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.5135, E=0.8233,
KL=0.0859, wKL=1.0000]
Train E44: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.4919, E=0.8259,
KL=0.0862, wKL=1.0000]
Train E44: 8%|▊ | 2/25 [00:02<00:32, 1.41s/batch, N=1.4919, E=0.8259,
KL=0.0862, wKL=1.0000]
Train E44: 8%|▊ | 2/25 [00:04<00:32, 1.41s/batch, N=1.5137, E=0.8189,
KL=0.0862, wKL=1.0000]
Train E44: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5137, E=0.8189,
KL=0.0862, wKL=1.0000]
Train E44: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5374, E=0.8227,
KL=0.0866, wKL=1.0000]
Train E44: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5374, E=0.8227,
KL=0.0866, wKL=1.0000]
Train E44: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5020, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5020, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4688, E=0.8241,
KL=0.0847, wKL=1.0000]
Train E44: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4688, E=0.8241,
KL=0.0847, wKL=1.0000]
Train E44: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4678, E=0.8250,
KL=0.0855, wKL=1.0000]
Train E44: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4678, E=0.8250,
KL=0.0855, wKL=1.0000]
Train E44: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.5351, E=0.8278,
KL=0.0862, wKL=1.0000]
Train E44: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5351, E=0.8278,
KL=0.0862, wKL=1.0000]
Train E44: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4878, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4878, E=0.8218,
KL=0.0859, wKL=1.0000]
Train E44: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.4413, E=0.8215,
KL=0.0852, wKL=1.0000]
Train E44: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4413, E=0.8215,
KL=0.0852, wKL=1.0000]
Train E44: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4637, E=0.8221,
KL=0.0855, wKL=1.0000]
Train E44: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4637, E=0.8221,
KL=0.0855, wKL=1.0000]
Train E44: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5963, E=0.8214,
KL=0.0871, wKL=1.0000]
Train E44: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.5963, E=0.8214,
KL=0.0871, wKL=1.0000]
Train E44: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.3942, E=0.8189,
KL=0.0854, wKL=1.0000]
Train E44: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.3942, E=0.8189,
KL=0.0854, wKL=1.0000]
Train E44: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.6081, E=0.8221,
KL=0.0860, wKL=1.0000]
Train E44: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.6081, E=0.8221,
KL=0.0860, wKL=1.0000]
Train E44: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.3839, E=0.8217,
KL=0.0862, wKL=1.0000]
Train E44: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.3839, E=0.8217,
KL=0.0862, wKL=1.0000]
Train E44: 60%|██████ | 15/25 [00:22<00:14, 1.41s/batch, N=1.3449, E=0.8229,
KL=0.0844, wKL=1.0000]
Train E44: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.3449, E=0.8229,
KL=0.0844, wKL=1.0000]
Train E44: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4630, E=0.8242,
KL=0.0863, wKL=1.0000]
Train E44: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4630, E=0.8242,
KL=0.0863, wKL=1.0000]
Train E44: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4858, E=0.8177,
KL=0.0861, wKL=1.0000]
Train E44: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.4858, E=0.8177,
KL=0.0861, wKL=1.0000]
Train E44: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4612, E=0.8204,
KL=0.0858, wKL=1.0000]
Train E44: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4612, E=0.8204,
KL=0.0858, wKL=1.0000]
Train E44: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.4517, E=0.8208,
KL=0.0855, wKL=1.0000]
Train E44: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.4517, E=0.8208,
KL=0.0855, wKL=1.0000]
Train E44: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5385, E=0.8196,
KL=0.0849, wKL=1.0000]
Train E44: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5385, E=0.8196,
KL=0.0849, wKL=1.0000]
Train E44: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.5211, E=0.8272,
KL=0.0836, wKL=1.0000]
Train E44: 88%|████████▊ | 22/25 [00:30<00:04, 1.43s/batch, N=1.5211, E=0.8272,
KL=0.0836, wKL=1.0000]
Train E44: 88%|████████▊ | 22/25 [00:32<00:04, 1.43s/batch, N=1.5204, E=0.8202,
KL=0.0833, wKL=1.0000]
Train E44: 92%|█████████▏| 23/25 [00:32<00:02, 1.47s/batch, N=1.5204, E=0.8202,
KL=0.0833, wKL=1.0000]
Train E44: 92%|█████████▏| 23/25 [00:33<00:02, 1.47s/batch, N=1.4915, E=0.8211,
KL=0.0840, wKL=1.0000]
Train E44: 96%|█████████▌| 24/25 [00:33<00:01, 1.46s/batch, N=1.4915, E=0.8211,
KL=0.0840, wKL=1.0000]
Train E44: 96%|█████████▌| 24/25 [00:34<00:01, 1.46s/batch, N=1.4126, E=0.8153,
KL=0.0828, wKL=1.0000]
Train E44: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.4126, E=0.8153,
KL=0.0828, wKL=1.0000]
Train E44: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4126, E=0.8153,
KL=0.0828, wKL=1.0000]
1592.4s 163 [Epoch 044] Total: 2.3504 | N: 1.4856 | E: 0.8221 | KL(1.00×0.5):
0.0855
1627.6s 164 Train E45: 0%| | 0/25 [00:00<?, ?batch/s]
Train E45: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4566, E=0.8248, KL=0.0829,
wKL=1.0000]
Train E45: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4566, E=0.8248,
KL=0.0829, wKL=1.0000]
Train E45: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5751, E=0.8237,
KL=0.0836, wKL=1.0000]
Train E45: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5751, E=0.8237,
KL=0.0836, wKL=1.0000]
Train E45: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4525, E=0.8171,
KL=0.0842, wKL=1.0000]
Train E45: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4525, E=0.8171,
KL=0.0842, wKL=1.0000]
Train E45: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.3471, E=0.8157,
KL=0.0834, wKL=1.0000]
Train E45: 16%|█▌ | 4/25 [00:05<00:28, 1.35s/batch, N=1.3471, E=0.8157,
KL=0.0834, wKL=1.0000]
Train E45: 16%|█▌ | 4/25 [00:07<00:28, 1.35s/batch, N=1.4793, E=0.8214,
KL=0.0855, wKL=1.0000]
Train E45: 20%|██ | 5/25 [00:07<00:31, 1.58s/batch, N=1.4793, E=0.8214,
KL=0.0855, wKL=1.0000]
Train E45: 20%|██ | 5/25 [00:08<00:31, 1.58s/batch, N=1.5161, E=0.8222,
KL=0.0856, wKL=1.0000]
Train E45: 24%|██▍ | 6/25 [00:08<00:29, 1.54s/batch, N=1.5161, E=0.8222,
KL=0.0856, wKL=1.0000]
Train E45: 24%|██▍ | 6/25 [00:10<00:29, 1.54s/batch, N=1.4788, E=0.8249,
KL=0.0827, wKL=1.0000]
Train E45: 28%|██▊ | 7/25 [00:10<00:27, 1.50s/batch, N=1.4788, E=0.8249,
KL=0.0827, wKL=1.0000]
Train E45: 28%|██▊ | 7/25 [00:11<00:27, 1.50s/batch, N=1.4658, E=0.8233,
KL=0.0836, wKL=1.0000]
Train E45: 32%|███▏ | 8/25 [00:11<00:24, 1.47s/batch, N=1.4658, E=0.8233,
KL=0.0836, wKL=1.0000]
Train E45: 32%|███▏ | 8/25 [00:13<00:24, 1.47s/batch, N=1.4597, E=0.8182,
KL=0.0835, wKL=1.0000]
Train E45: 36%|███▌ | 9/25 [00:13<00:23, 1.46s/batch, N=1.4597, E=0.8182,
KL=0.0835, wKL=1.0000]
Train E45: 36%|███▌ | 9/25 [00:14<00:23, 1.46s/batch, N=1.5104, E=0.8276,
KL=0.0817, wKL=1.0000]
Train E45: 40%|████ | 10/25 [00:14<00:21, 1.45s/batch, N=1.5104, E=0.8276,
KL=0.0817, wKL=1.0000]
Train E45: 40%|████ | 10/25 [00:15<00:21, 1.45s/batch, N=1.4370, E=0.8272,
KL=0.0828, wKL=1.0000]
Train E45: 44%|████▍ | 11/25 [00:15<00:19, 1.43s/batch, N=1.4370, E=0.8272,
KL=0.0828, wKL=1.0000]
Train E45: 44%|████▍ | 11/25 [00:17<00:19, 1.43s/batch, N=1.5280, E=0.8220,
KL=0.0831, wKL=1.0000]
Train E45: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.5280, E=0.8220,
KL=0.0831, wKL=1.0000]
Train E45: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5195, E=0.8234,
KL=0.0818, wKL=1.0000]
Train E45: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5195, E=0.8234,
KL=0.0818, wKL=1.0000]
Train E45: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4473, E=0.8211,
KL=0.0823, wKL=1.0000]
Train E45: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4473, E=0.8211,
KL=0.0823, wKL=1.0000]
Train E45: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.6315, E=0.8238,
KL=0.0830, wKL=1.0000]
Train E45: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.6315, E=0.8238,
KL=0.0830, wKL=1.0000]
Train E45: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.5951, E=0.8163,
KL=0.0837, wKL=1.0000]
Train E45: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5951, E=0.8163,
KL=0.0837, wKL=1.0000]
Train E45: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.4949, E=0.8217,
KL=0.0831, wKL=1.0000]
Train E45: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4949, E=0.8217,
KL=0.0831, wKL=1.0000]
Train E45: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.4731, E=0.8241,
KL=0.0812, wKL=1.0000]
Train E45: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4731, E=0.8241,
KL=0.0812, wKL=1.0000]
Train E45: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4672, E=0.8301,
KL=0.0817, wKL=1.0000]
Train E45: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.4672, E=0.8301,
KL=0.0817, wKL=1.0000]
Train E45: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4109, E=0.8223,
KL=0.0828, wKL=1.0000]
Train E45: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.4109, E=0.8223,
KL=0.0828, wKL=1.0000]
Train E45: 80%|████████ | 20/25 [00:30<00:07, 1.50s/batch, N=1.4622, E=0.8208,
KL=0.0813, wKL=1.0000]
Train E45: 84%|████████▍ | 21/25 [00:30<00:05, 1.47s/batch, N=1.4622, E=0.8208,
KL=0.0813, wKL=1.0000]
Train E45: 84%|████████▍ | 21/25 [00:31<00:05, 1.47s/batch, N=1.4943, E=0.8205,
KL=0.0822, wKL=1.0000]
Train E45: 88%|████████▊ | 22/25 [00:31<00:04, 1.45s/batch, N=1.4943, E=0.8205,
KL=0.0822, wKL=1.0000]
Train E45: 88%|████████▊ | 22/25 [00:33<00:04, 1.45s/batch, N=1.4580, E=0.8217,
KL=0.0820, wKL=1.0000]
Train E45: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.4580, E=0.8217,
KL=0.0820, wKL=1.0000]
Train E45: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.5441, E=0.8238,
KL=0.0818, wKL=1.0000]
Train E45: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.5441, E=0.8238,
KL=0.0818, wKL=1.0000]
Train E45: 96%|█████████▌| 24/25 [00:35<00:01, 1.42s/batch, N=1.4128, E=0.8160,
KL=0.0819, wKL=1.0000]
Train E45: 100%|██████████| 25/25 [00:35<00:00, 1.18s/batch, N=1.4128, E=0.8160,
KL=0.0819, wKL=1.0000]
Train E45: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4128, E=0.8160,
KL=0.0819, wKL=1.0000]
1627.6s 165 [Epoch 045] Total: 2.3502 | N: 1.4864 | E: 0.8223 | KL(1.00×0.5):
0.0829
1662.7s 166 Train E46: 0%| | 0/25 [00:00<?, ?batch/s]
Train E46: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4966, E=0.8271, KL=0.0809,
wKL=1.0000]
Train E46: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4966, E=0.8271,
KL=0.0809, wKL=1.0000]
Train E46: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.6388, E=0.8261,
KL=0.0831, wKL=1.0000]
Train E46: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.6388, E=0.8261,
KL=0.0831, wKL=1.0000]
Train E46: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4539, E=0.8223,
KL=0.0807, wKL=1.0000]
Train E46: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4539, E=0.8223,
KL=0.0807, wKL=1.0000]
Train E46: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4620, E=0.8223,
KL=0.0815, wKL=1.0000]
Train E46: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4620, E=0.8223,
KL=0.0815, wKL=1.0000]
Train E46: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4527, E=0.8231,
KL=0.0815, wKL=1.0000]
Train E46: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4527, E=0.8231,
KL=0.0815, wKL=1.0000]
Train E46: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4528, E=0.8258,
KL=0.0810, wKL=1.0000]
Train E46: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4528, E=0.8258,
KL=0.0810, wKL=1.0000]
Train E46: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4294, E=0.8198,
KL=0.0808, wKL=1.0000]
Train E46: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4294, E=0.8198,
KL=0.0808, wKL=1.0000]
Train E46: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5097, E=0.8203,
KL=0.0816, wKL=1.0000]
Train E46: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.5097, E=0.8203,
KL=0.0816, wKL=1.0000]
Train E46: 32%|███▏ | 8/25 [00:13<00:23, 1.41s/batch, N=1.4672, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 36%|███▌ | 9/25 [00:13<00:25, 1.62s/batch, N=1.4672, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 36%|███▌ | 9/25 [00:14<00:25, 1.62s/batch, N=1.5093, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 40%|████ | 10/25 [00:14<00:23, 1.54s/batch, N=1.5093, E=0.8235,
KL=0.0812, wKL=1.0000]
Train E46: 40%|████ | 10/25 [00:16<00:23, 1.54s/batch, N=1.5525, E=0.8247,
KL=0.0813, wKL=1.0000]
Train E46: 44%|████▍ | 11/25 [00:16<00:20, 1.50s/batch, N=1.5525, E=0.8247,
KL=0.0813, wKL=1.0000]
Train E46: 44%|████▍ | 11/25 [00:17<00:20, 1.50s/batch, N=1.4712, E=0.8233,
KL=0.0796, wKL=1.0000]
Train E46: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.4712, E=0.8233,
KL=0.0796, wKL=1.0000]
Train E46: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.4193, E=0.8175,
KL=0.0798, wKL=1.0000]
Train E46: 52%|█████▏ | 13/25 [00:18<00:17, 1.44s/batch, N=1.4193, E=0.8175,
KL=0.0798, wKL=1.0000]
Train E46: 52%|█████▏ | 13/25 [00:20<00:17, 1.44s/batch, N=1.4009, E=0.8212,
KL=0.0791, wKL=1.0000]
Train E46: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4009, E=0.8212,
KL=0.0791, wKL=1.0000]
Train E46: 56%|█████▌ | 14/25 [00:21<00:15, 1.43s/batch, N=1.4093, E=0.8173,
KL=0.0792, wKL=1.0000]
Train E46: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4093, E=0.8173,
KL=0.0792, wKL=1.0000]
Train E46: 60%|██████ | 15/25 [00:23<00:14, 1.42s/batch, N=1.5214, E=0.8253,
KL=0.0805, wKL=1.0000]
Train E46: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.5214, E=0.8253,
KL=0.0805, wKL=1.0000]
Train E46: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.5000, E=0.8233,
KL=0.0791, wKL=1.0000]
Train E46: 68%|██████▊ | 17/25 [00:24<00:11, 1.48s/batch, N=1.5000, E=0.8233,
KL=0.0791, wKL=1.0000]
Train E46: 68%|██████▊ | 17/25 [00:26<00:11, 1.48s/batch, N=1.4953, E=0.8156,
KL=0.0802, wKL=1.0000]
Train E46: 72%|███████▏ | 18/25 [00:26<00:10, 1.48s/batch, N=1.4953, E=0.8156,
KL=0.0802, wKL=1.0000]
Train E46: 72%|███████▏ | 18/25 [00:27<00:10, 1.48s/batch, N=1.4897, E=0.8244,
KL=0.0796, wKL=1.0000]
Train E46: 76%|███████▌ | 19/25 [00:27<00:08, 1.45s/batch, N=1.4897, E=0.8244,
KL=0.0796, wKL=1.0000]
Train E46: 76%|███████▌ | 19/25 [00:28<00:08, 1.45s/batch, N=1.5558, E=0.8178,
KL=0.0799, wKL=1.0000]
Train E46: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.5558, E=0.8178,
KL=0.0799, wKL=1.0000]
Train E46: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4294, E=0.8221,
KL=0.0795, wKL=1.0000]
Train E46: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4294, E=0.8221,
KL=0.0795, wKL=1.0000]
Train E46: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5338, E=0.8214,
KL=0.0790, wKL=1.0000]
Train E46: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5338, E=0.8214,
KL=0.0790, wKL=1.0000]
Train E46: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.4331, E=0.8269,
KL=0.0784, wKL=1.0000]
Train E46: 92%|█████████▏| 23/25 [00:33<00:02, 1.43s/batch, N=1.4331, E=0.8269,
KL=0.0784, wKL=1.0000]
Train E46: 92%|█████████▏| 23/25 [00:34<00:02, 1.43s/batch, N=1.5634, E=0.8249,
KL=0.0794, wKL=1.0000]
Train E46: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5634, E=0.8249,
KL=0.0794, wKL=1.0000]
Train E46: 96%|█████████▌| 24/25 [00:35<00:01, 1.41s/batch, N=1.5010, E=0.8203,
KL=0.0796, wKL=1.0000]
Train E46: 100%|██████████| 25/25 [00:35<00:00, 1.18s/batch, N=1.5010, E=0.8203,
KL=0.0796, wKL=1.0000]
Train E46: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.5010, E=0.8203,
KL=0.0796, wKL=1.0000]
1662.7s 167 [Epoch 046] Total: 2.3482 | N: 1.4856 | E: 0.8224 | KL(1.00×0.5):
0.0803
1697.4s 168 Train E47: 0%| | 0/25 [00:00<?, ?batch/s]
Train E47: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5041, E=0.8229, KL=0.0790,
wKL=1.0000]
Train E47: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5041, E=0.8229,
KL=0.0790, wKL=1.0000]
Train E47: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4062, E=0.8228,
KL=0.0783, wKL=1.0000]
Train E47: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4062, E=0.8228,
KL=0.0783, wKL=1.0000]
Train E47: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.5236, E=0.8230,
KL=0.0788, wKL=1.0000]
Train E47: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5236, E=0.8230,
KL=0.0788, wKL=1.0000]
Train E47: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.3876, E=0.8250,
KL=0.0786, wKL=1.0000]
Train E47: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.3876, E=0.8250,
KL=0.0786, wKL=1.0000]
Train E47: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4904, E=0.8264,
KL=0.0785, wKL=1.0000]
Train E47: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4904, E=0.8264,
KL=0.0785, wKL=1.0000]
Train E47: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4356, E=0.8172,
KL=0.0777, wKL=1.0000]
Train E47: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4356, E=0.8172,
KL=0.0777, wKL=1.0000]
Train E47: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5207, E=0.8126,
KL=0.0784, wKL=1.0000]
Train E47: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.5207, E=0.8126,
KL=0.0784, wKL=1.0000]
Train E47: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4146, E=0.8214,
KL=0.0768, wKL=1.0000]
Train E47: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4146, E=0.8214,
KL=0.0768, wKL=1.0000]
Train E47: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5255, E=0.8271,
KL=0.0762, wKL=1.0000]
Train E47: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.5255, E=0.8271,
KL=0.0762, wKL=1.0000]
Train E47: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4402, E=0.8255,
KL=0.0775, wKL=1.0000]
Train E47: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4402, E=0.8255,
KL=0.0775, wKL=1.0000]
Train E47: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5578, E=0.8218,
KL=0.0780, wKL=1.0000]
Train E47: 44%|████▍ | 11/25 [00:15<00:19, 1.37s/batch, N=1.5578, E=0.8218,
KL=0.0780, wKL=1.0000]
Train E47: 44%|████▍ | 11/25 [00:16<00:19, 1.37s/batch, N=1.4389, E=0.8236,
KL=0.0778, wKL=1.0000]
Train E47: 48%|████▊ | 12/25 [00:16<00:17, 1.37s/batch, N=1.4389, E=0.8236,
KL=0.0778, wKL=1.0000]
Train E47: 48%|████▊ | 12/25 [00:17<00:17, 1.37s/batch, N=1.4299, E=0.8229,
KL=0.0773, wKL=1.0000]
Train E47: 52%|█████▏ | 13/25 [00:17<00:16, 1.40s/batch, N=1.4299, E=0.8229,
KL=0.0773, wKL=1.0000]
Train E47: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5222, E=0.8264,
KL=0.0771, wKL=1.0000]
Train E47: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.5222, E=0.8264,
KL=0.0771, wKL=1.0000]
Train E47: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4223, E=0.8198,
KL=0.0779, wKL=1.0000]
Train E47: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.4223, E=0.8198,
KL=0.0779, wKL=1.0000]
Train E47: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5693, E=0.8243,
KL=0.0800, wKL=1.0000]
Train E47: 64%|██████▍ | 16/25 [00:22<00:13, 1.47s/batch, N=1.5693, E=0.8243,
KL=0.0800, wKL=1.0000]
Train E47: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5303, E=0.8221,
KL=0.0770, wKL=1.0000]
Train E47: 68%|██████▊ | 17/25 [00:24<00:13, 1.63s/batch, N=1.5303, E=0.8221,
KL=0.0770, wKL=1.0000]
Train E47: 68%|██████▊ | 17/25 [00:25<00:13, 1.63s/batch, N=1.4786, E=0.8264,
KL=0.0772, wKL=1.0000]
Train E47: 72%|███████▏ | 18/25 [00:25<00:10, 1.56s/batch, N=1.4786, E=0.8264,
KL=0.0772, wKL=1.0000]
Train E47: 72%|███████▏ | 18/25 [00:27<00:10, 1.56s/batch, N=1.5342, E=0.8165,
KL=0.0779, wKL=1.0000]
Train E47: 76%|███████▌ | 19/25 [00:27<00:09, 1.50s/batch, N=1.5342, E=0.8165,
KL=0.0779, wKL=1.0000]
Train E47: 76%|███████▌ | 19/25 [00:28<00:09, 1.50s/batch, N=1.5173, E=0.8201,
KL=0.0778, wKL=1.0000]
Train E47: 80%|████████ | 20/25 [00:28<00:07, 1.47s/batch, N=1.5173, E=0.8201,
KL=0.0778, wKL=1.0000]
Train E47: 80%|████████ | 20/25 [00:29<00:07, 1.47s/batch, N=1.4978, E=0.8211,
KL=0.0773, wKL=1.0000]
Train E47: 84%|████████▍ | 21/25 [00:29<00:05, 1.44s/batch, N=1.4978, E=0.8211,
KL=0.0773, wKL=1.0000]
Train E47: 84%|████████▍ | 21/25 [00:31<00:05, 1.44s/batch, N=1.4767, E=0.8249,
KL=0.0755, wKL=1.0000]
Train E47: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.4767, E=0.8249,
KL=0.0755, wKL=1.0000]
Train E47: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5094, E=0.8205,
KL=0.0748, wKL=1.0000]
Train E47: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5094, E=0.8205,
KL=0.0748, wKL=1.0000]
Train E47: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5368, E=0.8225,
KL=0.0760, wKL=1.0000]
Train E47: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5368, E=0.8225,
KL=0.0760, wKL=1.0000]
Train E47: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4317, E=0.8261,
KL=0.0752, wKL=1.0000]
Train E47: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4317, E=0.8261,
KL=0.0752, wKL=1.0000]
Train E47: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4317, E=0.8261,
KL=0.0752, wKL=1.0000]
1697.4s 169 [Epoch 047] Total: 2.3465 | N: 1.4853 | E: 0.8224 | KL(1.00×0.5):
0.0775
1732.1s 170 Train E48: 0%| | 0/25 [00:00<?, ?batch/s]
Train E48: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4546, E=0.8230, KL=0.0749,
wKL=1.0000]
Train E48: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4546, E=0.8230,
KL=0.0749, wKL=1.0000]
Train E48: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5069, E=0.8200,
KL=0.0758, wKL=1.0000]
Train E48: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5069, E=0.8200,
KL=0.0758, wKL=1.0000]
Train E48: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4519, E=0.8235,
KL=0.0762, wKL=1.0000]
Train E48: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4519, E=0.8235,
KL=0.0762, wKL=1.0000]
Train E48: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5137, E=0.8239,
KL=0.0764, wKL=1.0000]
Train E48: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5137, E=0.8239,
KL=0.0764, wKL=1.0000]
Train E48: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5730, E=0.8238,
KL=0.0769, wKL=1.0000]
Train E48: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5730, E=0.8238,
KL=0.0769, wKL=1.0000]
Train E48: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4092, E=0.8225,
KL=0.0763, wKL=1.0000]
Train E48: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.4092, E=0.8225,
KL=0.0763, wKL=1.0000]
Train E48: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.4801, E=0.8253,
KL=0.0755, wKL=1.0000]
Train E48: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4801, E=0.8253,
KL=0.0755, wKL=1.0000]
Train E48: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4735, E=0.8235,
KL=0.0765, wKL=1.0000]
Train E48: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.4735, E=0.8235,
KL=0.0765, wKL=1.0000]
Train E48: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.5277, E=0.8214,
KL=0.0778, wKL=1.0000]
Train E48: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.5277, E=0.8214,
KL=0.0778, wKL=1.0000]
Train E48: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4368, E=0.8221,
KL=0.0733, wKL=1.0000]
Train E48: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4368, E=0.8221,
KL=0.0733, wKL=1.0000]
Train E48: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4703, E=0.8245,
KL=0.0745, wKL=1.0000]
Train E48: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4703, E=0.8245,
KL=0.0745, wKL=1.0000]
Train E48: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4519, E=0.8240,
KL=0.0728, wKL=1.0000]
Train E48: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.4519, E=0.8240,
KL=0.0728, wKL=1.0000]
Train E48: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5084, E=0.8214,
KL=0.0736, wKL=1.0000]
Train E48: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.5084, E=0.8214,
KL=0.0736, wKL=1.0000]
Train E48: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4798, E=0.8219,
KL=0.0738, wKL=1.0000]
Train E48: 56%|█████▌ | 14/25 [00:19<00:15, 1.43s/batch, N=1.4798, E=0.8219,
KL=0.0738, wKL=1.0000]
Train E48: 56%|█████▌ | 14/25 [00:20<00:15, 1.43s/batch, N=1.4692, E=0.8182,
KL=0.0735, wKL=1.0000]
Train E48: 60%|██████ | 15/25 [00:20<00:14, 1.40s/batch, N=1.4692, E=0.8182,
KL=0.0735, wKL=1.0000]
Train E48: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.5161, E=0.8219,
KL=0.0728, wKL=1.0000]
Train E48: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5161, E=0.8219,
KL=0.0728, wKL=1.0000]
Train E48: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4922, E=0.8257,
KL=0.0728, wKL=1.0000]
Train E48: 68%|██████▊ | 17/25 [00:23<00:11, 1.42s/batch, N=1.4922, E=0.8257,
KL=0.0728, wKL=1.0000]
Train E48: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4482, E=0.8237,
KL=0.0725, wKL=1.0000]
Train E48: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4482, E=0.8237,
KL=0.0725, wKL=1.0000]
Train E48: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.5042, E=0.8226,
KL=0.0728, wKL=1.0000]
Train E48: 76%|███████▌ | 19/25 [00:27<00:09, 1.60s/batch, N=1.5042, E=0.8226,
KL=0.0728, wKL=1.0000]
Train E48: 76%|███████▌ | 19/25 [00:28<00:09, 1.60s/batch, N=1.4764, E=0.8249,
KL=0.0731, wKL=1.0000]
Train E48: 80%|████████ | 20/25 [00:28<00:07, 1.53s/batch, N=1.4764, E=0.8249,
KL=0.0731, wKL=1.0000]
Train E48: 80%|████████ | 20/25 [00:29<00:07, 1.53s/batch, N=1.4676, E=0.8211,
KL=0.0728, wKL=1.0000]
Train E48: 84%|████████▍ | 21/25 [00:29<00:05, 1.50s/batch, N=1.4676, E=0.8211,
KL=0.0728, wKL=1.0000]
Train E48: 84%|████████▍ | 21/25 [00:31<00:05, 1.50s/batch, N=1.4947, E=0.8231,
KL=0.0716, wKL=1.0000]
Train E48: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4947, E=0.8231,
KL=0.0716, wKL=1.0000]
Train E48: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.5155, E=0.8196,
KL=0.0756, wKL=1.0000]
Train E48: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.5155, E=0.8196,
KL=0.0756, wKL=1.0000]
Train E48: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.5301, E=0.8187,
KL=0.0733, wKL=1.0000]
Train E48: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5301, E=0.8187,
KL=0.0733, wKL=1.0000]
Train E48: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4765, E=0.8208,
KL=0.0715, wKL=1.0000]
Train E48: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.4765, E=0.8208,
KL=0.0715, wKL=1.0000]
Train E48: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4765, E=0.8208,
KL=0.0715, wKL=1.0000]
1732.1s 171 [Epoch 048] Total: 2.3450 | N: 1.4854 | E: 0.8225 | KL(1.00×0.5):
0.0743
1767.0s 172 Train E49: 0%| | 0/25 [00:00<?, ?batch/s]
Train E49: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4446, E=0.8252, KL=0.0722,
wKL=1.0000]
Train E49: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4446, E=0.8252,
KL=0.0722, wKL=1.0000]
Train E49: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5524, E=0.8241,
KL=0.0718, wKL=1.0000]
Train E49: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5524, E=0.8241,
KL=0.0718, wKL=1.0000]
Train E49: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.5319, E=0.8233,
KL=0.0727, wKL=1.0000]
Train E49: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5319, E=0.8233,
KL=0.0727, wKL=1.0000]
Train E49: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5242, E=0.8232,
KL=0.0719, wKL=1.0000]
Train E49: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.5242, E=0.8232,
KL=0.0719, wKL=1.0000]
Train E49: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.4718, E=0.8235,
KL=0.0703, wKL=1.0000]
Train E49: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4718, E=0.8235,
KL=0.0703, wKL=1.0000]
Train E49: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5581, E=0.8252,
KL=0.0707, wKL=1.0000]
Train E49: 24%|██▍ | 6/25 [00:08<00:25, 1.36s/batch, N=1.5581, E=0.8252,
KL=0.0707, wKL=1.0000]
Train E49: 24%|██▍ | 6/25 [00:09<00:25, 1.36s/batch, N=1.4666, E=0.8218,
KL=0.0705, wKL=1.0000]
Train E49: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4666, E=0.8218,
KL=0.0705, wKL=1.0000]
Train E49: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4446, E=0.8248,
KL=0.0686, wKL=1.0000]
Train E49: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4446, E=0.8248,
KL=0.0686, wKL=1.0000]
Train E49: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5240, E=0.8243,
KL=0.0712, wKL=1.0000]
Train E49: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.5240, E=0.8243,
KL=0.0712, wKL=1.0000]
Train E49: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5354, E=0.8227,
KL=0.0694, wKL=1.0000]
Train E49: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.5354, E=0.8227,
KL=0.0694, wKL=1.0000]
Train E49: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.3782, E=0.8197,
KL=0.0694, wKL=1.0000]
Train E49: 44%|████▍ | 11/25 [00:15<00:20, 1.45s/batch, N=1.3782, E=0.8197,
KL=0.0694, wKL=1.0000]
Train E49: 44%|████▍ | 11/25 [00:16<00:20, 1.45s/batch, N=1.5673, E=0.8279,
KL=0.0697, wKL=1.0000]
Train E49: 48%|████▊ | 12/25 [00:16<00:18, 1.44s/batch, N=1.5673, E=0.8279,
KL=0.0697, wKL=1.0000]
Train E49: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.3759, E=0.8246,
KL=0.0693, wKL=1.0000]
Train E49: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.3759, E=0.8246,
KL=0.0693, wKL=1.0000]
Train E49: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.5125, E=0.8227,
KL=0.0704, wKL=1.0000]
Train E49: 56%|█████▌ | 14/25 [00:19<00:15, 1.44s/batch, N=1.5125, E=0.8227,
KL=0.0704, wKL=1.0000]
Train E49: 56%|█████▌ | 14/25 [00:21<00:15, 1.44s/batch, N=1.4641, E=0.8244,
KL=0.0685, wKL=1.0000]
Train E49: 60%|██████ | 15/25 [00:21<00:14, 1.42s/batch, N=1.4641, E=0.8244,
KL=0.0685, wKL=1.0000]
Train E49: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.5103, E=0.8148,
KL=0.0710, wKL=1.0000]
Train E49: 64%|██████▍ | 16/25 [00:22<00:12, 1.41s/batch, N=1.5103, E=0.8148,
KL=0.0710, wKL=1.0000]
Train E49: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4939, E=0.8181,
KL=0.0697, wKL=1.0000]
Train E49: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4939, E=0.8181,
KL=0.0697, wKL=1.0000]
Train E49: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4437, E=0.8232,
KL=0.0683, wKL=1.0000]
Train E49: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4437, E=0.8232,
KL=0.0683, wKL=1.0000]
Train E49: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4717, E=0.8259,
KL=0.0691, wKL=1.0000]
Train E49: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4717, E=0.8259,
KL=0.0691, wKL=1.0000]
Train E49: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.5957, E=0.8254,
KL=0.0716, wKL=1.0000]
Train E49: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.5957, E=0.8254,
KL=0.0716, wKL=1.0000]
Train E49: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.3224, E=0.8180,
KL=0.0671, wKL=1.0000]
Train E49: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.3224, E=0.8180,
KL=0.0671, wKL=1.0000]
Train E49: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.5174, E=0.8230,
KL=0.0697, wKL=1.0000]
Train E49: 88%|████████▊ | 22/25 [00:30<00:04, 1.39s/batch, N=1.5174, E=0.8230,
KL=0.0697, wKL=1.0000]
Train E49: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4841, E=0.8243,
KL=0.0677, wKL=1.0000]
Train E49: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4841, E=0.8243,
KL=0.0677, wKL=1.0000]
Train E49: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4632, E=0.8186,
KL=0.0672, wKL=1.0000]
Train E49: 96%|█████████▌| 24/25 [00:34<00:01, 1.58s/batch, N=1.4632, E=0.8186,
KL=0.0672, wKL=1.0000]
Train E49: 96%|█████████▌| 24/25 [00:34<00:01, 1.58s/batch, N=1.4540, E=0.8214,
KL=0.0676, wKL=1.0000]
Train E49: 100%|██████████| 25/25 [00:34<00:00, 1.29s/batch, N=1.4540, E=0.8214,
KL=0.0676, wKL=1.0000]
Train E49: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4540, E=0.8214,
KL=0.0676, wKL=1.0000]
1767.0s 173 [Epoch 049] Total: 2.3428 | N: 1.4850 | E: 0.8228 | KL(1.00×0.5):
0.0699
1801.1s 174 Train E50: 0%| | 0/25 [00:00<?, ?batch/s]
Train E50: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5561, E=0.8204, KL=0.0672,
wKL=1.0000]
Train E50: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5561, E=0.8204,
KL=0.0672, wKL=1.0000]
Train E50: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4961, E=0.8182,
KL=0.0662, wKL=1.0000]
Train E50: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.4961, E=0.8182,
KL=0.0662, wKL=1.0000]
Train E50: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4259, E=0.8215,
KL=0.0659, wKL=1.0000]
Train E50: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4259, E=0.8215,
KL=0.0659, wKL=1.0000]
Train E50: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4951, E=0.8241,
KL=0.0660, wKL=1.0000]
Train E50: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4951, E=0.8241,
KL=0.0660, wKL=1.0000]
Train E50: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4179, E=0.8273,
KL=0.0643, wKL=1.0000]
Train E50: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4179, E=0.8273,
KL=0.0643, wKL=1.0000]
Train E50: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4940, E=0.8202,
KL=0.0660, wKL=1.0000]
Train E50: 24%|██▍ | 6/25 [00:08<00:26, 1.37s/batch, N=1.4940, E=0.8202,
KL=0.0660, wKL=1.0000]
Train E50: 24%|██▍ | 6/25 [00:09<00:26, 1.37s/batch, N=1.4698, E=0.8197,
KL=0.0655, wKL=1.0000]
Train E50: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4698, E=0.8197,
KL=0.0655, wKL=1.0000]
Train E50: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4582, E=0.8228,
KL=0.0664, wKL=1.0000]
Train E50: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4582, E=0.8228,
KL=0.0664, wKL=1.0000]
Train E50: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.5089, E=0.8228,
KL=0.0667, wKL=1.0000]
Train E50: 36%|███▌ | 9/25 [00:12<00:23, 1.44s/batch, N=1.5089, E=0.8228,
KL=0.0667, wKL=1.0000]
Train E50: 36%|███▌ | 9/25 [00:14<00:23, 1.44s/batch, N=1.4875, E=0.8236,
KL=0.0669, wKL=1.0000]
Train E50: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.4875, E=0.8236,
KL=0.0669, wKL=1.0000]
Train E50: 40%|████ | 10/25 [00:15<00:21, 1.44s/batch, N=1.5300, E=0.8237,
KL=0.0678, wKL=1.0000]
Train E50: 44%|████▍ | 11/25 [00:15<00:20, 1.43s/batch, N=1.5300, E=0.8237,
KL=0.0678, wKL=1.0000]
Train E50: 44%|████▍ | 11/25 [00:16<00:20, 1.43s/batch, N=1.4916, E=0.8245,
KL=0.0674, wKL=1.0000]
Train E50: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4916, E=0.8245,
KL=0.0674, wKL=1.0000]
Train E50: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.4580, E=0.8196,
KL=0.0668, wKL=1.0000]
Train E50: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.4580, E=0.8196,
KL=0.0668, wKL=1.0000]
Train E50: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4585, E=0.8247,
KL=0.0662, wKL=1.0000]
Train E50: 56%|█████▌ | 14/25 [00:19<00:15, 1.40s/batch, N=1.4585, E=0.8247,
KL=0.0662, wKL=1.0000]
Train E50: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4893, E=0.8228,
KL=0.0665, wKL=1.0000]
Train E50: 60%|██████ | 15/25 [00:20<00:13, 1.39s/batch, N=1.4893, E=0.8228,
KL=0.0665, wKL=1.0000]
Train E50: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.5102, E=0.8212,
KL=0.0670, wKL=1.0000]
Train E50: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5102, E=0.8212,
KL=0.0670, wKL=1.0000]
Train E50: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5306, E=0.8231,
KL=0.0663, wKL=1.0000]
Train E50: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.5306, E=0.8231,
KL=0.0663, wKL=1.0000]
Train E50: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4957, E=0.8194,
KL=0.0654, wKL=1.0000]
Train E50: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.4957, E=0.8194,
KL=0.0654, wKL=1.0000]
Train E50: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4519, E=0.8215,
KL=0.0645, wKL=1.0000]
Train E50: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4519, E=0.8215,
KL=0.0645, wKL=1.0000]
Train E50: 76%|███████▌ | 19/25 [00:27<00:08, 1.39s/batch, N=1.5363, E=0.8255,
KL=0.0645, wKL=1.0000]
Train E50: 80%|████████ | 20/25 [00:27<00:06, 1.39s/batch, N=1.5363, E=0.8255,
KL=0.0645, wKL=1.0000]
Train E50: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.5143, E=0.8225,
KL=0.0664, wKL=1.0000]
Train E50: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5143, E=0.8225,
KL=0.0664, wKL=1.0000]
Train E50: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.3579, E=0.8151,
KL=0.0641, wKL=1.0000]
Train E50: 88%|████████▊ | 22/25 [00:30<00:04, 1.38s/batch, N=1.3579, E=0.8151,
KL=0.0641, wKL=1.0000]
Train E50: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.4420, E=0.8223,
KL=0.0641, wKL=1.0000]
Train E50: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.4420, E=0.8223,
KL=0.0641, wKL=1.0000]
Train E50: 92%|█████████▏| 23/25 [00:33<00:02, 1.38s/batch, N=1.5206, E=0.8216,
KL=0.0642, wKL=1.0000]
Train E50: 96%|█████████▌| 24/25 [00:33<00:01, 1.39s/batch, N=1.5206, E=0.8216,
KL=0.0642, wKL=1.0000]
Train E50: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5017, E=0.8153,
KL=0.0661, wKL=1.0000]
Train E50: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5017, E=0.8153,
KL=0.0661, wKL=1.0000]
Train E50: 100%|██████████| 25/25 [00:34<00:00, 1.36s/batch, N=1.5017, E=0.8153,
KL=0.0661, wKL=1.0000]
1801.1s 175 [Epoch 050] Total: 2.3384 | N: 1.4835 | E: 0.8219 | KL(1.00×0.5):
0.0659
1801.1s 176 Saved checkpoint: /kaggle/working/checkpoints/gvae_50_epoch050.pt
1835.6s 177 Train E51: 0%| | 0/25 [00:00<?, ?batch/s]
Train E51: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4596, E=0.8239, KL=0.0659,
wKL=1.0000]
Train E51: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4596, E=0.8239,
KL=0.0659, wKL=1.0000]
Train E51: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5342, E=0.8252,
KL=0.0650, wKL=1.0000]
Train E51: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.5342, E=0.8252,
KL=0.0650, wKL=1.0000]
Train E51: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.4610, E=0.8193,
KL=0.0647, wKL=1.0000]
Train E51: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4610, E=0.8193,
KL=0.0647, wKL=1.0000]
Train E51: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4688, E=0.8237,
KL=0.0647, wKL=1.0000]
Train E51: 16%|█▌ | 4/25 [00:05<00:28, 1.34s/batch, N=1.4688, E=0.8237,
KL=0.0647, wKL=1.0000]
Train E51: 16%|█▌ | 4/25 [00:07<00:28, 1.34s/batch, N=1.5779, E=0.8233,
KL=0.0662, wKL=1.0000]
Train E51: 20%|██ | 5/25 [00:07<00:31, 1.57s/batch, N=1.5779, E=0.8233,
KL=0.0662, wKL=1.0000]
Train E51: 20%|██ | 5/25 [00:08<00:31, 1.57s/batch, N=1.4651, E=0.8290,
KL=0.0662, wKL=1.0000]
Train E51: 24%|██▍ | 6/25 [00:08<00:28, 1.50s/batch, N=1.4651, E=0.8290,
KL=0.0662, wKL=1.0000]
Train E51: 24%|██▍ | 6/25 [00:10<00:28, 1.50s/batch, N=1.4969, E=0.8176,
KL=0.0658, wKL=1.0000]
Train E51: 28%|██▊ | 7/25 [00:10<00:27, 1.53s/batch, N=1.4969, E=0.8176,
KL=0.0658, wKL=1.0000]
Train E51: 28%|██▊ | 7/25 [00:11<00:27, 1.53s/batch, N=1.4891, E=0.8194,
KL=0.0653, wKL=1.0000]
Train E51: 32%|███▏ | 8/25 [00:11<00:25, 1.49s/batch, N=1.4891, E=0.8194,
KL=0.0653, wKL=1.0000]
Train E51: 32%|███▏ | 8/25 [00:13<00:25, 1.49s/batch, N=1.4327, E=0.8201,
KL=0.0661, wKL=1.0000]
Train E51: 36%|███▌ | 9/25 [00:13<00:23, 1.45s/batch, N=1.4327, E=0.8201,
KL=0.0661, wKL=1.0000]
Train E51: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.5221, E=0.8225,
KL=0.0653, wKL=1.0000]
Train E51: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.5221, E=0.8225,
KL=0.0653, wKL=1.0000]
Train E51: 40%|████ | 10/25 [00:15<00:21, 1.44s/batch, N=1.4690, E=0.8167,
KL=0.0650, wKL=1.0000]
Train E51: 44%|████▍ | 11/25 [00:15<00:19, 1.43s/batch, N=1.4690, E=0.8167,
KL=0.0650, wKL=1.0000]
Train E51: 44%|████▍ | 11/25 [00:17<00:19, 1.43s/batch, N=1.4986, E=0.8261,
KL=0.0640, wKL=1.0000]
Train E51: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4986, E=0.8261,
KL=0.0640, wKL=1.0000]
Train E51: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5334, E=0.8180,
KL=0.0643, wKL=1.0000]
Train E51: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5334, E=0.8180,
KL=0.0643, wKL=1.0000]
Train E51: 52%|█████▏ | 13/25 [00:20<00:16, 1.40s/batch, N=1.4541, E=0.8207,
KL=0.0642, wKL=1.0000]
Train E51: 56%|█████▌ | 14/25 [00:20<00:15, 1.40s/batch, N=1.4541, E=0.8207,
KL=0.0642, wKL=1.0000]
Train E51: 56%|█████▌ | 14/25 [00:21<00:15, 1.40s/batch, N=1.4852, E=0.8202,
KL=0.0640, wKL=1.0000]
Train E51: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4852, E=0.8202,
KL=0.0640, wKL=1.0000]
Train E51: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4519, E=0.8181,
KL=0.0644, wKL=1.0000]
Train E51: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4519, E=0.8181,
KL=0.0644, wKL=1.0000]
Train E51: 64%|██████▍ | 16/25 [00:24<00:12, 1.39s/batch, N=1.4425, E=0.8193,
KL=0.0641, wKL=1.0000]
Train E51: 68%|██████▊ | 17/25 [00:24<00:11, 1.38s/batch, N=1.4425, E=0.8193,
KL=0.0641, wKL=1.0000]
Train E51: 68%|██████▊ | 17/25 [00:25<00:11, 1.38s/batch, N=1.5141, E=0.8253,
KL=0.0633, wKL=1.0000]
Train E51: 72%|███████▏ | 18/25 [00:25<00:09, 1.38s/batch, N=1.5141, E=0.8253,
KL=0.0633, wKL=1.0000]
Train E51: 72%|███████▏ | 18/25 [00:26<00:09, 1.38s/batch, N=1.5034, E=0.8206,
KL=0.0640, wKL=1.0000]
Train E51: 76%|███████▌ | 19/25 [00:26<00:08, 1.38s/batch, N=1.5034, E=0.8206,
KL=0.0640, wKL=1.0000]
Train E51: 76%|███████▌ | 19/25 [00:28<00:08, 1.38s/batch, N=1.5178, E=0.8169,
KL=0.0640, wKL=1.0000]
Train E51: 80%|████████ | 20/25 [00:28<00:06, 1.37s/batch, N=1.5178, E=0.8169,
KL=0.0640, wKL=1.0000]
Train E51: 80%|████████ | 20/25 [00:29<00:06, 1.37s/batch, N=1.4794, E=0.8214,
KL=0.0637, wKL=1.0000]
Train E51: 84%|████████▍ | 21/25 [00:29<00:05, 1.38s/batch, N=1.4794, E=0.8214,
KL=0.0637, wKL=1.0000]
Train E51: 84%|████████▍ | 21/25 [00:31<00:05, 1.38s/batch, N=1.4109, E=0.8204,
KL=0.0632, wKL=1.0000]
Train E51: 88%|████████▊ | 22/25 [00:31<00:04, 1.38s/batch, N=1.4109, E=0.8204,
KL=0.0632, wKL=1.0000]
Train E51: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.3979, E=0.8160,
KL=0.0630, wKL=1.0000]
Train E51: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.3979, E=0.8160,
KL=0.0630, wKL=1.0000]
Train E51: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.5633, E=0.8216,
KL=0.0644, wKL=1.0000]
Train E51: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.5633, E=0.8216,
KL=0.0644, wKL=1.0000]
Train E51: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4130, E=0.8244,
KL=0.0629, wKL=1.0000]
Train E51: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.4130, E=0.8244,
KL=0.0629, wKL=1.0000]
Train E51: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4130, E=0.8244,
KL=0.0629, wKL=1.0000]
1835.6s 178 [Epoch 051] Total: 2.3367 | N: 1.4833 | E: 0.8211 | KL(1.00×0.5):
0.0646
1870.5s 179 Train E52: 0%| | 0/25 [00:00<?, ?batch/s]
Train E52: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4659, E=0.8226, KL=0.0637,
wKL=1.0000]
Train E52: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.4659, E=0.8226,
KL=0.0637, wKL=1.0000]
Train E52: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.5334, E=0.8172,
KL=0.0635, wKL=1.0000]
Train E52: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5334, E=0.8172,
KL=0.0635, wKL=1.0000]
Train E52: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.4983, E=0.8172,
KL=0.0642, wKL=1.0000]
Train E52: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4983, E=0.8172,
KL=0.0642, wKL=1.0000]
Train E52: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4641, E=0.8219,
KL=0.0634, wKL=1.0000]
Train E52: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4641, E=0.8219,
KL=0.0634, wKL=1.0000]
Train E52: 16%|█▌ | 4/25 [00:07<00:29, 1.39s/batch, N=1.4626, E=0.8222,
KL=0.0639, wKL=1.0000]
Train E52: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.4626, E=0.8222,
KL=0.0639, wKL=1.0000]
Train E52: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.4692, E=0.8196,
KL=0.0632, wKL=1.0000]
Train E52: 24%|██▍ | 6/25 [00:08<00:27, 1.45s/batch, N=1.4692, E=0.8196,
KL=0.0632, wKL=1.0000]
Train E52: 24%|██▍ | 6/25 [00:10<00:27, 1.45s/batch, N=1.5219, E=0.8206,
KL=0.0635, wKL=1.0000]
Train E52: 28%|██▊ | 7/25 [00:10<00:29, 1.64s/batch, N=1.5219, E=0.8206,
KL=0.0635, wKL=1.0000]
Train E52: 28%|██▊ | 7/25 [00:11<00:29, 1.64s/batch, N=1.4448, E=0.8178,
KL=0.0643, wKL=1.0000]
Train E52: 32%|███▏ | 8/25 [00:11<00:26, 1.57s/batch, N=1.4448, E=0.8178,
KL=0.0643, wKL=1.0000]
Train E52: 32%|███▏ | 8/25 [00:13<00:26, 1.57s/batch, N=1.4479, E=0.8258,
KL=0.0625, wKL=1.0000]
Train E52: 36%|███▌ | 9/25 [00:13<00:24, 1.50s/batch, N=1.4479, E=0.8258,
KL=0.0625, wKL=1.0000]
Train E52: 36%|███▌ | 9/25 [00:14<00:24, 1.50s/batch, N=1.5461, E=0.8251,
KL=0.0629, wKL=1.0000]
Train E52: 40%|████ | 10/25 [00:14<00:22, 1.47s/batch, N=1.5461, E=0.8251,
KL=0.0629, wKL=1.0000]
Train E52: 40%|████ | 10/25 [00:16<00:22, 1.47s/batch, N=1.5123, E=0.8203,
KL=0.0633, wKL=1.0000]
Train E52: 44%|████▍ | 11/25 [00:16<00:20, 1.47s/batch, N=1.5123, E=0.8203,
KL=0.0633, wKL=1.0000]
Train E52: 44%|████▍ | 11/25 [00:17<00:20, 1.47s/batch, N=1.4959, E=0.8209,
KL=0.0630, wKL=1.0000]
Train E52: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.4959, E=0.8209,
KL=0.0630, wKL=1.0000]
Train E52: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.4674, E=0.8174,
KL=0.0633, wKL=1.0000]
Train E52: 52%|█████▏ | 13/25 [00:18<00:17, 1.43s/batch, N=1.4674, E=0.8174,
KL=0.0633, wKL=1.0000]
Train E52: 52%|█████▏ | 13/25 [00:20<00:17, 1.43s/batch, N=1.4535, E=0.8191,
KL=0.0638, wKL=1.0000]
Train E52: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.4535, E=0.8191,
KL=0.0638, wKL=1.0000]
Train E52: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.4727, E=0.8206,
KL=0.0647, wKL=1.0000]
Train E52: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.4727, E=0.8206,
KL=0.0647, wKL=1.0000]
Train E52: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4882, E=0.8141,
KL=0.0642, wKL=1.0000]
Train E52: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4882, E=0.8141,
KL=0.0642, wKL=1.0000]
Train E52: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4582, E=0.8223,
KL=0.0633, wKL=1.0000]
Train E52: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4582, E=0.8223,
KL=0.0633, wKL=1.0000]
Train E52: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.4778, E=0.8210,
KL=0.0637, wKL=1.0000]
Train E52: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4778, E=0.8210,
KL=0.0637, wKL=1.0000]
Train E52: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.5649, E=0.8214,
KL=0.0632, wKL=1.0000]
Train E52: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5649, E=0.8214,
KL=0.0632, wKL=1.0000]
Train E52: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5156, E=0.8246,
KL=0.0638, wKL=1.0000]
Train E52: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.5156, E=0.8246,
KL=0.0638, wKL=1.0000]
Train E52: 80%|████████ | 20/25 [00:30<00:06, 1.40s/batch, N=1.4987, E=0.8233,
KL=0.0633, wKL=1.0000]
Train E52: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.4987, E=0.8233,
KL=0.0633, wKL=1.0000]
Train E52: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.4377, E=0.8216,
KL=0.0640, wKL=1.0000]
Train E52: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4377, E=0.8216,
KL=0.0640, wKL=1.0000]
Train E52: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4873, E=0.8256,
KL=0.0634, wKL=1.0000]
Train E52: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.4873, E=0.8256,
KL=0.0634, wKL=1.0000]
Train E52: 92%|█████████▏| 23/25 [00:34<00:02, 1.38s/batch, N=1.3629, E=0.8216,
KL=0.0623, wKL=1.0000]
Train E52: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.3629, E=0.8216,
KL=0.0623, wKL=1.0000]
Train E52: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5957, E=0.8207,
KL=0.0636, wKL=1.0000]
Train E52: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.5957, E=0.8207,
KL=0.0636, wKL=1.0000]
Train E52: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5957, E=0.8207,
KL=0.0636, wKL=1.0000]
1870.5s 180 [Epoch 052] Total: 2.3358 | N: 1.4831 | E: 0.8210 | KL(1.00×0.5):
0.0635
1905.6s 181 Train E53: 0%| | 0/25 [00:00<?, ?batch/s]
Train E53: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4278, E=0.8251, KL=0.0628,
wKL=1.0000]
Train E53: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.4278, E=0.8251,
KL=0.0628, wKL=1.0000]
Train E53: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.4902, E=0.8207,
KL=0.0623, wKL=1.0000]
Train E53: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.4902, E=0.8207,
KL=0.0623, wKL=1.0000]
Train E53: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.5219, E=0.8184,
KL=0.0624, wKL=1.0000]
Train E53: 12%|█▏ | 3/25 [00:04<00:32, 1.49s/batch, N=1.5219, E=0.8184,
KL=0.0624, wKL=1.0000]
Train E53: 12%|█▏ | 3/25 [00:05<00:32, 1.49s/batch, N=1.5057, E=0.8212,
KL=0.0616, wKL=1.0000]
Train E53: 16%|█▌ | 4/25 [00:05<00:30, 1.44s/batch, N=1.5057, E=0.8212,
KL=0.0616, wKL=1.0000]
Train E53: 16%|█▌ | 4/25 [00:07<00:30, 1.44s/batch, N=1.4231, E=0.8216,
KL=0.0610, wKL=1.0000]
Train E53: 20%|██ | 5/25 [00:07<00:28, 1.41s/batch, N=1.4231, E=0.8216,
KL=0.0610, wKL=1.0000]
Train E53: 20%|██ | 5/25 [00:08<00:28, 1.41s/batch, N=1.4501, E=0.8189,
KL=0.0625, wKL=1.0000]
Train E53: 24%|██▍ | 6/25 [00:08<00:26, 1.41s/batch, N=1.4501, E=0.8189,
KL=0.0625, wKL=1.0000]
Train E53: 24%|██▍ | 6/25 [00:09<00:26, 1.41s/batch, N=1.4824, E=0.8223,
KL=0.0606, wKL=1.0000]
Train E53: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4824, E=0.8223,
KL=0.0606, wKL=1.0000]
Train E53: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4502, E=0.8260,
KL=0.0622, wKL=1.0000]
Train E53: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.4502, E=0.8260,
KL=0.0622, wKL=1.0000]
Train E53: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.4644, E=0.8243,
KL=0.0616, wKL=1.0000]
Train E53: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4644, E=0.8243,
KL=0.0616, wKL=1.0000]
Train E53: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.5173, E=0.8211,
KL=0.0618, wKL=1.0000]
Train E53: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.5173, E=0.8211,
KL=0.0618, wKL=1.0000]
Train E53: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4500, E=0.8231,
KL=0.0623, wKL=1.0000]
Train E53: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4500, E=0.8231,
KL=0.0623, wKL=1.0000]
Train E53: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.5075, E=0.8144,
KL=0.0623, wKL=1.0000]
Train E53: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5075, E=0.8144,
KL=0.0623, wKL=1.0000]
Train E53: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4229, E=0.8179,
KL=0.0625, wKL=1.0000]
Train E53: 52%|█████▏ | 13/25 [00:18<00:19, 1.59s/batch, N=1.4229, E=0.8179,
KL=0.0625, wKL=1.0000]
Train E53: 52%|█████▏ | 13/25 [00:20<00:19, 1.59s/batch, N=1.5100, E=0.8202,
KL=0.0621, wKL=1.0000]
Train E53: 56%|█████▌ | 14/25 [00:20<00:16, 1.54s/batch, N=1.5100, E=0.8202,
KL=0.0621, wKL=1.0000]
Train E53: 56%|█████▌ | 14/25 [00:21<00:16, 1.54s/batch, N=1.5266, E=0.8181,
KL=0.0621, wKL=1.0000]
Train E53: 60%|██████ | 15/25 [00:21<00:14, 1.49s/batch, N=1.5266, E=0.8181,
KL=0.0621, wKL=1.0000]
Train E53: 60%|██████ | 15/25 [00:23<00:14, 1.49s/batch, N=1.4419, E=0.8183,
KL=0.0629, wKL=1.0000]
Train E53: 64%|██████▍ | 16/25 [00:23<00:13, 1.47s/batch, N=1.4419, E=0.8183,
KL=0.0629, wKL=1.0000]
Train E53: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5207, E=0.8210,
KL=0.0617, wKL=1.0000]
Train E53: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5207, E=0.8210,
KL=0.0617, wKL=1.0000]
Train E53: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4742, E=0.8171,
KL=0.0628, wKL=1.0000]
Train E53: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.4742, E=0.8171,
KL=0.0628, wKL=1.0000]
Train E53: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.4699, E=0.8203,
KL=0.0625, wKL=1.0000]
Train E53: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.4699, E=0.8203,
KL=0.0625, wKL=1.0000]
Train E53: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5714, E=0.8255,
KL=0.0630, wKL=1.0000]
Train E53: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.5714, E=0.8255,
KL=0.0630, wKL=1.0000]
Train E53: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.4449, E=0.8193,
KL=0.0631, wKL=1.0000]
Train E53: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4449, E=0.8193,
KL=0.0631, wKL=1.0000]
Train E53: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.4569, E=0.8236,
KL=0.0623, wKL=1.0000]
Train E53: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4569, E=0.8236,
KL=0.0623, wKL=1.0000]
Train E53: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.4723, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E53: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.4723, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E53: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.5997, E=0.8220,
KL=0.0630, wKL=1.0000]
Train E53: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5997, E=0.8220,
KL=0.0630, wKL=1.0000]
Train E53: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.4308, E=0.8287,
KL=0.0619, wKL=1.0000]
Train E53: 100%|██████████| 25/25 [00:35<00:00, 1.20s/batch, N=1.4308, E=0.8287,
KL=0.0619, wKL=1.0000]
Train E53: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4308, E=0.8287,
KL=0.0619, wKL=1.0000]
1905.6s 182 [Epoch 053] Total: 2.3348 | N: 1.4825 | E: 0.8211 | KL(1.00×0.5):
0.0623
1940.8s 183 Train E54: 0%| | 0/25 [00:00<?, ?batch/s]
Train E54: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4421, E=0.8257, KL=0.0602,
wKL=1.0000]
Train E54: 4%|▍ | 1/25 [00:01<00:35, 1.47s/batch, N=1.4421, E=0.8257,
KL=0.0602, wKL=1.0000]
Train E54: 4%|▍ | 1/25 [00:02<00:35, 1.47s/batch, N=1.3872, E=0.8171,
KL=0.0609, wKL=1.0000]
Train E54: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.3872, E=0.8171,
KL=0.0609, wKL=1.0000]
Train E54: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.5473, E=0.8245,
KL=0.0618, wKL=1.0000]
Train E54: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.5473, E=0.8245,
KL=0.0618, wKL=1.0000]
Train E54: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.5784, E=0.8198,
KL=0.0613, wKL=1.0000]
Train E54: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.5784, E=0.8198,
KL=0.0613, wKL=1.0000]
Train E54: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4545, E=0.8201,
KL=0.0611, wKL=1.0000]
Train E54: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4545, E=0.8201,
KL=0.0611, wKL=1.0000]
Train E54: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5236, E=0.8230,
KL=0.0627, wKL=1.0000]
Train E54: 24%|██▍ | 6/25 [00:08<00:25, 1.37s/batch, N=1.5236, E=0.8230,
KL=0.0627, wKL=1.0000]
Train E54: 24%|██▍ | 6/25 [00:09<00:25, 1.37s/batch, N=1.4935, E=0.8242,
KL=0.0612, wKL=1.0000]
Train E54: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4935, E=0.8242,
KL=0.0612, wKL=1.0000]
Train E54: 28%|██▊ | 7/25 [00:11<00:24, 1.37s/batch, N=1.4967, E=0.8218,
KL=0.0632, wKL=1.0000]
Train E54: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4967, E=0.8218,
KL=0.0632, wKL=1.0000]
Train E54: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4116, E=0.8214,
KL=0.0614, wKL=1.0000]
Train E54: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4116, E=0.8214,
KL=0.0614, wKL=1.0000]
Train E54: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4566, E=0.8238,
KL=0.0613, wKL=1.0000]
Train E54: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4566, E=0.8238,
KL=0.0613, wKL=1.0000]
Train E54: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4585, E=0.8250,
KL=0.0618, wKL=1.0000]
Train E54: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4585, E=0.8250,
KL=0.0618, wKL=1.0000]
Train E54: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4688, E=0.8234,
KL=0.0623, wKL=1.0000]
Train E54: 48%|████▊ | 12/25 [00:16<00:18, 1.42s/batch, N=1.4688, E=0.8234,
KL=0.0623, wKL=1.0000]
Train E54: 48%|████▊ | 12/25 [00:18<00:18, 1.42s/batch, N=1.4281, E=0.8225,
KL=0.0605, wKL=1.0000]
Train E54: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4281, E=0.8225,
KL=0.0605, wKL=1.0000]
Train E54: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.4751, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E54: 56%|█████▌ | 14/25 [00:19<00:15, 1.44s/batch, N=1.4751, E=0.8240,
KL=0.0633, wKL=1.0000]
Train E54: 56%|█████▌ | 14/25 [00:21<00:15, 1.44s/batch, N=1.4879, E=0.8194,
KL=0.0605, wKL=1.0000]
Train E54: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.4879, E=0.8194,
KL=0.0605, wKL=1.0000]
Train E54: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.5082, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 64%|██████▍ | 16/25 [00:23<00:14, 1.60s/batch, N=1.5082, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 64%|██████▍ | 16/25 [00:24<00:14, 1.60s/batch, N=1.4793, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 68%|██████▊ | 17/25 [00:24<00:12, 1.54s/batch, N=1.4793, E=0.8198,
KL=0.0607, wKL=1.0000]
Train E54: 68%|██████▊ | 17/25 [00:25<00:12, 1.54s/batch, N=1.5570, E=0.8159,
KL=0.0620, wKL=1.0000]
Train E54: 72%|███████▏ | 18/25 [00:25<00:10, 1.52s/batch, N=1.5570, E=0.8159,
KL=0.0620, wKL=1.0000]
Train E54: 72%|███████▏ | 18/25 [00:27<00:10, 1.52s/batch, N=1.4193, E=0.8215,
KL=0.0597, wKL=1.0000]
Train E54: 76%|███████▌ | 19/25 [00:27<00:08, 1.49s/batch, N=1.4193, E=0.8215,
KL=0.0597, wKL=1.0000]
Train E54: 76%|███████▌ | 19/25 [00:28<00:08, 1.49s/batch, N=1.4836, E=0.8162,
KL=0.0612, wKL=1.0000]
Train E54: 80%|████████ | 20/25 [00:28<00:07, 1.46s/batch, N=1.4836, E=0.8162,
KL=0.0612, wKL=1.0000]
Train E54: 80%|████████ | 20/25 [00:30<00:07, 1.46s/batch, N=1.5155, E=0.8202,
KL=0.0619, wKL=1.0000]
Train E54: 84%|████████▍ | 21/25 [00:30<00:05, 1.45s/batch, N=1.5155, E=0.8202,
KL=0.0619, wKL=1.0000]
Train E54: 84%|████████▍ | 21/25 [00:31<00:05, 1.45s/batch, N=1.5202, E=0.8242,
KL=0.0614, wKL=1.0000]
Train E54: 88%|████████▊ | 22/25 [00:31<00:04, 1.48s/batch, N=1.5202, E=0.8242,
KL=0.0614, wKL=1.0000]
Train E54: 88%|████████▊ | 22/25 [00:33<00:04, 1.48s/batch, N=1.4819, E=0.8165,
KL=0.0624, wKL=1.0000]
Train E54: 92%|█████████▏| 23/25 [00:33<00:03, 1.50s/batch, N=1.4819, E=0.8165,
KL=0.0624, wKL=1.0000]
Train E54: 92%|█████████▏| 23/25 [00:34<00:03, 1.50s/batch, N=1.5146, E=0.8249,
KL=0.0610, wKL=1.0000]
Train E54: 96%|█████████▌| 24/25 [00:34<00:01, 1.46s/batch, N=1.5146, E=0.8249,
KL=0.0610, wKL=1.0000]
Train E54: 96%|█████████▌| 24/25 [00:35<00:01, 1.46s/batch, N=1.4522, E=0.8171,
KL=0.0613, wKL=1.0000]
Train E54: 100%|██████████| 25/25 [00:35<00:00, 1.22s/batch, N=1.4522, E=0.8171,
KL=0.0613, wKL=1.0000]
Train E54: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4522, E=0.8171,
KL=0.0613, wKL=1.0000]
1940.9s 184 [Epoch 054] Total: 2.3345 | N: 1.4824 | E: 0.8214 | KL(1.00×0.5):
0.0614
1975.7s 185 Train E55: 0%| | 0/25 [00:00<?, ?batch/s]
Train E55: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4193, E=0.8191, KL=0.0614,
wKL=1.0000]
Train E55: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4193, E=0.8191,
KL=0.0614, wKL=1.0000]
Train E55: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.4716, E=0.8212,
KL=0.0607, wKL=1.0000]
Train E55: 8%|▊ | 2/25 [00:02<00:30, 1.35s/batch, N=1.4716, E=0.8212,
KL=0.0607, wKL=1.0000]
Train E55: 8%|▊ | 2/25 [00:04<00:30, 1.35s/batch, N=1.5290, E=0.8176,
KL=0.0618, wKL=1.0000]
Train E55: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5290, E=0.8176,
KL=0.0618, wKL=1.0000]
Train E55: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4785, E=0.8136,
KL=0.0613, wKL=1.0000]
Train E55: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.4785, E=0.8136,
KL=0.0613, wKL=1.0000]
Train E55: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.4825, E=0.8174,
KL=0.0608, wKL=1.0000]
Train E55: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4825, E=0.8174,
KL=0.0608, wKL=1.0000]
Train E55: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.3969, E=0.8212,
KL=0.0596, wKL=1.0000]
Train E55: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.3969, E=0.8212,
KL=0.0596, wKL=1.0000]
Train E55: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4467, E=0.8203,
KL=0.0601, wKL=1.0000]
Train E55: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4467, E=0.8203,
KL=0.0601, wKL=1.0000]
Train E55: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.5206, E=0.8188,
KL=0.0620, wKL=1.0000]
Train E55: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5206, E=0.8188,
KL=0.0620, wKL=1.0000]
Train E55: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5845, E=0.8203,
KL=0.0606, wKL=1.0000]
Train E55: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5845, E=0.8203,
KL=0.0606, wKL=1.0000]
Train E55: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4354, E=0.8240,
KL=0.0603, wKL=1.0000]
Train E55: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.4354, E=0.8240,
KL=0.0603, wKL=1.0000]
Train E55: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4740, E=0.8239,
KL=0.0610, wKL=1.0000]
Train E55: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4740, E=0.8239,
KL=0.0610, wKL=1.0000]
Train E55: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.0597, wKL=1.0000]
Train E55: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5007, E=0.8232,
KL=0.0597, wKL=1.0000]
Train E55: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.5621, E=0.8259,
KL=0.0602, wKL=1.0000]
Train E55: 52%|█████▏ | 13/25 [00:17<00:16, 1.38s/batch, N=1.5621, E=0.8259,
KL=0.0602, wKL=1.0000]
Train E55: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.4759, E=0.8193,
KL=0.0604, wKL=1.0000]
Train E55: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4759, E=0.8193,
KL=0.0604, wKL=1.0000]
Train E55: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.4622, E=0.8227,
KL=0.0605, wKL=1.0000]
Train E55: 60%|██████ | 15/25 [00:20<00:13, 1.40s/batch, N=1.4622, E=0.8227,
KL=0.0605, wKL=1.0000]
Train E55: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.4558, E=0.8225,
KL=0.0592, wKL=1.0000]
Train E55: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4558, E=0.8225,
KL=0.0592, wKL=1.0000]
Train E55: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.4286, E=0.8178,
KL=0.0605, wKL=1.0000]
Train E55: 68%|██████▊ | 17/25 [00:23<00:11, 1.40s/batch, N=1.4286, E=0.8178,
KL=0.0605, wKL=1.0000]
Train E55: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.5827, E=0.8210,
KL=0.0612, wKL=1.0000]
Train E55: 72%|███████▏ | 18/25 [00:24<00:09, 1.39s/batch, N=1.5827, E=0.8210,
KL=0.0612, wKL=1.0000]
Train E55: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4752, E=0.8240,
KL=0.0606, wKL=1.0000]
Train E55: 76%|███████▌ | 19/25 [00:26<00:08, 1.38s/batch, N=1.4752, E=0.8240,
KL=0.0606, wKL=1.0000]
Train E55: 76%|███████▌ | 19/25 [00:28<00:08, 1.38s/batch, N=1.5546, E=0.8258,
KL=0.0615, wKL=1.0000]
Train E55: 80%|████████ | 20/25 [00:28<00:08, 1.61s/batch, N=1.5546, E=0.8258,
KL=0.0615, wKL=1.0000]
Train E55: 80%|████████ | 20/25 [00:29<00:08, 1.61s/batch, N=1.4775, E=0.8217,
KL=0.0603, wKL=1.0000]
Train E55: 84%|████████▍ | 21/25 [00:29<00:06, 1.58s/batch, N=1.4775, E=0.8217,
KL=0.0603, wKL=1.0000]
Train E55: 84%|████████▍ | 21/25 [00:31<00:06, 1.58s/batch, N=1.3895, E=0.8185,
KL=0.0607, wKL=1.0000]
Train E55: 88%|████████▊ | 22/25 [00:31<00:04, 1.51s/batch, N=1.3895, E=0.8185,
KL=0.0607, wKL=1.0000]
Train E55: 88%|████████▊ | 22/25 [00:32<00:04, 1.51s/batch, N=1.5480, E=0.8249,
KL=0.0624, wKL=1.0000]
Train E55: 92%|█████████▏| 23/25 [00:32<00:02, 1.48s/batch, N=1.5480, E=0.8249,
KL=0.0624, wKL=1.0000]
Train E55: 92%|█████████▏| 23/25 [00:34<00:02, 1.48s/batch, N=1.4393, E=0.8189,
KL=0.0590, wKL=1.0000]
Train E55: 96%|█████████▌| 24/25 [00:34<00:01, 1.47s/batch, N=1.4393, E=0.8189,
KL=0.0590, wKL=1.0000]
Train E55: 96%|█████████▌| 24/25 [00:34<00:01, 1.47s/batch, N=1.4660, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E55: 100%|██████████| 25/25 [00:34<00:00, 1.22s/batch, N=1.4660, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E55: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4660, E=0.8250,
KL=0.0587, wKL=1.0000]
1975.7s 186 [Epoch 055] Total: 2.3340 | N: 1.4827 | E: 0.8211 | KL(1.00×0.5):
0.0606
2009.9s 187 Train E56: 0%| | 0/25 [00:00<?, ?batch/s]
Train E56: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4897, E=0.8172, KL=0.0608,
wKL=1.0000]
Train E56: 4%|▍ | 1/25 [00:01<00:33, 1.39s/batch, N=1.4897, E=0.8172,
KL=0.0608, wKL=1.0000]
Train E56: 4%|▍ | 1/25 [00:02<00:33, 1.39s/batch, N=1.5538, E=0.8217,
KL=0.0593, wKL=1.0000]
Train E56: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.5538, E=0.8217,
KL=0.0593, wKL=1.0000]
Train E56: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4686, E=0.8256,
KL=0.0599, wKL=1.0000]
Train E56: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4686, E=0.8256,
KL=0.0599, wKL=1.0000]
Train E56: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.5151, E=0.8162,
KL=0.0611, wKL=1.0000]
Train E56: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5151, E=0.8162,
KL=0.0611, wKL=1.0000]
Train E56: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.4412, E=0.8174,
KL=0.0599, wKL=1.0000]
Train E56: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.4412, E=0.8174,
KL=0.0599, wKL=1.0000]
Train E56: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4282, E=0.8252,
KL=0.0594, wKL=1.0000]
Train E56: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.4282, E=0.8252,
KL=0.0594, wKL=1.0000]
Train E56: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4506, E=0.8197,
KL=0.0601, wKL=1.0000]
Train E56: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4506, E=0.8197,
KL=0.0601, wKL=1.0000]
Train E56: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.4377, E=0.8204,
KL=0.0600, wKL=1.0000]
Train E56: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4377, E=0.8204,
KL=0.0600, wKL=1.0000]
Train E56: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4778, E=0.8185,
KL=0.0595, wKL=1.0000]
Train E56: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4778, E=0.8185,
KL=0.0595, wKL=1.0000]
Train E56: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5146, E=0.8198,
KL=0.0592, wKL=1.0000]
Train E56: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5146, E=0.8198,
KL=0.0592, wKL=1.0000]
Train E56: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.6206, E=0.8271,
KL=0.0609, wKL=1.0000]
Train E56: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.6206, E=0.8271,
KL=0.0609, wKL=1.0000]
Train E56: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4893, E=0.8230,
KL=0.0592, wKL=1.0000]
Train E56: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4893, E=0.8230,
KL=0.0592, wKL=1.0000]
Train E56: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4349, E=0.8215,
KL=0.0590, wKL=1.0000]
Train E56: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4349, E=0.8215,
KL=0.0590, wKL=1.0000]
Train E56: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4728, E=0.8194,
KL=0.0601, wKL=1.0000]
Train E56: 56%|█████▌ | 14/25 [00:19<00:15, 1.37s/batch, N=1.4728, E=0.8194,
KL=0.0601, wKL=1.0000]
Train E56: 56%|█████▌ | 14/25 [00:20<00:15, 1.37s/batch, N=1.4217, E=0.8214,
KL=0.0590, wKL=1.0000]
Train E56: 60%|██████ | 15/25 [00:20<00:13, 1.38s/batch, N=1.4217, E=0.8214,
KL=0.0590, wKL=1.0000]
Train E56: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4738, E=0.8265,
KL=0.0592, wKL=1.0000]
Train E56: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4738, E=0.8265,
KL=0.0592, wKL=1.0000]
Train E56: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4171, E=0.8223,
KL=0.0588, wKL=1.0000]
Train E56: 68%|██████▊ | 17/25 [00:23<00:11, 1.38s/batch, N=1.4171, E=0.8223,
KL=0.0588, wKL=1.0000]
Train E56: 68%|██████▊ | 17/25 [00:25<00:11, 1.38s/batch, N=1.4776, E=0.8200,
KL=0.0592, wKL=1.0000]
Train E56: 72%|███████▏ | 18/25 [00:25<00:10, 1.44s/batch, N=1.4776, E=0.8200,
KL=0.0592, wKL=1.0000]
Train E56: 72%|███████▏ | 18/25 [00:26<00:10, 1.44s/batch, N=1.5783, E=0.8205,
KL=0.0593, wKL=1.0000]
Train E56: 76%|███████▌ | 19/25 [00:26<00:08, 1.44s/batch, N=1.5783, E=0.8205,
KL=0.0593, wKL=1.0000]
Train E56: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.5008, E=0.8202,
KL=0.0601, wKL=1.0000]
Train E56: 80%|████████ | 20/25 [00:27<00:07, 1.43s/batch, N=1.5008, E=0.8202,
KL=0.0601, wKL=1.0000]
Train E56: 80%|████████ | 20/25 [00:29<00:07, 1.43s/batch, N=1.5469, E=0.8231,
KL=0.0598, wKL=1.0000]
Train E56: 84%|████████▍ | 21/25 [00:29<00:05, 1.43s/batch, N=1.5469, E=0.8231,
KL=0.0598, wKL=1.0000]
Train E56: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.3813, E=0.8227,
KL=0.0578, wKL=1.0000]
Train E56: 88%|████████▊ | 22/25 [00:30<00:04, 1.42s/batch, N=1.3813, E=0.8227,
KL=0.0578, wKL=1.0000]
Train E56: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.4835, E=0.8200,
KL=0.0595, wKL=1.0000]
Train E56: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4835, E=0.8200,
KL=0.0595, wKL=1.0000]
Train E56: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4326, E=0.8203,
KL=0.0588, wKL=1.0000]
Train E56: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.4326, E=0.8203,
KL=0.0588, wKL=1.0000]
Train E56: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.6360, E=0.8177,
KL=0.0595, wKL=1.0000]
Train E56: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.6360, E=0.8177,
KL=0.0595, wKL=1.0000]
Train E56: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.6360, E=0.8177,
KL=0.0595, wKL=1.0000]
2009.9s 188 [Epoch 056] Total: 2.3331 | N: 1.4822 | E: 0.8212 | KL(1.00×0.5):
0.0596
2044.8s 189 Train E57: 0%| | 0/25 [00:00<?, ?batch/s]
Train E57: 0%| | 0/25 [00:02<?, ?batch/s, N=1.4930, E=0.8226, KL=0.0590,
wKL=1.0000]
Train E57: 4%|▍ | 1/25 [00:02<00:48, 2.02s/batch, N=1.4930, E=0.8226,
KL=0.0590, wKL=1.0000]
Train E57: 4%|▍ | 1/25 [00:03<00:48, 2.02s/batch, N=1.4115, E=0.8207,
KL=0.0583, wKL=1.0000]
Train E57: 8%|▊ | 2/25 [00:03<00:38, 1.66s/batch, N=1.4115, E=0.8207,
KL=0.0583, wKL=1.0000]
Train E57: 8%|▊ | 2/25 [00:04<00:38, 1.66s/batch, N=1.4536, E=0.8215,
KL=0.0598, wKL=1.0000]
Train E57: 12%|█▏ | 3/25 [00:04<00:33, 1.52s/batch, N=1.4536, E=0.8215,
KL=0.0598, wKL=1.0000]
Train E57: 12%|█▏ | 3/25 [00:06<00:33, 1.52s/batch, N=1.5131, E=0.8186,
KL=0.0597, wKL=1.0000]
Train E57: 16%|█▌ | 4/25 [00:06<00:30, 1.46s/batch, N=1.5131, E=0.8186,
KL=0.0597, wKL=1.0000]
Train E57: 16%|█▌ | 4/25 [00:07<00:30, 1.46s/batch, N=1.5687, E=0.8208,
KL=0.0592, wKL=1.0000]
Train E57: 20%|██ | 5/25 [00:07<00:28, 1.43s/batch, N=1.5687, E=0.8208,
KL=0.0592, wKL=1.0000]
Train E57: 20%|██ | 5/25 [00:08<00:28, 1.43s/batch, N=1.4961, E=0.8180,
KL=0.0589, wKL=1.0000]
Train E57: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.4961, E=0.8180,
KL=0.0589, wKL=1.0000]
Train E57: 24%|██▍ | 6/25 [00:10<00:26, 1.40s/batch, N=1.4652, E=0.8226,
KL=0.0596, wKL=1.0000]
Train E57: 28%|██▊ | 7/25 [00:10<00:25, 1.40s/batch, N=1.4652, E=0.8226,
KL=0.0596, wKL=1.0000]
Train E57: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4354, E=0.8162,
KL=0.0586, wKL=1.0000]
Train E57: 32%|███▏ | 8/25 [00:11<00:24, 1.42s/batch, N=1.4354, E=0.8162,
KL=0.0586, wKL=1.0000]
Train E57: 32%|███▏ | 8/25 [00:13<00:24, 1.42s/batch, N=1.4943, E=0.8167,
KL=0.0592, wKL=1.0000]
Train E57: 36%|███▌ | 9/25 [00:13<00:22, 1.41s/batch, N=1.4943, E=0.8167,
KL=0.0592, wKL=1.0000]
Train E57: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.5461, E=0.8270,
KL=0.0593, wKL=1.0000]
Train E57: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.5461, E=0.8270,
KL=0.0593, wKL=1.0000]
Train E57: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4839, E=0.8195,
KL=0.0601, wKL=1.0000]
Train E57: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4839, E=0.8195,
KL=0.0601, wKL=1.0000]
Train E57: 44%|████▍ | 11/25 [00:17<00:19, 1.39s/batch, N=1.4294, E=0.8251,
KL=0.0602, wKL=1.0000]
Train E57: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4294, E=0.8251,
KL=0.0602, wKL=1.0000]
Train E57: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4766, E=0.8220,
KL=0.0609, wKL=1.0000]
Train E57: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4766, E=0.8220,
KL=0.0609, wKL=1.0000]
Train E57: 52%|█████▏ | 13/25 [00:20<00:16, 1.39s/batch, N=1.5450, E=0.8208,
KL=0.0598, wKL=1.0000]
Train E57: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5450, E=0.8208,
KL=0.0598, wKL=1.0000]
Train E57: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4963, E=0.8199,
KL=0.0607, wKL=1.0000]
Train E57: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4963, E=0.8199,
KL=0.0607, wKL=1.0000]
Train E57: 60%|██████ | 15/25 [00:23<00:14, 1.40s/batch, N=1.5216, E=0.8200,
KL=0.0586, wKL=1.0000]
Train E57: 64%|██████▍ | 16/25 [00:23<00:13, 1.48s/batch, N=1.5216, E=0.8200,
KL=0.0586, wKL=1.0000]
Train E57: 64%|██████▍ | 16/25 [00:24<00:13, 1.48s/batch, N=1.4825, E=0.8172,
KL=0.0588, wKL=1.0000]
Train E57: 68%|██████▊ | 17/25 [00:24<00:11, 1.45s/batch, N=1.4825, E=0.8172,
KL=0.0588, wKL=1.0000]
Train E57: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.5928, E=0.8191,
KL=0.0582, wKL=1.0000]
Train E57: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.5928, E=0.8191,
KL=0.0582, wKL=1.0000]
Train E57: 72%|███████▏ | 18/25 [00:27<00:10, 1.43s/batch, N=1.4310, E=0.8216,
KL=0.0576, wKL=1.0000]
Train E57: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.4310, E=0.8216,
KL=0.0576, wKL=1.0000]
Train E57: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.4585, E=0.8227,
KL=0.0576, wKL=1.0000]
Train E57: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4585, E=0.8227,
KL=0.0576, wKL=1.0000]
Train E57: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4809, E=0.8252,
KL=0.0576, wKL=1.0000]
Train E57: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4809, E=0.8252,
KL=0.0576, wKL=1.0000]
Train E57: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.3490, E=0.8262,
KL=0.0572, wKL=1.0000]
Train E57: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.3490, E=0.8262,
KL=0.0572, wKL=1.0000]
Train E57: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.5422, E=0.8256,
KL=0.0582, wKL=1.0000]
Train E57: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5422, E=0.8256,
KL=0.0582, wKL=1.0000]
Train E57: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4474, E=0.8198,
KL=0.0576, wKL=1.0000]
Train E57: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4474, E=0.8198,
KL=0.0576, wKL=1.0000]
Train E57: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4220, E=0.8269,
KL=0.0570, wKL=1.0000]
Train E57: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4220, E=0.8269,
KL=0.0570, wKL=1.0000]
Train E57: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4220, E=0.8269,
KL=0.0570, wKL=1.0000]
2044.8s 190 [Epoch 057] Total: 2.3336 | N: 1.4829 | E: 0.8213 | KL(1.00×0.5):
0.0589
2079.9s 191 Train E58: 0%| | 0/25 [00:00<?, ?batch/s]
Train E58: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5280, E=0.8213, KL=0.0587,
wKL=1.0000]
Train E58: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5280, E=0.8213,
KL=0.0587, wKL=1.0000]
Train E58: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4515, E=0.8206,
KL=0.0579, wKL=1.0000]
Train E58: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4515, E=0.8206,
KL=0.0579, wKL=1.0000]
Train E58: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4281, E=0.8224,
KL=0.0595, wKL=1.0000]
Train E58: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4281, E=0.8224,
KL=0.0595, wKL=1.0000]
Train E58: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.3850, E=0.8196,
KL=0.0585, wKL=1.0000]
Train E58: 16%|█▌ | 4/25 [00:05<00:28, 1.35s/batch, N=1.3850, E=0.8196,
KL=0.0585, wKL=1.0000]
Train E58: 16%|█▌ | 4/25 [00:06<00:28, 1.35s/batch, N=1.4562, E=0.8218,
KL=0.0587, wKL=1.0000]
Train E58: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4562, E=0.8218,
KL=0.0587, wKL=1.0000]
Train E58: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5751, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E58: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5751, E=0.8250,
KL=0.0587, wKL=1.0000]
Train E58: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5012, E=0.8229,
KL=0.0596, wKL=1.0000]
Train E58: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5012, E=0.8229,
KL=0.0596, wKL=1.0000]
Train E58: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4887, E=0.8213,
KL=0.0595, wKL=1.0000]
Train E58: 32%|███▏ | 8/25 [00:11<00:27, 1.60s/batch, N=1.4887, E=0.8213,
KL=0.0595, wKL=1.0000]
Train E58: 32%|███▏ | 8/25 [00:13<00:27, 1.60s/batch, N=1.5004, E=0.8208,
KL=0.0599, wKL=1.0000]
Train E58: 36%|███▌ | 9/25 [00:13<00:24, 1.53s/batch, N=1.5004, E=0.8208,
KL=0.0599, wKL=1.0000]
Train E58: 36%|███▌ | 9/25 [00:14<00:24, 1.53s/batch, N=1.5834, E=0.8202,
KL=0.0599, wKL=1.0000]
Train E58: 40%|████ | 10/25 [00:14<00:22, 1.49s/batch, N=1.5834, E=0.8202,
KL=0.0599, wKL=1.0000]
Train E58: 40%|████ | 10/25 [00:15<00:22, 1.49s/batch, N=1.5269, E=0.8251,
KL=0.0597, wKL=1.0000]
Train E58: 44%|████▍ | 11/25 [00:15<00:20, 1.48s/batch, N=1.5269, E=0.8251,
KL=0.0597, wKL=1.0000]
Train E58: 44%|████▍ | 11/25 [00:17<00:20, 1.48s/batch, N=1.5177, E=0.8193,
KL=0.0610, wKL=1.0000]
Train E58: 48%|████▊ | 12/25 [00:17<00:18, 1.46s/batch, N=1.5177, E=0.8193,
KL=0.0610, wKL=1.0000]
Train E58: 48%|████▊ | 12/25 [00:18<00:18, 1.46s/batch, N=1.4307, E=0.8164,
KL=0.0589, wKL=1.0000]
Train E58: 52%|█████▏ | 13/25 [00:18<00:17, 1.43s/batch, N=1.4307, E=0.8164,
KL=0.0589, wKL=1.0000]
Train E58: 52%|█████▏ | 13/25 [00:20<00:17, 1.43s/batch, N=1.4053, E=0.8197,
KL=0.0581, wKL=1.0000]
Train E58: 56%|█████▌ | 14/25 [00:20<00:16, 1.51s/batch, N=1.4053, E=0.8197,
KL=0.0581, wKL=1.0000]
Train E58: 56%|█████▌ | 14/25 [00:21<00:16, 1.51s/batch, N=1.5196, E=0.8211,
KL=0.0577, wKL=1.0000]
Train E58: 60%|██████ | 15/25 [00:21<00:14, 1.48s/batch, N=1.5196, E=0.8211,
KL=0.0577, wKL=1.0000]
Train E58: 60%|██████ | 15/25 [00:23<00:14, 1.48s/batch, N=1.4431, E=0.8264,
KL=0.0567, wKL=1.0000]
Train E58: 64%|██████▍ | 16/25 [00:23<00:13, 1.45s/batch, N=1.4431, E=0.8264,
KL=0.0567, wKL=1.0000]
Train E58: 64%|██████▍ | 16/25 [00:24<00:13, 1.45s/batch, N=1.5717, E=0.8247,
KL=0.0570, wKL=1.0000]
Train E58: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.5717, E=0.8247,
KL=0.0570, wKL=1.0000]
Train E58: 68%|██████▊ | 17/25 [00:26<00:11, 1.44s/batch, N=1.4408, E=0.8238,
KL=0.0566, wKL=1.0000]
Train E58: 72%|███████▏ | 18/25 [00:26<00:10, 1.46s/batch, N=1.4408, E=0.8238,
KL=0.0566, wKL=1.0000]
Train E58: 72%|███████▏ | 18/25 [00:27<00:10, 1.46s/batch, N=1.4716, E=0.8234,
KL=0.0567, wKL=1.0000]
Train E58: 76%|███████▌ | 19/25 [00:27<00:08, 1.43s/batch, N=1.4716, E=0.8234,
KL=0.0567, wKL=1.0000]
Train E58: 76%|███████▌ | 19/25 [00:28<00:08, 1.43s/batch, N=1.4371, E=0.8178,
KL=0.0573, wKL=1.0000]
Train E58: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4371, E=0.8178,
KL=0.0573, wKL=1.0000]
Train E58: 80%|████████ | 20/25 [00:30<00:07, 1.42s/batch, N=1.4514, E=0.8161,
KL=0.0574, wKL=1.0000]
Train E58: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4514, E=0.8161,
KL=0.0574, wKL=1.0000]
Train E58: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5225, E=0.8175,
KL=0.0582, wKL=1.0000]
Train E58: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5225, E=0.8175,
KL=0.0582, wKL=1.0000]
Train E58: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.4665, E=0.8210,
KL=0.0580, wKL=1.0000]
Train E58: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4665, E=0.8210,
KL=0.0580, wKL=1.0000]
Train E58: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4892, E=0.8182,
KL=0.0583, wKL=1.0000]
Train E58: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4892, E=0.8182,
KL=0.0583, wKL=1.0000]
Train E58: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.4484, E=0.8257,
KL=0.0577, wKL=1.0000]
Train E58: 100%|██████████| 25/25 [00:35<00:00, 1.16s/batch, N=1.4484, E=0.8257,
KL=0.0577, wKL=1.0000]
Train E58: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4484, E=0.8257,
KL=0.0577, wKL=1.0000]
2079.9s 192 [Epoch 058] Total: 2.3328 | N: 1.4824 | E: 0.8212 | KL(1.00×0.5):
0.0584
2114.6s 193 Train E59: 0%| | 0/25 [00:00<?, ?batch/s]
Train E59: 0%| | 0/25 [00:01<?, ?batch/s, N=1.3896, E=0.8201, KL=0.0578,
wKL=1.0000]
Train E59: 4%|▍ | 1/25 [00:01<00:33, 1.39s/batch, N=1.3896, E=0.8201,
KL=0.0578, wKL=1.0000]
Train E59: 4%|▍ | 1/25 [00:02<00:33, 1.39s/batch, N=1.4594, E=0.8187,
KL=0.0567, wKL=1.0000]
Train E59: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4594, E=0.8187,
KL=0.0567, wKL=1.0000]
Train E59: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.3810, E=0.8206,
KL=0.0576, wKL=1.0000]
Train E59: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.3810, E=0.8206,
KL=0.0576, wKL=1.0000]
Train E59: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4191, E=0.8234,
KL=0.0569, wKL=1.0000]
Train E59: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4191, E=0.8234,
KL=0.0569, wKL=1.0000]
Train E59: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.5472, E=0.8217,
KL=0.0569, wKL=1.0000]
Train E59: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5472, E=0.8217,
KL=0.0569, wKL=1.0000]
Train E59: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4681, E=0.8221,
KL=0.0564, wKL=1.0000]
Train E59: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4681, E=0.8221,
KL=0.0564, wKL=1.0000]
Train E59: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4830, E=0.8199,
KL=0.0567, wKL=1.0000]
Train E59: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4830, E=0.8199,
KL=0.0567, wKL=1.0000]
Train E59: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.3956, E=0.8221,
KL=0.0567, wKL=1.0000]
Train E59: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.3956, E=0.8221,
KL=0.0567, wKL=1.0000]
Train E59: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.5022, E=0.8210,
KL=0.0589, wKL=1.0000]
Train E59: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.5022, E=0.8210,
KL=0.0589, wKL=1.0000]
Train E59: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4354, E=0.8206,
KL=0.0573, wKL=1.0000]
Train E59: 40%|████ | 10/25 [00:13<00:21, 1.40s/batch, N=1.4354, E=0.8206,
KL=0.0573, wKL=1.0000]
Train E59: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.5134, E=0.8223,
KL=0.0580, wKL=1.0000]
Train E59: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.5134, E=0.8223,
KL=0.0580, wKL=1.0000]
Train E59: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4888, E=0.8170,
KL=0.0576, wKL=1.0000]
Train E59: 48%|████▊ | 12/25 [00:16<00:18, 1.44s/batch, N=1.4888, E=0.8170,
KL=0.0576, wKL=1.0000]
Train E59: 48%|████▊ | 12/25 [00:18<00:18, 1.44s/batch, N=1.4964, E=0.8205,
KL=0.0575, wKL=1.0000]
Train E59: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4964, E=0.8205,
KL=0.0575, wKL=1.0000]
Train E59: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4565, E=0.8209,
KL=0.0567, wKL=1.0000]
Train E59: 56%|█████▌ | 14/25 [00:20<00:17, 1.61s/batch, N=1.4565, E=0.8209,
KL=0.0567, wKL=1.0000]
Train E59: 56%|█████▌ | 14/25 [00:21<00:17, 1.61s/batch, N=1.5354, E=0.8231,
KL=0.0576, wKL=1.0000]
Train E59: 60%|██████ | 15/25 [00:21<00:15, 1.55s/batch, N=1.5354, E=0.8231,
KL=0.0576, wKL=1.0000]
Train E59: 60%|██████ | 15/25 [00:23<00:15, 1.55s/batch, N=1.5142, E=0.8156,
KL=0.0579, wKL=1.0000]
Train E59: 64%|██████▍ | 16/25 [00:23<00:13, 1.50s/batch, N=1.5142, E=0.8156,
KL=0.0579, wKL=1.0000]
Train E59: 64%|██████▍ | 16/25 [00:24<00:13, 1.50s/batch, N=1.5506, E=0.8194,
KL=0.0585, wKL=1.0000]
Train E59: 68%|██████▊ | 17/25 [00:24<00:11, 1.47s/batch, N=1.5506, E=0.8194,
KL=0.0585, wKL=1.0000]
Train E59: 68%|██████▊ | 17/25 [00:25<00:11, 1.47s/batch, N=1.5020, E=0.8224,
KL=0.0567, wKL=1.0000]
Train E59: 72%|███████▏ | 18/25 [00:25<00:10, 1.45s/batch, N=1.5020, E=0.8224,
KL=0.0567, wKL=1.0000]
Train E59: 72%|███████▏ | 18/25 [00:27<00:10, 1.45s/batch, N=1.4980, E=0.8222,
KL=0.0575, wKL=1.0000]
Train E59: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.4980, E=0.8222,
KL=0.0575, wKL=1.0000]
Train E59: 76%|███████▌ | 19/25 [00:28<00:08, 1.42s/batch, N=1.5125, E=0.8258,
KL=0.0579, wKL=1.0000]
Train E59: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5125, E=0.8258,
KL=0.0579, wKL=1.0000]
Train E59: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.5610, E=0.8221,
KL=0.0561, wKL=1.0000]
Train E59: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.5610, E=0.8221,
KL=0.0561, wKL=1.0000]
Train E59: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.4307, E=0.8185,
KL=0.0560, wKL=1.0000]
Train E59: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.4307, E=0.8185,
KL=0.0560, wKL=1.0000]
Train E59: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5730, E=0.8241,
KL=0.0581, wKL=1.0000]
Train E59: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5730, E=0.8241,
KL=0.0581, wKL=1.0000]
Train E59: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4375, E=0.8229,
KL=0.0565, wKL=1.0000]
Train E59: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4375, E=0.8229,
KL=0.0565, wKL=1.0000]
Train E59: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5060, E=0.8232,
KL=0.0591, wKL=1.0000]
Train E59: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.5060, E=0.8232,
KL=0.0591, wKL=1.0000]
Train E59: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5060, E=0.8232,
KL=0.0591, wKL=1.0000]
2114.6s 194 [Epoch 059] Total: 2.3315 | N: 1.4817 | E: 0.8212 | KL(1.00×0.5):
0.0573
2149.4s 195 Train E60: 0%| | 0/25 [00:00<?, ?batch/s]
Train E60: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5427, E=0.8289, KL=0.0561,
wKL=1.0000]
Train E60: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5427, E=0.8289,
KL=0.0561, wKL=1.0000]
Train E60: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5053, E=0.8214,
KL=0.0567, wKL=1.0000]
Train E60: 8%|▊ | 2/25 [00:02<00:31, 1.37s/batch, N=1.5053, E=0.8214,
KL=0.0567, wKL=1.0000]
Train E60: 8%|▊ | 2/25 [00:04<00:31, 1.37s/batch, N=1.4755, E=0.8227,
KL=0.0568, wKL=1.0000]
Train E60: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4755, E=0.8227,
KL=0.0568, wKL=1.0000]
Train E60: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4752, E=0.8237,
KL=0.0571, wKL=1.0000]
Train E60: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4752, E=0.8237,
KL=0.0571, wKL=1.0000]
Train E60: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.5050, E=0.8212,
KL=0.0565, wKL=1.0000]
Train E60: 20%|██ | 5/25 [00:06<00:27, 1.40s/batch, N=1.5050, E=0.8212,
KL=0.0565, wKL=1.0000]
Train E60: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.5242, E=0.8195,
KL=0.0579, wKL=1.0000]
Train E60: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5242, E=0.8195,
KL=0.0579, wKL=1.0000]
Train E60: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5417, E=0.8197,
KL=0.0568, wKL=1.0000]
Train E60: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.5417, E=0.8197,
KL=0.0568, wKL=1.0000]
Train E60: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.5324, E=0.8200,
KL=0.0568, wKL=1.0000]
Train E60: 32%|███▏ | 8/25 [00:11<00:23, 1.37s/batch, N=1.5324, E=0.8200,
KL=0.0568, wKL=1.0000]
Train E60: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.5417, E=0.8272,
KL=0.0563, wKL=1.0000]
Train E60: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.5417, E=0.8272,
KL=0.0563, wKL=1.0000]
Train E60: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.4623, E=0.8212,
KL=0.0561, wKL=1.0000]
Train E60: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.4623, E=0.8212,
KL=0.0561, wKL=1.0000]
Train E60: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4679, E=0.8211,
KL=0.0568, wKL=1.0000]
Train E60: 44%|████▍ | 11/25 [00:15<00:20, 1.44s/batch, N=1.4679, E=0.8211,
KL=0.0568, wKL=1.0000]
Train E60: 44%|████▍ | 11/25 [00:16<00:20, 1.44s/batch, N=1.5331, E=0.8231,
KL=0.0570, wKL=1.0000]
Train E60: 48%|████▊ | 12/25 [00:16<00:18, 1.43s/batch, N=1.5331, E=0.8231,
KL=0.0570, wKL=1.0000]
Train E60: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.4464, E=0.8192,
KL=0.0560, wKL=1.0000]
Train E60: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4464, E=0.8192,
KL=0.0560, wKL=1.0000]
Train E60: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5154, E=0.8192,
KL=0.0562, wKL=1.0000]
Train E60: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5154, E=0.8192,
KL=0.0562, wKL=1.0000]
Train E60: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5149, E=0.8218,
KL=0.0563, wKL=1.0000]
Train E60: 60%|██████ | 15/25 [00:21<00:13, 1.39s/batch, N=1.5149, E=0.8218,
KL=0.0563, wKL=1.0000]
Train E60: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.4454, E=0.8180,
KL=0.0558, wKL=1.0000]
Train E60: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.4454, E=0.8180,
KL=0.0558, wKL=1.0000]
Train E60: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.4025, E=0.8182,
KL=0.0555, wKL=1.0000]
Train E60: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.4025, E=0.8182,
KL=0.0555, wKL=1.0000]
Train E60: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4223, E=0.8136,
KL=0.0567, wKL=1.0000]
Train E60: 72%|███████▏ | 18/25 [00:25<00:09, 1.38s/batch, N=1.4223, E=0.8136,
KL=0.0567, wKL=1.0000]
Train E60: 72%|███████▏ | 18/25 [00:27<00:09, 1.38s/batch, N=1.4164, E=0.8181,
KL=0.0570, wKL=1.0000]
Train E60: 76%|███████▌ | 19/25 [00:27<00:09, 1.59s/batch, N=1.4164, E=0.8181,
KL=0.0570, wKL=1.0000]
Train E60: 76%|███████▌ | 19/25 [00:28<00:09, 1.59s/batch, N=1.5217, E=0.8194,
KL=0.0584, wKL=1.0000]
Train E60: 80%|████████ | 20/25 [00:28<00:07, 1.52s/batch, N=1.5217, E=0.8194,
KL=0.0584, wKL=1.0000]
Train E60: 80%|████████ | 20/25 [00:29<00:07, 1.52s/batch, N=1.3921, E=0.8245,
KL=0.0562, wKL=1.0000]
Train E60: 84%|████████▍ | 21/25 [00:29<00:05, 1.47s/batch, N=1.3921, E=0.8245,
KL=0.0562, wKL=1.0000]
Train E60: 84%|████████▍ | 21/25 [00:31<00:05, 1.47s/batch, N=1.4009, E=0.8258,
KL=0.0587, wKL=1.0000]
Train E60: 88%|████████▊ | 22/25 [00:31<00:04, 1.44s/batch, N=1.4009, E=0.8258,
KL=0.0587, wKL=1.0000]
Train E60: 88%|████████▊ | 22/25 [00:32<00:04, 1.44s/batch, N=1.5431, E=0.8212,
KL=0.0569, wKL=1.0000]
Train E60: 92%|█████████▏| 23/25 [00:32<00:02, 1.43s/batch, N=1.5431, E=0.8212,
KL=0.0569, wKL=1.0000]
Train E60: 92%|█████████▏| 23/25 [00:34<00:02, 1.43s/batch, N=1.4008, E=0.8196,
KL=0.0565, wKL=1.0000]
Train E60: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4008, E=0.8196,
KL=0.0565, wKL=1.0000]
Train E60: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5456, E=0.8192,
KL=0.0580, wKL=1.0000]
Train E60: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5456, E=0.8192,
KL=0.0580, wKL=1.0000]
Train E60: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5456, E=0.8192,
KL=0.0580, wKL=1.0000]
2149.4s 196 [Epoch 060] Total: 2.3310 | N: 1.4815 | E: 0.8212 | KL(1.00×0.5):
0.0567
2149.4s 197 Saved checkpoint: /kaggle/working/checkpoints/gvae_60_epoch060.pt
2183.8s 198 Train E61: 0%| | 0/25 [00:00<?, ?batch/s]
Train E61: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4862, E=0.8194, KL=0.0564,
wKL=1.0000]
Train E61: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.4862, E=0.8194,
KL=0.0564, wKL=1.0000]
Train E61: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.5512, E=0.8199,
KL=0.0557, wKL=1.0000]
Train E61: 8%|▊ | 2/25 [00:02<00:32, 1.40s/batch, N=1.5512, E=0.8199,
KL=0.0557, wKL=1.0000]
Train E61: 8%|▊ | 2/25 [00:04<00:32, 1.40s/batch, N=1.5592, E=0.8242,
KL=0.0570, wKL=1.0000]
Train E61: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.5592, E=0.8242,
KL=0.0570, wKL=1.0000]
Train E61: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.4455, E=0.8231,
KL=0.0552, wKL=1.0000]
Train E61: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4455, E=0.8231,
KL=0.0552, wKL=1.0000]
Train E61: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5418, E=0.8181,
KL=0.0560, wKL=1.0000]
Train E61: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5418, E=0.8181,
KL=0.0560, wKL=1.0000]
Train E61: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.5273, E=0.8180,
KL=0.0562, wKL=1.0000]
Train E61: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.5273, E=0.8180,
KL=0.0562, wKL=1.0000]
Train E61: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5232, E=0.8243,
KL=0.0561, wKL=1.0000]
Train E61: 28%|██▊ | 7/25 [00:09<00:26, 1.45s/batch, N=1.5232, E=0.8243,
KL=0.0561, wKL=1.0000]
Train E61: 28%|██▊ | 7/25 [00:11<00:26, 1.45s/batch, N=1.5159, E=0.8253,
KL=0.0560, wKL=1.0000]
Train E61: 32%|███▏ | 8/25 [00:11<00:25, 1.48s/batch, N=1.5159, E=0.8253,
KL=0.0560, wKL=1.0000]
Train E61: 32%|███▏ | 8/25 [00:12<00:25, 1.48s/batch, N=1.4236, E=0.8191,
KL=0.0556, wKL=1.0000]
Train E61: 36%|███▌ | 9/25 [00:12<00:23, 1.45s/batch, N=1.4236, E=0.8191,
KL=0.0556, wKL=1.0000]
Train E61: 36%|███▌ | 9/25 [00:14<00:23, 1.45s/batch, N=1.4462, E=0.8208,
KL=0.0564, wKL=1.0000]
Train E61: 40%|████ | 10/25 [00:14<00:21, 1.43s/batch, N=1.4462, E=0.8208,
KL=0.0564, wKL=1.0000]
Train E61: 40%|████ | 10/25 [00:15<00:21, 1.43s/batch, N=1.4677, E=0.8263,
KL=0.0557, wKL=1.0000]
Train E61: 44%|████▍ | 11/25 [00:15<00:19, 1.43s/batch, N=1.4677, E=0.8263,
KL=0.0557, wKL=1.0000]
Train E61: 44%|████▍ | 11/25 [00:17<00:19, 1.43s/batch, N=1.4512, E=0.8240,
KL=0.0555, wKL=1.0000]
Train E61: 48%|████▊ | 12/25 [00:17<00:18, 1.41s/batch, N=1.4512, E=0.8240,
KL=0.0555, wKL=1.0000]
Train E61: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5160, E=0.8220,
KL=0.0566, wKL=1.0000]
Train E61: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5160, E=0.8220,
KL=0.0566, wKL=1.0000]
Train E61: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.5105, E=0.8250,
KL=0.0580, wKL=1.0000]
Train E61: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.5105, E=0.8250,
KL=0.0580, wKL=1.0000]
Train E61: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4069, E=0.8171,
KL=0.0553, wKL=1.0000]
Train E61: 60%|██████ | 15/25 [00:21<00:13, 1.40s/batch, N=1.4069, E=0.8171,
KL=0.0553, wKL=1.0000]
Train E61: 60%|██████ | 15/25 [00:22<00:13, 1.40s/batch, N=1.5228, E=0.8184,
KL=0.0563, wKL=1.0000]
Train E61: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.5228, E=0.8184,
KL=0.0563, wKL=1.0000]
Train E61: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4608, E=0.8207,
KL=0.0554, wKL=1.0000]
Train E61: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4608, E=0.8207,
KL=0.0554, wKL=1.0000]
Train E61: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5074, E=0.8205,
KL=0.0553, wKL=1.0000]
Train E61: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5074, E=0.8205,
KL=0.0553, wKL=1.0000]
Train E61: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.4420, E=0.8201,
KL=0.0554, wKL=1.0000]
Train E61: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.4420, E=0.8201,
KL=0.0554, wKL=1.0000]
Train E61: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5041, E=0.8161,
KL=0.0564, wKL=1.0000]
Train E61: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5041, E=0.8161,
KL=0.0564, wKL=1.0000]
Train E61: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.3591, E=0.8210,
KL=0.0553, wKL=1.0000]
Train E61: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.3591, E=0.8210,
KL=0.0553, wKL=1.0000]
Train E61: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4759, E=0.8187,
KL=0.0556, wKL=1.0000]
Train E61: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.4759, E=0.8187,
KL=0.0556, wKL=1.0000]
Train E61: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.4933, E=0.8246,
KL=0.0566, wKL=1.0000]
Train E61: 92%|█████████▏| 23/25 [00:32<00:02, 1.42s/batch, N=1.4933, E=0.8246,
KL=0.0566, wKL=1.0000]
Train E61: 92%|█████████▏| 23/25 [00:33<00:02, 1.42s/batch, N=1.4116, E=0.8201,
KL=0.0560, wKL=1.0000]
Train E61: 96%|█████████▌| 24/25 [00:33<00:01, 1.41s/batch, N=1.4116, E=0.8201,
KL=0.0560, wKL=1.0000]
Train E61: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.4995, E=0.8214,
KL=0.0559, wKL=1.0000]
Train E61: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.4995, E=0.8214,
KL=0.0559, wKL=1.0000]
Train E61: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4995, E=0.8214,
KL=0.0559, wKL=1.0000]
2183.8s 199 [Epoch 061] Total: 2.3306 | N: 1.4815 | E: 0.8211 | KL(1.00×0.5):
0.0560
2218.9s 200 Train E62: 0%| | 0/25 [00:00<?, ?batch/s]
Train E62: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5869, E=0.8184, KL=0.0562,
wKL=1.0000]
Train E62: 4%|▍ | 1/25 [00:01<00:46, 1.94s/batch, N=1.5869, E=0.8184,
KL=0.0562, wKL=1.0000]
Train E62: 4%|▍ | 1/25 [00:03<00:46, 1.94s/batch, N=1.5100, E=0.8260,
KL=0.0556, wKL=1.0000]
Train E62: 8%|▊ | 2/25 [00:03<00:36, 1.60s/batch, N=1.5100, E=0.8260,
KL=0.0556, wKL=1.0000]
Train E62: 8%|▊ | 2/25 [00:04<00:36, 1.60s/batch, N=1.4725, E=0.8156,
KL=0.0555, wKL=1.0000]
Train E62: 12%|█▏ | 3/25 [00:04<00:33, 1.50s/batch, N=1.4725, E=0.8156,
KL=0.0555, wKL=1.0000]
Train E62: 12%|█▏ | 3/25 [00:06<00:33, 1.50s/batch, N=1.4697, E=0.8195,
KL=0.0550, wKL=1.0000]
Train E62: 16%|█▌ | 4/25 [00:06<00:30, 1.45s/batch, N=1.4697, E=0.8195,
KL=0.0550, wKL=1.0000]
Train E62: 16%|█▌ | 4/25 [00:07<00:30, 1.45s/batch, N=1.3685, E=0.8214,
KL=0.0550, wKL=1.0000]
Train E62: 20%|██ | 5/25 [00:07<00:31, 1.57s/batch, N=1.3685, E=0.8214,
KL=0.0550, wKL=1.0000]
Train E62: 20%|██ | 5/25 [00:09<00:31, 1.57s/batch, N=1.4554, E=0.8246,
KL=0.0550, wKL=1.0000]
Train E62: 24%|██▍ | 6/25 [00:09<00:28, 1.51s/batch, N=1.4554, E=0.8246,
KL=0.0550, wKL=1.0000]
Train E62: 24%|██▍ | 6/25 [00:10<00:28, 1.51s/batch, N=1.5100, E=0.8221,
KL=0.0560, wKL=1.0000]
Train E62: 28%|██▊ | 7/25 [00:10<00:26, 1.47s/batch, N=1.5100, E=0.8221,
KL=0.0560, wKL=1.0000]
Train E62: 28%|██▊ | 7/25 [00:12<00:26, 1.47s/batch, N=1.4020, E=0.8235,
KL=0.0549, wKL=1.0000]
Train E62: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.4020, E=0.8235,
KL=0.0549, wKL=1.0000]
Train E62: 32%|███▏ | 8/25 [00:13<00:24, 1.45s/batch, N=1.5667, E=0.8249,
KL=0.0552, wKL=1.0000]
Train E62: 36%|███▌ | 9/25 [00:13<00:22, 1.42s/batch, N=1.5667, E=0.8249,
KL=0.0552, wKL=1.0000]
Train E62: 36%|███▌ | 9/25 [00:14<00:22, 1.42s/batch, N=1.3666, E=0.8176,
KL=0.0546, wKL=1.0000]
Train E62: 40%|████ | 10/25 [00:14<00:21, 1.42s/batch, N=1.3666, E=0.8176,
KL=0.0546, wKL=1.0000]
Train E62: 40%|████ | 10/25 [00:16<00:21, 1.42s/batch, N=1.5454, E=0.8205,
KL=0.0557, wKL=1.0000]
Train E62: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5454, E=0.8205,
KL=0.0557, wKL=1.0000]
Train E62: 44%|████▍ | 11/25 [00:17<00:19, 1.41s/batch, N=1.3799, E=0.8154,
KL=0.0552, wKL=1.0000]
Train E62: 48%|████▊ | 12/25 [00:17<00:18, 1.42s/batch, N=1.3799, E=0.8154,
KL=0.0552, wKL=1.0000]
Train E62: 48%|████▊ | 12/25 [00:19<00:18, 1.42s/batch, N=1.4801, E=0.8241,
KL=0.0553, wKL=1.0000]
Train E62: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4801, E=0.8241,
KL=0.0553, wKL=1.0000]
Train E62: 52%|█████▏ | 13/25 [00:20<00:16, 1.40s/batch, N=1.4903, E=0.8225,
KL=0.0550, wKL=1.0000]
Train E62: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.4903, E=0.8225,
KL=0.0550, wKL=1.0000]
Train E62: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5751, E=0.8206,
KL=0.0559, wKL=1.0000]
Train E62: 60%|██████ | 15/25 [00:21<00:14, 1.41s/batch, N=1.5751, E=0.8206,
KL=0.0559, wKL=1.0000]
Train E62: 60%|██████ | 15/25 [00:23<00:14, 1.41s/batch, N=1.4334, E=0.8185,
KL=0.0555, wKL=1.0000]
Train E62: 64%|██████▍ | 16/25 [00:23<00:12, 1.41s/batch, N=1.4334, E=0.8185,
KL=0.0555, wKL=1.0000]
Train E62: 64%|██████▍ | 16/25 [00:24<00:12, 1.41s/batch, N=1.4481, E=0.8206,
KL=0.0542, wKL=1.0000]
Train E62: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4481, E=0.8206,
KL=0.0542, wKL=1.0000]
Train E62: 68%|██████▊ | 17/25 [00:26<00:11, 1.41s/batch, N=1.5558, E=0.8232,
KL=0.0543, wKL=1.0000]
Train E62: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.5558, E=0.8232,
KL=0.0543, wKL=1.0000]
Train E62: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.4670, E=0.8234,
KL=0.0563, wKL=1.0000]
Train E62: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.4670, E=0.8234,
KL=0.0563, wKL=1.0000]
Train E62: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4810, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E62: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4810, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E62: 80%|████████ | 20/25 [00:30<00:07, 1.41s/batch, N=1.4700, E=0.8230,
KL=0.0560, wKL=1.0000]
Train E62: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.4700, E=0.8230,
KL=0.0560, wKL=1.0000]
Train E62: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.5159, E=0.8183,
KL=0.0559, wKL=1.0000]
Train E62: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5159, E=0.8183,
KL=0.0559, wKL=1.0000]
Train E62: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.5283, E=0.8245,
KL=0.0553, wKL=1.0000]
Train E62: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.5283, E=0.8245,
KL=0.0553, wKL=1.0000]
Train E62: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4722, E=0.8216,
KL=0.0559, wKL=1.0000]
Train E62: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4722, E=0.8216,
KL=0.0559, wKL=1.0000]
Train E62: 96%|█████████▌| 24/25 [00:35<00:01, 1.39s/batch, N=1.4968, E=0.8209,
KL=0.0600, wKL=1.0000]
Train E62: 100%|██████████| 25/25 [00:35<00:00, 1.15s/batch, N=1.4968, E=0.8209,
KL=0.0600, wKL=1.0000]
Train E62: 100%|██████████| 25/25 [00:35<00:00, 1.40s/batch, N=1.4968, E=0.8209,
KL=0.0600, wKL=1.0000]
2218.9s 201 [Epoch 062] Total: 2.3305 | N: 1.4815 | E: 0.8212 | KL(1.00×0.5):
0.0554
2254.0s 202 Train E63: 0%| | 0/25 [00:00<?, ?batch/s]
Train E63: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4612, E=0.8169, KL=0.0559,
wKL=1.0000]
Train E63: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4612, E=0.8169,
KL=0.0559, wKL=1.0000]
Train E63: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5522, E=0.8183,
KL=0.0558, wKL=1.0000]
Train E63: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.5522, E=0.8183,
KL=0.0558, wKL=1.0000]
Train E63: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4666, E=0.8217,
KL=0.0548, wKL=1.0000]
Train E63: 12%|█▏ | 3/25 [00:04<00:31, 1.44s/batch, N=1.4666, E=0.8217,
KL=0.0548, wKL=1.0000]
Train E63: 12%|█▏ | 3/25 [00:05<00:31, 1.44s/batch, N=1.3776, E=0.8235,
KL=0.0553, wKL=1.0000]
Train E63: 16%|█▌ | 4/25 [00:05<00:30, 1.43s/batch, N=1.3776, E=0.8235,
KL=0.0553, wKL=1.0000]
Train E63: 16%|█▌ | 4/25 [00:07<00:30, 1.43s/batch, N=1.5305, E=0.8228,
KL=0.0562, wKL=1.0000]
Train E63: 20%|██ | 5/25 [00:07<00:28, 1.41s/batch, N=1.5305, E=0.8228,
KL=0.0562, wKL=1.0000]
Train E63: 20%|██ | 5/25 [00:09<00:28, 1.41s/batch, N=1.3883, E=0.8177,
KL=0.0548, wKL=1.0000]
Train E63: 24%|██▍ | 6/25 [00:09<00:30, 1.62s/batch, N=1.3883, E=0.8177,
KL=0.0548, wKL=1.0000]
Train E63: 24%|██▍ | 6/25 [00:10<00:30, 1.62s/batch, N=1.4968, E=0.8264,
KL=0.0542, wKL=1.0000]
Train E63: 28%|██▊ | 7/25 [00:10<00:27, 1.54s/batch, N=1.4968, E=0.8264,
KL=0.0542, wKL=1.0000]
Train E63: 28%|██▊ | 7/25 [00:11<00:27, 1.54s/batch, N=1.4666, E=0.8199,
KL=0.0562, wKL=1.0000]
Train E63: 32%|███▏ | 8/25 [00:11<00:25, 1.51s/batch, N=1.4666, E=0.8199,
KL=0.0562, wKL=1.0000]
Train E63: 32%|███▏ | 8/25 [00:13<00:25, 1.51s/batch, N=1.5110, E=0.8223,
KL=0.0540, wKL=1.0000]
Train E63: 36%|███▌ | 9/25 [00:13<00:23, 1.47s/batch, N=1.5110, E=0.8223,
KL=0.0540, wKL=1.0000]
Train E63: 36%|███▌ | 9/25 [00:14<00:23, 1.47s/batch, N=1.4464, E=0.8208,
KL=0.0551, wKL=1.0000]
Train E63: 40%|████ | 10/25 [00:14<00:21, 1.44s/batch, N=1.4464, E=0.8208,
KL=0.0551, wKL=1.0000]
Train E63: 40%|████ | 10/25 [00:16<00:21, 1.44s/batch, N=1.4042, E=0.8183,
KL=0.0533, wKL=1.0000]
Train E63: 44%|████▍ | 11/25 [00:16<00:20, 1.44s/batch, N=1.4042, E=0.8183,
KL=0.0533, wKL=1.0000]
Train E63: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.5291, E=0.8190,
KL=0.0546, wKL=1.0000]
Train E63: 48%|████▊ | 12/25 [00:17<00:18, 1.45s/batch, N=1.5291, E=0.8190,
KL=0.0546, wKL=1.0000]
Train E63: 48%|████▊ | 12/25 [00:18<00:18, 1.45s/batch, N=1.4994, E=0.8217,
KL=0.0532, wKL=1.0000]
Train E63: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4994, E=0.8217,
KL=0.0532, wKL=1.0000]
Train E63: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4757, E=0.8232,
KL=0.0554, wKL=1.0000]
Train E63: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.4757, E=0.8232,
KL=0.0554, wKL=1.0000]
Train E63: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.5515, E=0.8221,
KL=0.0554, wKL=1.0000]
Train E63: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.5515, E=0.8221,
KL=0.0554, wKL=1.0000]
Train E63: 60%|██████ | 15/25 [00:23<00:14, 1.43s/batch, N=1.4774, E=0.8212,
KL=0.0545, wKL=1.0000]
Train E63: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.4774, E=0.8212,
KL=0.0545, wKL=1.0000]
Train E63: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.4834, E=0.8208,
KL=0.0553, wKL=1.0000]
Train E63: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.4834, E=0.8208,
KL=0.0553, wKL=1.0000]
Train E63: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4705, E=0.8189,
KL=0.0550, wKL=1.0000]
Train E63: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4705, E=0.8189,
KL=0.0550, wKL=1.0000]
Train E63: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5043, E=0.8228,
KL=0.0543, wKL=1.0000]
Train E63: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5043, E=0.8228,
KL=0.0543, wKL=1.0000]
Train E63: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4977, E=0.8218,
KL=0.0552, wKL=1.0000]
Train E63: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4977, E=0.8218,
KL=0.0552, wKL=1.0000]
Train E63: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.5200, E=0.8197,
KL=0.0553, wKL=1.0000]
Train E63: 84%|████████▍ | 21/25 [00:30<00:05, 1.41s/batch, N=1.5200, E=0.8197,
KL=0.0553, wKL=1.0000]
Train E63: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4464, E=0.8244,
KL=0.0538, wKL=1.0000]
Train E63: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4464, E=0.8244,
KL=0.0538, wKL=1.0000]
Train E63: 88%|████████▊ | 22/25 [00:33<00:04, 1.41s/batch, N=1.4687, E=0.8249,
KL=0.0544, wKL=1.0000]
Train E63: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4687, E=0.8249,
KL=0.0544, wKL=1.0000]
Train E63: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.5023, E=0.8226,
KL=0.0538, wKL=1.0000]
Train E63: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5023, E=0.8226,
KL=0.0538, wKL=1.0000]
Train E63: 96%|█████████▌| 24/25 [00:35<00:01, 1.40s/batch, N=1.5394, E=0.8284,
KL=0.0542, wKL=1.0000]
Train E63: 100%|██████████| 25/25 [00:35<00:00, 1.19s/batch, N=1.5394, E=0.8284,
KL=0.0542, wKL=1.0000]
Train E63: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.5394, E=0.8284,
KL=0.0542, wKL=1.0000]
2254.0s 203 [Epoch 063] Total: 2.3302 | N: 1.4813 | E: 0.8214 | KL(1.00×0.5):
0.0548
2288.8s 204 Train E64: 0%| | 0/25 [00:00<?, ?batch/s]
Train E64: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4348, E=0.8238, KL=0.0542,
wKL=1.0000]
Train E64: 4%|▍ | 1/25 [00:01<00:34, 1.42s/batch, N=1.4348, E=0.8238,
KL=0.0542, wKL=1.0000]
Train E64: 4%|▍ | 1/25 [00:02<00:34, 1.42s/batch, N=1.5075, E=0.8223,
KL=0.0544, wKL=1.0000]
Train E64: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.5075, E=0.8223,
KL=0.0544, wKL=1.0000]
Train E64: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4151, E=0.8184,
KL=0.0559, wKL=1.0000]
Train E64: 12%|█▏ | 3/25 [00:04<00:30, 1.39s/batch, N=1.4151, E=0.8184,
KL=0.0559, wKL=1.0000]
Train E64: 12%|█▏ | 3/25 [00:05<00:30, 1.39s/batch, N=1.5013, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E64: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.5013, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E64: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.5299, E=0.8214,
KL=0.0540, wKL=1.0000]
Train E64: 20%|██ | 5/25 [00:06<00:27, 1.40s/batch, N=1.5299, E=0.8214,
KL=0.0540, wKL=1.0000]
Train E64: 20%|██ | 5/25 [00:08<00:27, 1.40s/batch, N=1.4526, E=0.8203,
KL=0.0538, wKL=1.0000]
Train E64: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4526, E=0.8203,
KL=0.0538, wKL=1.0000]
Train E64: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4864, E=0.8193,
KL=0.0550, wKL=1.0000]
Train E64: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4864, E=0.8193,
KL=0.0550, wKL=1.0000]
Train E64: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4620, E=0.8191,
KL=0.0538, wKL=1.0000]
Train E64: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4620, E=0.8191,
KL=0.0538, wKL=1.0000]
Train E64: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4288, E=0.8215,
KL=0.0540, wKL=1.0000]
Train E64: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4288, E=0.8215,
KL=0.0540, wKL=1.0000]
Train E64: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.4979, E=0.8187,
KL=0.0544, wKL=1.0000]
Train E64: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4979, E=0.8187,
KL=0.0544, wKL=1.0000]
Train E64: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5414, E=0.8181,
KL=0.0550, wKL=1.0000]
Train E64: 44%|████▍ | 11/25 [00:15<00:22, 1.58s/batch, N=1.5414, E=0.8181,
KL=0.0550, wKL=1.0000]
Train E64: 44%|████▍ | 11/25 [00:17<00:22, 1.58s/batch, N=1.5897, E=0.8265,
KL=0.0552, wKL=1.0000]
Train E64: 48%|████▊ | 12/25 [00:17<00:19, 1.53s/batch, N=1.5897, E=0.8265,
KL=0.0552, wKL=1.0000]
Train E64: 48%|████▊ | 12/25 [00:18<00:19, 1.53s/batch, N=1.4349, E=0.8228,
KL=0.0552, wKL=1.0000]
Train E64: 52%|█████▏ | 13/25 [00:18<00:17, 1.48s/batch, N=1.4349, E=0.8228,
KL=0.0552, wKL=1.0000]
Train E64: 52%|█████▏ | 13/25 [00:20<00:17, 1.48s/batch, N=1.6031, E=0.8219,
KL=0.0543, wKL=1.0000]
Train E64: 56%|█████▌ | 14/25 [00:20<00:15, 1.45s/batch, N=1.6031, E=0.8219,
KL=0.0543, wKL=1.0000]
Train E64: 56%|█████▌ | 14/25 [00:21<00:15, 1.45s/batch, N=1.4090, E=0.8217,
KL=0.0534, wKL=1.0000]
Train E64: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.4090, E=0.8217,
KL=0.0534, wKL=1.0000]
Train E64: 60%|██████ | 15/25 [00:22<00:14, 1.43s/batch, N=1.4542, E=0.8209,
KL=0.0536, wKL=1.0000]
Train E64: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.4542, E=0.8209,
KL=0.0536, wKL=1.0000]
Train E64: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.4342, E=0.8238,
KL=0.0534, wKL=1.0000]
Train E64: 68%|██████▊ | 17/25 [00:24<00:11, 1.41s/batch, N=1.4342, E=0.8238,
KL=0.0534, wKL=1.0000]
Train E64: 68%|██████▊ | 17/25 [00:25<00:11, 1.41s/batch, N=1.5048, E=0.8256,
KL=0.0539, wKL=1.0000]
Train E64: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.5048, E=0.8256,
KL=0.0539, wKL=1.0000]
Train E64: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.4602, E=0.8164,
KL=0.0547, wKL=1.0000]
Train E64: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.4602, E=0.8164,
KL=0.0547, wKL=1.0000]
Train E64: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4523, E=0.8222,
KL=0.0540, wKL=1.0000]
Train E64: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4523, E=0.8222,
KL=0.0540, wKL=1.0000]
Train E64: 80%|████████ | 20/25 [00:29<00:06, 1.40s/batch, N=1.4608, E=0.8221,
KL=0.0541, wKL=1.0000]
Train E64: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.4608, E=0.8221,
KL=0.0541, wKL=1.0000]
Train E64: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.5291, E=0.8252,
KL=0.0542, wKL=1.0000]
Train E64: 88%|████████▊ | 22/25 [00:31<00:04, 1.38s/batch, N=1.5291, E=0.8252,
KL=0.0542, wKL=1.0000]
Train E64: 88%|████████▊ | 22/25 [00:32<00:04, 1.38s/batch, N=1.5219, E=0.8194,
KL=0.0555, wKL=1.0000]
Train E64: 92%|█████████▏| 23/25 [00:32<00:02, 1.45s/batch, N=1.5219, E=0.8194,
KL=0.0555, wKL=1.0000]
Train E64: 92%|█████████▏| 23/25 [00:34<00:02, 1.45s/batch, N=1.3813, E=0.8200,
KL=0.0531, wKL=1.0000]
Train E64: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.3813, E=0.8200,
KL=0.0531, wKL=1.0000]
Train E64: 96%|█████████▌| 24/25 [00:34<00:01, 1.43s/batch, N=1.6153, E=0.8154,
KL=0.0562, wKL=1.0000]
Train E64: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.6153, E=0.8154,
KL=0.0562, wKL=1.0000]
Train E64: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.6153, E=0.8154,
KL=0.0562, wKL=1.0000]
2288.8s 205 [Epoch 064] Total: 2.3297 | N: 1.4812 | E: 0.8213 | KL(1.00×0.5):
0.0544
2323.7s 206 Train E65: 0%| | 0/25 [00:00<?, ?batch/s]
Train E65: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5643, E=0.8207, KL=0.0539,
wKL=1.0000]
Train E65: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5643, E=0.8207,
KL=0.0539, wKL=1.0000]
Train E65: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4792, E=0.8196,
KL=0.0540, wKL=1.0000]
Train E65: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4792, E=0.8196,
KL=0.0540, wKL=1.0000]
Train E65: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.5189, E=0.8176,
KL=0.0541, wKL=1.0000]
Train E65: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.5189, E=0.8176,
KL=0.0541, wKL=1.0000]
Train E65: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.4931, E=0.8186,
KL=0.0535, wKL=1.0000]
Train E65: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4931, E=0.8186,
KL=0.0535, wKL=1.0000]
Train E65: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4659, E=0.8245,
KL=0.0528, wKL=1.0000]
Train E65: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4659, E=0.8245,
KL=0.0528, wKL=1.0000]
Train E65: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4153, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E65: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4153, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E65: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4647, E=0.8252,
KL=0.0531, wKL=1.0000]
Train E65: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4647, E=0.8252,
KL=0.0531, wKL=1.0000]
Train E65: 28%|██▊ | 7/25 [00:11<00:24, 1.39s/batch, N=1.4272, E=0.8183,
KL=0.0534, wKL=1.0000]
Train E65: 32%|███▏ | 8/25 [00:11<00:23, 1.38s/batch, N=1.4272, E=0.8183,
KL=0.0534, wKL=1.0000]
Train E65: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.3457, E=0.8225,
KL=0.0525, wKL=1.0000]
Train E65: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.3457, E=0.8225,
KL=0.0525, wKL=1.0000]
Train E65: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.4709, E=0.8218,
KL=0.0532, wKL=1.0000]
Train E65: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4709, E=0.8218,
KL=0.0532, wKL=1.0000]
Train E65: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5343, E=0.8225,
KL=0.0534, wKL=1.0000]
Train E65: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5343, E=0.8225,
KL=0.0534, wKL=1.0000]
Train E65: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5450, E=0.8258,
KL=0.0532, wKL=1.0000]
Train E65: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5450, E=0.8258,
KL=0.0532, wKL=1.0000]
Train E65: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5394, E=0.8201,
KL=0.0558, wKL=1.0000]
Train E65: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5394, E=0.8201,
KL=0.0558, wKL=1.0000]
Train E65: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4526, E=0.8231,
KL=0.0533, wKL=1.0000]
Train E65: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.4526, E=0.8231,
KL=0.0533, wKL=1.0000]
Train E65: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4885, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E65: 60%|██████ | 15/25 [00:20<00:13, 1.38s/batch, N=1.4885, E=0.8200,
KL=0.0544, wKL=1.0000]
Train E65: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4469, E=0.8196,
KL=0.0533, wKL=1.0000]
Train E65: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4469, E=0.8196,
KL=0.0533, wKL=1.0000]
Train E65: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4468, E=0.8239,
KL=0.0540, wKL=1.0000]
Train E65: 68%|██████▊ | 17/25 [00:24<00:12, 1.59s/batch, N=1.4468, E=0.8239,
KL=0.0540, wKL=1.0000]
Train E65: 68%|██████▊ | 17/25 [00:25<00:12, 1.59s/batch, N=1.4485, E=0.8205,
KL=0.0534, wKL=1.0000]
Train E65: 72%|███████▏ | 18/25 [00:25<00:10, 1.54s/batch, N=1.4485, E=0.8205,
KL=0.0534, wKL=1.0000]
Train E65: 72%|███████▏ | 18/25 [00:27<00:10, 1.54s/batch, N=1.4383, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E65: 76%|███████▌ | 19/25 [00:27<00:08, 1.49s/batch, N=1.4383, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E65: 76%|███████▌ | 19/25 [00:28<00:08, 1.49s/batch, N=1.5229, E=0.8171,
KL=0.0545, wKL=1.0000]
Train E65: 80%|████████ | 20/25 [00:28<00:07, 1.50s/batch, N=1.5229, E=0.8171,
KL=0.0545, wKL=1.0000]
Train E65: 80%|████████ | 20/25 [00:30<00:07, 1.50s/batch, N=1.4957, E=0.8244,
KL=0.0530, wKL=1.0000]
Train E65: 84%|████████▍ | 21/25 [00:30<00:05, 1.50s/batch, N=1.4957, E=0.8244,
KL=0.0530, wKL=1.0000]
Train E65: 84%|████████▍ | 21/25 [00:31<00:05, 1.50s/batch, N=1.4536, E=0.8238,
KL=0.0544, wKL=1.0000]
Train E65: 88%|████████▊ | 22/25 [00:31<00:04, 1.47s/batch, N=1.4536, E=0.8238,
KL=0.0544, wKL=1.0000]
Train E65: 88%|████████▊ | 22/25 [00:32<00:04, 1.47s/batch, N=1.5599, E=0.8233,
KL=0.0543, wKL=1.0000]
Train E65: 92%|█████████▏| 23/25 [00:32<00:02, 1.46s/batch, N=1.5599, E=0.8233,
KL=0.0543, wKL=1.0000]
Train E65: 92%|█████████▏| 23/25 [00:34<00:02, 1.46s/batch, N=1.5398, E=0.8217,
KL=0.0535, wKL=1.0000]
Train E65: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5398, E=0.8217,
KL=0.0535, wKL=1.0000]
Train E65: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4616, E=0.8173,
KL=0.0525, wKL=1.0000]
Train E65: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.4616, E=0.8173,
KL=0.0525, wKL=1.0000]
Train E65: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4616, E=0.8173,
KL=0.0525, wKL=1.0000]
2323.7s 207 [Epoch 065] Total: 2.3294 | N: 1.4812 | E: 0.8213 | KL(1.00×0.5):
0.0537
2358.3s 208 Train E66: 0%| | 0/25 [00:00<?, ?batch/s]
Train E66: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4298, E=0.8203, KL=0.0520,
wKL=1.0000]
Train E66: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4298, E=0.8203,
KL=0.0520, wKL=1.0000]
Train E66: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.4013, E=0.8181,
KL=0.0530, wKL=1.0000]
Train E66: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.4013, E=0.8181,
KL=0.0530, wKL=1.0000]
Train E66: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.5090, E=0.8231,
KL=0.0523, wKL=1.0000]
Train E66: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5090, E=0.8231,
KL=0.0523, wKL=1.0000]
Train E66: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4873, E=0.8234,
KL=0.0527, wKL=1.0000]
Train E66: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.4873, E=0.8234,
KL=0.0527, wKL=1.0000]
Train E66: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4589, E=0.8169,
KL=0.0539, wKL=1.0000]
Train E66: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4589, E=0.8169,
KL=0.0539, wKL=1.0000]
Train E66: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4133, E=0.8201,
KL=0.0526, wKL=1.0000]
Train E66: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4133, E=0.8201,
KL=0.0526, wKL=1.0000]
Train E66: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.3960, E=0.8222,
KL=0.0519, wKL=1.0000]
Train E66: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.3960, E=0.8222,
KL=0.0519, wKL=1.0000]
Train E66: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.4687, E=0.8214,
KL=0.0539, wKL=1.0000]
Train E66: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.4687, E=0.8214,
KL=0.0539, wKL=1.0000]
Train E66: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4718, E=0.8255,
KL=0.0521, wKL=1.0000]
Train E66: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4718, E=0.8255,
KL=0.0521, wKL=1.0000]
Train E66: 36%|███▌ | 9/25 [00:13<00:22, 1.38s/batch, N=1.5631, E=0.8215,
KL=0.0534, wKL=1.0000]
Train E66: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.5631, E=0.8215,
KL=0.0534, wKL=1.0000]
Train E66: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5146, E=0.8166,
KL=0.0576, wKL=1.0000]
Train E66: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.5146, E=0.8166,
KL=0.0576, wKL=1.0000]
Train E66: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.4594, E=0.8241,
KL=0.0531, wKL=1.0000]
Train E66: 48%|████▊ | 12/25 [00:16<00:17, 1.37s/batch, N=1.4594, E=0.8241,
KL=0.0531, wKL=1.0000]
Train E66: 48%|████▊ | 12/25 [00:17<00:17, 1.37s/batch, N=1.4708, E=0.8167,
KL=0.0530, wKL=1.0000]
Train E66: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4708, E=0.8167,
KL=0.0530, wKL=1.0000]
Train E66: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.5087, E=0.8222,
KL=0.0531, wKL=1.0000]
Train E66: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.5087, E=0.8222,
KL=0.0531, wKL=1.0000]
Train E66: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4599, E=0.8217,
KL=0.0525, wKL=1.0000]
Train E66: 60%|██████ | 15/25 [00:20<00:14, 1.40s/batch, N=1.4599, E=0.8217,
KL=0.0525, wKL=1.0000]
Train E66: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.5173, E=0.8200,
KL=0.0530, wKL=1.0000]
Train E66: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5173, E=0.8200,
KL=0.0530, wKL=1.0000]
Train E66: 64%|██████▍ | 16/25 [00:23<00:12, 1.39s/batch, N=1.5032, E=0.8208,
KL=0.0524, wKL=1.0000]
Train E66: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5032, E=0.8208,
KL=0.0524, wKL=1.0000]
Train E66: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4609, E=0.8257,
KL=0.0524, wKL=1.0000]
Train E66: 72%|███████▏ | 18/25 [00:24<00:09, 1.38s/batch, N=1.4609, E=0.8257,
KL=0.0524, wKL=1.0000]
Train E66: 72%|███████▏ | 18/25 [00:27<00:09, 1.38s/batch, N=1.4616, E=0.8238,
KL=0.0531, wKL=1.0000]
Train E66: 76%|███████▌ | 19/25 [00:27<00:09, 1.62s/batch, N=1.4616, E=0.8238,
KL=0.0531, wKL=1.0000]
Train E66: 76%|███████▌ | 19/25 [00:28<00:09, 1.62s/batch, N=1.4918, E=0.8186,
KL=0.0525, wKL=1.0000]
Train E66: 80%|████████ | 20/25 [00:28<00:07, 1.55s/batch, N=1.4918, E=0.8186,
KL=0.0525, wKL=1.0000]
Train E66: 80%|████████ | 20/25 [00:29<00:07, 1.55s/batch, N=1.6526, E=0.8289,
KL=0.0540, wKL=1.0000]
Train E66: 84%|████████▍ | 21/25 [00:29<00:06, 1.50s/batch, N=1.6526, E=0.8289,
KL=0.0540, wKL=1.0000]
Train E66: 84%|████████▍ | 21/25 [00:31<00:06, 1.50s/batch, N=1.4237, E=0.8210,
KL=0.0523, wKL=1.0000]
Train E66: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4237, E=0.8210,
KL=0.0523, wKL=1.0000]
Train E66: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.4858, E=0.8241,
KL=0.0533, wKL=1.0000]
Train E66: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.4858, E=0.8241,
KL=0.0533, wKL=1.0000]
Train E66: 92%|█████████▏| 23/25 [00:33<00:02, 1.44s/batch, N=1.4710, E=0.8188,
KL=0.0547, wKL=1.0000]
Train E66: 96%|█████████▌| 24/25 [00:33<00:01, 1.42s/batch, N=1.4710, E=0.8188,
KL=0.0547, wKL=1.0000]
Train E66: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.6378, E=0.8216,
KL=0.0533, wKL=1.0000]
Train E66: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.6378, E=0.8216,
KL=0.0533, wKL=1.0000]
Train E66: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.6378, E=0.8216,
KL=0.0533, wKL=1.0000]
2358.3s 209 [Epoch 066] Total: 2.3291 | N: 1.4811 | E: 0.8215 | KL(1.00×0.5):
0.0531
2392.3s 210 Train E67: 0%| | 0/25 [00:00<?, ?batch/s]
Train E67: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4827, E=0.8258, KL=0.0535,
wKL=1.0000]
Train E67: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4827, E=0.8258,
KL=0.0535, wKL=1.0000]
Train E67: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4244, E=0.8231,
KL=0.0531, wKL=1.0000]
Train E67: 8%|▊ | 2/25 [00:02<00:31, 1.39s/batch, N=1.4244, E=0.8231,
KL=0.0531, wKL=1.0000]
Train E67: 8%|▊ | 2/25 [00:04<00:31, 1.39s/batch, N=1.4834, E=0.8201,
KL=0.0536, wKL=1.0000]
Train E67: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.4834, E=0.8201,
KL=0.0536, wKL=1.0000]
Train E67: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5212, E=0.8220,
KL=0.0529, wKL=1.0000]
Train E67: 16%|█▌ | 4/25 [00:05<00:28, 1.37s/batch, N=1.5212, E=0.8220,
KL=0.0529, wKL=1.0000]
Train E67: 16%|█▌ | 4/25 [00:06<00:28, 1.37s/batch, N=1.4226, E=0.8193,
KL=0.0518, wKL=1.0000]
Train E67: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4226, E=0.8193,
KL=0.0518, wKL=1.0000]
Train E67: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4276, E=0.8184,
KL=0.0518, wKL=1.0000]
Train E67: 24%|██▍ | 6/25 [00:08<00:26, 1.37s/batch, N=1.4276, E=0.8184,
KL=0.0518, wKL=1.0000]
Train E67: 24%|██▍ | 6/25 [00:09<00:26, 1.37s/batch, N=1.4292, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E67: 28%|██▊ | 7/25 [00:09<00:24, 1.37s/batch, N=1.4292, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E67: 28%|██▊ | 7/25 [00:10<00:24, 1.37s/batch, N=1.5343, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E67: 32%|███▏ | 8/25 [00:10<00:23, 1.37s/batch, N=1.5343, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E67: 32%|███▏ | 8/25 [00:12<00:23, 1.37s/batch, N=1.4281, E=0.8209,
KL=0.0529, wKL=1.0000]
Train E67: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4281, E=0.8209,
KL=0.0529, wKL=1.0000]
Train E67: 36%|███▌ | 9/25 [00:13<00:22, 1.39s/batch, N=1.5147, E=0.8228,
KL=0.0534, wKL=1.0000]
Train E67: 40%|████ | 10/25 [00:13<00:20, 1.39s/batch, N=1.5147, E=0.8228,
KL=0.0534, wKL=1.0000]
Train E67: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4481, E=0.8244,
KL=0.0527, wKL=1.0000]
Train E67: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.4481, E=0.8244,
KL=0.0527, wKL=1.0000]
Train E67: 44%|████▍ | 11/25 [00:16<00:19, 1.38s/batch, N=1.5053, E=0.8253,
KL=0.0526, wKL=1.0000]
Train E67: 48%|████▊ | 12/25 [00:16<00:17, 1.38s/batch, N=1.5053, E=0.8253,
KL=0.0526, wKL=1.0000]
Train E67: 48%|████▊ | 12/25 [00:17<00:17, 1.38s/batch, N=1.4830, E=0.8228,
KL=0.0526, wKL=1.0000]
Train E67: 52%|█████▏ | 13/25 [00:17<00:16, 1.39s/batch, N=1.4830, E=0.8228,
KL=0.0526, wKL=1.0000]
Train E67: 52%|█████▏ | 13/25 [00:19<00:16, 1.39s/batch, N=1.4451, E=0.8246,
KL=0.0520, wKL=1.0000]
Train E67: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4451, E=0.8246,
KL=0.0520, wKL=1.0000]
Train E67: 56%|█████▌ | 14/25 [00:20<00:15, 1.39s/batch, N=1.5060, E=0.8179,
KL=0.0523, wKL=1.0000]
Train E67: 60%|██████ | 15/25 [00:20<00:13, 1.38s/batch, N=1.5060, E=0.8179,
KL=0.0523, wKL=1.0000]
Train E67: 60%|██████ | 15/25 [00:22<00:13, 1.38s/batch, N=1.4106, E=0.8179,
KL=0.0532, wKL=1.0000]
Train E67: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.4106, E=0.8179,
KL=0.0532, wKL=1.0000]
Train E67: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4755, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E67: 68%|██████▊ | 17/25 [00:23<00:11, 1.45s/batch, N=1.4755, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E67: 68%|██████▊ | 17/25 [00:25<00:11, 1.45s/batch, N=1.4910, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E67: 72%|███████▏ | 18/25 [00:25<00:10, 1.43s/batch, N=1.4910, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E67: 72%|███████▏ | 18/25 [00:26<00:10, 1.43s/batch, N=1.4568, E=0.8254,
KL=0.0536, wKL=1.0000]
Train E67: 76%|███████▌ | 19/25 [00:26<00:08, 1.42s/batch, N=1.4568, E=0.8254,
KL=0.0536, wKL=1.0000]
Train E67: 76%|███████▌ | 19/25 [00:27<00:08, 1.42s/batch, N=1.6119, E=0.8218,
KL=0.0530, wKL=1.0000]
Train E67: 80%|████████ | 20/25 [00:27<00:07, 1.40s/batch, N=1.6119, E=0.8218,
KL=0.0530, wKL=1.0000]
Train E67: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.4645, E=0.8199,
KL=0.0546, wKL=1.0000]
Train E67: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.4645, E=0.8199,
KL=0.0546, wKL=1.0000]
Train E67: 84%|████████▍ | 21/25 [00:30<00:05, 1.39s/batch, N=1.5156, E=0.8183,
KL=0.0530, wKL=1.0000]
Train E67: 88%|████████▊ | 22/25 [00:30<00:04, 1.41s/batch, N=1.5156, E=0.8183,
KL=0.0530, wKL=1.0000]
Train E67: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5222, E=0.8220,
KL=0.0531, wKL=1.0000]
Train E67: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5222, E=0.8220,
KL=0.0531, wKL=1.0000]
Train E67: 92%|█████████▏| 23/25 [00:33<00:02, 1.40s/batch, N=1.4979, E=0.8236,
KL=0.0528, wKL=1.0000]
Train E67: 96%|█████████▌| 24/25 [00:33<00:01, 1.39s/batch, N=1.4979, E=0.8236,
KL=0.0528, wKL=1.0000]
Train E67: 96%|█████████▌| 24/25 [00:33<00:01, 1.39s/batch, N=1.5812, E=0.8201,
KL=0.0568, wKL=1.0000]
Train E67: 100%|██████████| 25/25 [00:33<00:00, 1.16s/batch, N=1.5812, E=0.8201,
KL=0.0568, wKL=1.0000]
Train E67: 100%|██████████| 25/25 [00:33<00:00, 1.36s/batch, N=1.5812, E=0.8201,
KL=0.0568, wKL=1.0000]
2392.3s 211 [Epoch 067] Total: 2.3288 | N: 1.4810 | E: 0.8214 | KL(1.00×0.5):
0.0529
2427.1s 212 Train E68: 0%| | 0/25 [00:00<?, ?batch/s]
Train E68: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4609, E=0.8258, KL=0.0529,
wKL=1.0000]
Train E68: 4%|▍ | 1/25 [00:01<00:32, 1.33s/batch, N=1.4609, E=0.8258,
KL=0.0529, wKL=1.0000]
Train E68: 4%|▍ | 1/25 [00:03<00:32, 1.33s/batch, N=1.5097, E=0.8179,
KL=0.0525, wKL=1.0000]
Train E68: 8%|▊ | 2/25 [00:03<00:39, 1.73s/batch, N=1.5097, E=0.8179,
KL=0.0525, wKL=1.0000]
Train E68: 8%|▊ | 2/25 [00:04<00:39, 1.73s/batch, N=1.4122, E=0.8243,
KL=0.0521, wKL=1.0000]
Train E68: 12%|█▏ | 3/25 [00:04<00:34, 1.58s/batch, N=1.4122, E=0.8243,
KL=0.0521, wKL=1.0000]
Train E68: 12%|█▏ | 3/25 [00:06<00:34, 1.58s/batch, N=1.5590, E=0.8228,
KL=0.0527, wKL=1.0000]
Train E68: 16%|█▌ | 4/25 [00:06<00:31, 1.49s/batch, N=1.5590, E=0.8228,
KL=0.0527, wKL=1.0000]
Train E68: 16%|█▌ | 4/25 [00:07<00:31, 1.49s/batch, N=1.4516, E=0.8212,
KL=0.0514, wKL=1.0000]
Train E68: 20%|██ | 5/25 [00:07<00:29, 1.45s/batch, N=1.4516, E=0.8212,
KL=0.0514, wKL=1.0000]
Train E68: 20%|██ | 5/25 [00:08<00:29, 1.45s/batch, N=1.4022, E=0.8233,
KL=0.0535, wKL=1.0000]
Train E68: 24%|██▍ | 6/25 [00:08<00:27, 1.44s/batch, N=1.4022, E=0.8233,
KL=0.0535, wKL=1.0000]
Train E68: 24%|██▍ | 6/25 [00:10<00:27, 1.44s/batch, N=1.5058, E=0.8189,
KL=0.0532, wKL=1.0000]
Train E68: 28%|██▊ | 7/25 [00:10<00:25, 1.42s/batch, N=1.5058, E=0.8189,
KL=0.0532, wKL=1.0000]
Train E68: 28%|██▊ | 7/25 [00:11<00:25, 1.42s/batch, N=1.5012, E=0.8259,
KL=0.0517, wKL=1.0000]
Train E68: 32%|███▏ | 8/25 [00:11<00:24, 1.41s/batch, N=1.5012, E=0.8259,
KL=0.0517, wKL=1.0000]
Train E68: 32%|███▏ | 8/25 [00:13<00:24, 1.41s/batch, N=1.4622, E=0.8236,
KL=0.0515, wKL=1.0000]
Train E68: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4622, E=0.8236,
KL=0.0515, wKL=1.0000]
Train E68: 36%|███▌ | 9/25 [00:14<00:22, 1.40s/batch, N=1.4909, E=0.8215,
KL=0.0522, wKL=1.0000]
Train E68: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.4909, E=0.8215,
KL=0.0522, wKL=1.0000]
Train E68: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4530, E=0.8207,
KL=0.0523, wKL=1.0000]
Train E68: 44%|████▍ | 11/25 [00:15<00:19, 1.38s/batch, N=1.4530, E=0.8207,
KL=0.0523, wKL=1.0000]
Train E68: 44%|████▍ | 11/25 [00:17<00:19, 1.38s/batch, N=1.4958, E=0.8159,
KL=0.0523, wKL=1.0000]
Train E68: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4958, E=0.8159,
KL=0.0523, wKL=1.0000]
Train E68: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.4753, E=0.8223,
KL=0.0512, wKL=1.0000]
Train E68: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.4753, E=0.8223,
KL=0.0512, wKL=1.0000]
Train E68: 52%|█████▏ | 13/25 [00:20<00:17, 1.42s/batch, N=1.4815, E=0.8208,
KL=0.0514, wKL=1.0000]
Train E68: 56%|█████▌ | 14/25 [00:20<00:15, 1.42s/batch, N=1.4815, E=0.8208,
KL=0.0514, wKL=1.0000]
Train E68: 56%|█████▌ | 14/25 [00:21<00:15, 1.42s/batch, N=1.4878, E=0.8212,
KL=0.0519, wKL=1.0000]
Train E68: 60%|██████ | 15/25 [00:21<00:14, 1.47s/batch, N=1.4878, E=0.8212,
KL=0.0519, wKL=1.0000]
Train E68: 60%|██████ | 15/25 [00:23<00:14, 1.47s/batch, N=1.4496, E=0.8189,
KL=0.0519, wKL=1.0000]
Train E68: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.4496, E=0.8189,
KL=0.0519, wKL=1.0000]
Train E68: 64%|██████▍ | 16/25 [00:24<00:12, 1.44s/batch, N=1.5328, E=0.8224,
KL=0.0528, wKL=1.0000]
Train E68: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.5328, E=0.8224,
KL=0.0528, wKL=1.0000]
Train E68: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4857, E=0.8193,
KL=0.0532, wKL=1.0000]
Train E68: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4857, E=0.8193,
KL=0.0532, wKL=1.0000]
Train E68: 72%|███████▏ | 18/25 [00:27<00:09, 1.41s/batch, N=1.3985, E=0.8177,
KL=0.0545, wKL=1.0000]
Train E68: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.3985, E=0.8177,
KL=0.0545, wKL=1.0000]
Train E68: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5506, E=0.8257,
KL=0.0534, wKL=1.0000]
Train E68: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.5506, E=0.8257,
KL=0.0534, wKL=1.0000]
Train E68: 80%|████████ | 20/25 [00:30<00:07, 1.43s/batch, N=1.4341, E=0.8177,
KL=0.0529, wKL=1.0000]
Train E68: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4341, E=0.8177,
KL=0.0529, wKL=1.0000]
Train E68: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.5287, E=0.8223,
KL=0.0522, wKL=1.0000]
Train E68: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.5287, E=0.8223,
KL=0.0522, wKL=1.0000]
Train E68: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5551, E=0.8214,
KL=0.0541, wKL=1.0000]
Train E68: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5551, E=0.8214,
KL=0.0541, wKL=1.0000]
Train E68: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4560, E=0.8220,
KL=0.0520, wKL=1.0000]
Train E68: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4560, E=0.8220,
KL=0.0520, wKL=1.0000]
Train E68: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4847, E=0.8239,
KL=0.0530, wKL=1.0000]
Train E68: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.4847, E=0.8239,
KL=0.0530, wKL=1.0000]
Train E68: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4847, E=0.8239,
KL=0.0530, wKL=1.0000]
2427.1s 213 [Epoch 068] Total: 2.3286 | N: 1.4809 | E: 0.8214 | KL(1.00×0.5):
0.0525
2462.0s 214 Train E69: 0%| | 0/25 [00:00<?, ?batch/s]
Train E69: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4382, E=0.8202, KL=0.0522,
wKL=1.0000]
Train E69: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.4382, E=0.8202,
KL=0.0522, wKL=1.0000]
Train E69: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4556, E=0.8193,
KL=0.0529, wKL=1.0000]
Train E69: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.4556, E=0.8193,
KL=0.0529, wKL=1.0000]
Train E69: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.5935, E=0.8257,
KL=0.0529, wKL=1.0000]
Train E69: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.5935, E=0.8257,
KL=0.0529, wKL=1.0000]
Train E69: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4248, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E69: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4248, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E69: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.5457, E=0.8222,
KL=0.0524, wKL=1.0000]
Train E69: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5457, E=0.8222,
KL=0.0524, wKL=1.0000]
Train E69: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4750, E=0.8258,
KL=0.0534, wKL=1.0000]
Train E69: 24%|██▍ | 6/25 [00:08<00:26, 1.37s/batch, N=1.4750, E=0.8258,
KL=0.0534, wKL=1.0000]
Train E69: 24%|██▍ | 6/25 [00:10<00:26, 1.37s/batch, N=1.4004, E=0.8263,
KL=0.0506, wKL=1.0000]
Train E69: 28%|██▊ | 7/25 [00:10<00:28, 1.58s/batch, N=1.4004, E=0.8263,
KL=0.0506, wKL=1.0000]
Train E69: 28%|██▊ | 7/25 [00:11<00:28, 1.58s/batch, N=1.4377, E=0.8245,
KL=0.0510, wKL=1.0000]
Train E69: 32%|███▏ | 8/25 [00:11<00:25, 1.52s/batch, N=1.4377, E=0.8245,
KL=0.0510, wKL=1.0000]
Train E69: 32%|███▏ | 8/25 [00:12<00:25, 1.52s/batch, N=1.5388, E=0.8237,
KL=0.0526, wKL=1.0000]
Train E69: 36%|███▌ | 9/25 [00:12<00:23, 1.48s/batch, N=1.5388, E=0.8237,
KL=0.0526, wKL=1.0000]
Train E69: 36%|███▌ | 9/25 [00:14<00:23, 1.48s/batch, N=1.5081, E=0.8257,
KL=0.0527, wKL=1.0000]
Train E69: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.5081, E=0.8257,
KL=0.0527, wKL=1.0000]
Train E69: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4934, E=0.8181,
KL=0.0510, wKL=1.0000]
Train E69: 44%|████▍ | 11/25 [00:15<00:20, 1.44s/batch, N=1.4934, E=0.8181,
KL=0.0510, wKL=1.0000]
Train E69: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.4111, E=0.8219,
KL=0.0517, wKL=1.0000]
Train E69: 48%|████▊ | 12/25 [00:17<00:19, 1.48s/batch, N=1.4111, E=0.8219,
KL=0.0517, wKL=1.0000]
Train E69: 48%|████▊ | 12/25 [00:18<00:19, 1.48s/batch, N=1.5216, E=0.8197,
KL=0.0519, wKL=1.0000]
Train E69: 52%|█████▏ | 13/25 [00:18<00:17, 1.50s/batch, N=1.5216, E=0.8197,
KL=0.0519, wKL=1.0000]
Train E69: 52%|█████▏ | 13/25 [00:20<00:17, 1.50s/batch, N=1.4524, E=0.8176,
KL=0.0519, wKL=1.0000]
Train E69: 56%|█████▌ | 14/25 [00:20<00:16, 1.46s/batch, N=1.4524, E=0.8176,
KL=0.0519, wKL=1.0000]
Train E69: 56%|█████▌ | 14/25 [00:21<00:16, 1.46s/batch, N=1.4907, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E69: 60%|██████ | 15/25 [00:21<00:14, 1.44s/batch, N=1.4907, E=0.8221,
KL=0.0537, wKL=1.0000]
Train E69: 60%|██████ | 15/25 [00:23<00:14, 1.44s/batch, N=1.4716, E=0.8187,
KL=0.0517, wKL=1.0000]
Train E69: 64%|██████▍ | 16/25 [00:23<00:12, 1.43s/batch, N=1.4716, E=0.8187,
KL=0.0517, wKL=1.0000]
Train E69: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.3921, E=0.8180,
KL=0.0516, wKL=1.0000]
Train E69: 68%|██████▊ | 17/25 [00:24<00:11, 1.43s/batch, N=1.3921, E=0.8180,
KL=0.0516, wKL=1.0000]
Train E69: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.5576, E=0.8210,
KL=0.0520, wKL=1.0000]
Train E69: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.5576, E=0.8210,
KL=0.0520, wKL=1.0000]
Train E69: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5280, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E69: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5280, E=0.8210,
KL=0.0522, wKL=1.0000]
Train E69: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5645, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E69: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.5645, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E69: 80%|████████ | 20/25 [00:30<00:07, 1.40s/batch, N=1.4365, E=0.8221,
KL=0.0517, wKL=1.0000]
Train E69: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.4365, E=0.8221,
KL=0.0517, wKL=1.0000]
Train E69: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.3795, E=0.8195,
KL=0.0504, wKL=1.0000]
Train E69: 88%|████████▊ | 22/25 [00:31<00:04, 1.40s/batch, N=1.3795, E=0.8195,
KL=0.0504, wKL=1.0000]
Train E69: 88%|████████▊ | 22/25 [00:32<00:04, 1.40s/batch, N=1.5752, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E69: 92%|█████████▏| 23/25 [00:32<00:02, 1.39s/batch, N=1.5752, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E69: 92%|█████████▏| 23/25 [00:34<00:02, 1.39s/batch, N=1.5235, E=0.8222,
KL=0.0520, wKL=1.0000]
Train E69: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.5235, E=0.8222,
KL=0.0520, wKL=1.0000]
Train E69: 96%|█████████▌| 24/25 [00:34<00:01, 1.41s/batch, N=1.3367, E=0.8218,
KL=0.0505, wKL=1.0000]
Train E69: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.3367, E=0.8218,
KL=0.0505, wKL=1.0000]
Train E69: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.3367, E=0.8218,
KL=0.0505, wKL=1.0000]
2462.0s 215 [Epoch 069] Total: 2.3290 | N: 1.4815 | E: 0.8216 | KL(1.00×0.5):
0.0520
2497.0s 216 Train E70: 0%| | 0/25 [00:00<?, ?batch/s]
Train E70: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4894, E=0.8202, KL=0.0532,
wKL=1.0000]
Train E70: 4%|▍ | 1/25 [00:01<00:31, 1.32s/batch, N=1.4894, E=0.8202,
KL=0.0532, wKL=1.0000]
Train E70: 4%|▍ | 1/25 [00:02<00:31, 1.32s/batch, N=1.5078, E=0.8236,
KL=0.0535, wKL=1.0000]
Train E70: 8%|▊ | 2/25 [00:02<00:30, 1.33s/batch, N=1.5078, E=0.8236,
KL=0.0535, wKL=1.0000]
Train E70: 8%|▊ | 2/25 [00:04<00:30, 1.33s/batch, N=1.4210, E=0.8260,
KL=0.0510, wKL=1.0000]
Train E70: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4210, E=0.8260,
KL=0.0510, wKL=1.0000]
Train E70: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4580, E=0.8241,
KL=0.0518, wKL=1.0000]
Train E70: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4580, E=0.8241,
KL=0.0518, wKL=1.0000]
Train E70: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.5560, E=0.8226,
KL=0.0517, wKL=1.0000]
Train E70: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5560, E=0.8226,
KL=0.0517, wKL=1.0000]
Train E70: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4107, E=0.8181,
KL=0.0509, wKL=1.0000]
Train E70: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4107, E=0.8181,
KL=0.0509, wKL=1.0000]
Train E70: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4111, E=0.8203,
KL=0.0513, wKL=1.0000]
Train E70: 28%|██▊ | 7/25 [00:09<00:24, 1.39s/batch, N=1.4111, E=0.8203,
KL=0.0513, wKL=1.0000]
Train E70: 28%|██▊ | 7/25 [00:10<00:24, 1.39s/batch, N=1.5586, E=0.8208,
KL=0.0528, wKL=1.0000]
Train E70: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.5586, E=0.8208,
KL=0.0528, wKL=1.0000]
Train E70: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4006, E=0.8217,
KL=0.0516, wKL=1.0000]
Train E70: 36%|███▌ | 9/25 [00:12<00:22, 1.38s/batch, N=1.4006, E=0.8217,
KL=0.0516, wKL=1.0000]
Train E70: 36%|███▌ | 9/25 [00:14<00:22, 1.38s/batch, N=1.4971, E=0.8154,
KL=0.0528, wKL=1.0000]
Train E70: 40%|████ | 10/25 [00:14<00:22, 1.47s/batch, N=1.4971, E=0.8154,
KL=0.0528, wKL=1.0000]
Train E70: 40%|████ | 10/25 [00:15<00:22, 1.47s/batch, N=1.4002, E=0.8201,
KL=0.0516, wKL=1.0000]
Train E70: 44%|████▍ | 11/25 [00:15<00:20, 1.47s/batch, N=1.4002, E=0.8201,
KL=0.0516, wKL=1.0000]
Train E70: 44%|████▍ | 11/25 [00:17<00:20, 1.47s/batch, N=1.4754, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E70: 48%|████▊ | 12/25 [00:17<00:21, 1.63s/batch, N=1.4754, E=0.8162,
KL=0.0520, wKL=1.0000]
Train E70: 48%|████▊ | 12/25 [00:18<00:21, 1.63s/batch, N=1.4908, E=0.8217,
KL=0.0511, wKL=1.0000]
Train E70: 52%|█████▏ | 13/25 [00:18<00:18, 1.57s/batch, N=1.4908, E=0.8217,
KL=0.0511, wKL=1.0000]
Train E70: 52%|█████▏ | 13/25 [00:20<00:18, 1.57s/batch, N=1.4189, E=0.8176,
KL=0.0512, wKL=1.0000]
Train E70: 56%|█████▌ | 14/25 [00:20<00:16, 1.52s/batch, N=1.4189, E=0.8176,
KL=0.0512, wKL=1.0000]
Train E70: 56%|█████▌ | 14/25 [00:21<00:16, 1.52s/batch, N=1.5273, E=0.8194,
KL=0.0522, wKL=1.0000]
Train E70: 60%|██████ | 15/25 [00:21<00:14, 1.48s/batch, N=1.5273, E=0.8194,
KL=0.0522, wKL=1.0000]
Train E70: 60%|██████ | 15/25 [00:23<00:14, 1.48s/batch, N=1.5741, E=0.8246,
KL=0.0512, wKL=1.0000]
Train E70: 64%|██████▍ | 16/25 [00:23<00:13, 1.46s/batch, N=1.5741, E=0.8246,
KL=0.0512, wKL=1.0000]
Train E70: 64%|██████▍ | 16/25 [00:24<00:13, 1.46s/batch, N=1.3660, E=0.8159,
KL=0.0520, wKL=1.0000]
Train E70: 68%|██████▊ | 17/25 [00:24<00:11, 1.44s/batch, N=1.3660, E=0.8159,
KL=0.0520, wKL=1.0000]
Train E70: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4965, E=0.8236,
KL=0.0509, wKL=1.0000]
Train E70: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4965, E=0.8236,
KL=0.0509, wKL=1.0000]
Train E70: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5127, E=0.8236,
KL=0.0520, wKL=1.0000]
Train E70: 76%|███████▌ | 19/25 [00:27<00:08, 1.41s/batch, N=1.5127, E=0.8236,
KL=0.0520, wKL=1.0000]
Train E70: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.5366, E=0.8263,
KL=0.0521, wKL=1.0000]
Train E70: 80%|████████ | 20/25 [00:28<00:07, 1.43s/batch, N=1.5366, E=0.8263,
KL=0.0521, wKL=1.0000]
Train E70: 80%|████████ | 20/25 [00:30<00:07, 1.43s/batch, N=1.5674, E=0.8269,
KL=0.0514, wKL=1.0000]
Train E70: 84%|████████▍ | 21/25 [00:30<00:05, 1.42s/batch, N=1.5674, E=0.8269,
KL=0.0514, wKL=1.0000]
Train E70: 84%|████████▍ | 21/25 [00:31<00:05, 1.42s/batch, N=1.4979, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E70: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4979, E=0.8220,
KL=0.0519, wKL=1.0000]
Train E70: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5145, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E70: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5145, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E70: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4828, E=0.8227,
KL=0.0514, wKL=1.0000]
Train E70: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4828, E=0.8227,
KL=0.0514, wKL=1.0000]
Train E70: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.3971, E=0.8226,
KL=0.0511, wKL=1.0000]
Train E70: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.3971, E=0.8226,
KL=0.0511, wKL=1.0000]
Train E70: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.3971, E=0.8226,
KL=0.0511, wKL=1.0000]
2497.0s 217 [Epoch 070] Total: 2.3282 | N: 1.4807 | E: 0.8215 | KL(1.00×0.5):
0.0518
2497.0s 218 Saved checkpoint: /kaggle/working/checkpoints/gvae_70_epoch070.pt
2531.7s 219 Train E71: 0%| | 0/25 [00:00<?, ?batch/s]
Train E71: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4885, E=0.8187, KL=0.0518,
wKL=1.0000]
Train E71: 4%|▍ | 1/25 [00:01<00:31, 1.33s/batch, N=1.4885, E=0.8187,
KL=0.0518, wKL=1.0000]
Train E71: 4%|▍ | 1/25 [00:02<00:31, 1.33s/batch, N=1.5137, E=0.8227,
KL=0.0518, wKL=1.0000]
Train E71: 8%|▊ | 2/25 [00:02<00:30, 1.34s/batch, N=1.5137, E=0.8227,
KL=0.0518, wKL=1.0000]
Train E71: 8%|▊ | 2/25 [00:04<00:30, 1.34s/batch, N=1.4715, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E71: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4715, E=0.8207,
KL=0.0540, wKL=1.0000]
Train E71: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.4851, E=0.8207,
KL=0.0518, wKL=1.0000]
Train E71: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4851, E=0.8207,
KL=0.0518, wKL=1.0000]
Train E71: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.5717, E=0.8205,
KL=0.0526, wKL=1.0000]
Train E71: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5717, E=0.8205,
KL=0.0526, wKL=1.0000]
Train E71: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4767, E=0.8250,
KL=0.0501, wKL=1.0000]
Train E71: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.4767, E=0.8250,
KL=0.0501, wKL=1.0000]
Train E71: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.5371, E=0.8274,
KL=0.0509, wKL=1.0000]
Train E71: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.5371, E=0.8274,
KL=0.0509, wKL=1.0000]
Train E71: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5079, E=0.8209,
KL=0.0507, wKL=1.0000]
Train E71: 32%|███▏ | 8/25 [00:11<00:24, 1.45s/batch, N=1.5079, E=0.8209,
KL=0.0507, wKL=1.0000]
Train E71: 32%|███▏ | 8/25 [00:12<00:24, 1.45s/batch, N=1.4509, E=0.8211,
KL=0.0504, wKL=1.0000]
Train E71: 36%|███▌ | 9/25 [00:12<00:22, 1.43s/batch, N=1.4509, E=0.8211,
KL=0.0504, wKL=1.0000]
Train E71: 36%|███▌ | 9/25 [00:14<00:22, 1.43s/batch, N=1.4039, E=0.8212,
KL=0.0507, wKL=1.0000]
Train E71: 40%|████ | 10/25 [00:14<00:21, 1.43s/batch, N=1.4039, E=0.8212,
KL=0.0507, wKL=1.0000]
Train E71: 40%|████ | 10/25 [00:15<00:21, 1.43s/batch, N=1.5231, E=0.8212,
KL=0.0515, wKL=1.0000]
Train E71: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5231, E=0.8212,
KL=0.0515, wKL=1.0000]
Train E71: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4813, E=0.8233,
KL=0.0516, wKL=1.0000]
Train E71: 48%|████▊ | 12/25 [00:16<00:18, 1.41s/batch, N=1.4813, E=0.8233,
KL=0.0516, wKL=1.0000]
Train E71: 48%|████▊ | 12/25 [00:18<00:18, 1.41s/batch, N=1.5059, E=0.8209,
KL=0.0514, wKL=1.0000]
Train E71: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5059, E=0.8209,
KL=0.0514, wKL=1.0000]
Train E71: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4986, E=0.8165,
KL=0.0522, wKL=1.0000]
Train E71: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4986, E=0.8165,
KL=0.0522, wKL=1.0000]
Train E71: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4642, E=0.8193,
KL=0.0522, wKL=1.0000]
Train E71: 60%|██████ | 15/25 [00:21<00:15, 1.58s/batch, N=1.4642, E=0.8193,
KL=0.0522, wKL=1.0000]
Train E71: 60%|██████ | 15/25 [00:22<00:15, 1.58s/batch, N=1.6071, E=0.8248,
KL=0.0518, wKL=1.0000]
Train E71: 64%|██████▍ | 16/25 [00:22<00:13, 1.51s/batch, N=1.6071, E=0.8248,
KL=0.0518, wKL=1.0000]
Train E71: 64%|██████▍ | 16/25 [00:24<00:13, 1.51s/batch, N=1.4109, E=0.8160,
KL=0.0515, wKL=1.0000]
Train E71: 68%|██████▊ | 17/25 [00:24<00:11, 1.50s/batch, N=1.4109, E=0.8160,
KL=0.0515, wKL=1.0000]
Train E71: 68%|██████▊ | 17/25 [00:25<00:11, 1.50s/batch, N=1.4252, E=0.8210,
KL=0.0519, wKL=1.0000]
Train E71: 72%|███████▏ | 18/25 [00:25<00:10, 1.46s/batch, N=1.4252, E=0.8210,
KL=0.0519, wKL=1.0000]
Train E71: 72%|███████▏ | 18/25 [00:27<00:10, 1.46s/batch, N=1.3896, E=0.8222,
KL=0.0506, wKL=1.0000]
Train E71: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.3896, E=0.8222,
KL=0.0506, wKL=1.0000]
Train E71: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.4720, E=0.8213,
KL=0.0513, wKL=1.0000]
Train E71: 80%|████████ | 20/25 [00:28<00:07, 1.42s/batch, N=1.4720, E=0.8213,
KL=0.0513, wKL=1.0000]
Train E71: 80%|████████ | 20/25 [00:29<00:07, 1.42s/batch, N=1.4525, E=0.8218,
KL=0.0499, wKL=1.0000]
Train E71: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.4525, E=0.8218,
KL=0.0499, wKL=1.0000]
Train E71: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4048, E=0.8229,
KL=0.0488, wKL=1.0000]
Train E71: 88%|████████▊ | 22/25 [00:31<00:04, 1.39s/batch, N=1.4048, E=0.8229,
KL=0.0488, wKL=1.0000]
Train E71: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.5306, E=0.8235,
KL=0.0520, wKL=1.0000]
Train E71: 92%|█████████▏| 23/25 [00:32<00:02, 1.38s/batch, N=1.5306, E=0.8235,
KL=0.0520, wKL=1.0000]
Train E71: 92%|█████████▏| 23/25 [00:34<00:02, 1.38s/batch, N=1.4583, E=0.8227,
KL=0.0504, wKL=1.0000]
Train E71: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4583, E=0.8227,
KL=0.0504, wKL=1.0000]
Train E71: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4900, E=0.8249,
KL=0.0509, wKL=1.0000]
Train E71: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4900, E=0.8249,
KL=0.0509, wKL=1.0000]
Train E71: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4900, E=0.8249,
KL=0.0509, wKL=1.0000]
2531.7s 220 [Epoch 071] Total: 2.3279 | N: 1.4806 | E: 0.8216 | KL(1.00×0.5):
0.0513
2566.6s 221 Train E72: 0%| | 0/25 [00:00<?, ?batch/s]
Train E72: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5392, E=0.8222, KL=0.0510,
wKL=1.0000]
Train E72: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.5392, E=0.8222,
KL=0.0510, wKL=1.0000]
Train E72: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.4445, E=0.8234,
KL=0.0521, wKL=1.0000]
Train E72: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.4445, E=0.8234,
KL=0.0521, wKL=1.0000]
Train E72: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.5931, E=0.8193,
KL=0.0512, wKL=1.0000]
Train E72: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.5931, E=0.8193,
KL=0.0512, wKL=1.0000]
Train E72: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.4898, E=0.8196,
KL=0.0515, wKL=1.0000]
Train E72: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.4898, E=0.8196,
KL=0.0515, wKL=1.0000]
Train E72: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4524, E=0.8223,
KL=0.0511, wKL=1.0000]
Train E72: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.4524, E=0.8223,
KL=0.0511, wKL=1.0000]
Train E72: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.5143, E=0.8240,
KL=0.0513, wKL=1.0000]
Train E72: 24%|██▍ | 6/25 [00:08<00:28, 1.49s/batch, N=1.5143, E=0.8240,
KL=0.0513, wKL=1.0000]
Train E72: 24%|██▍ | 6/25 [00:09<00:28, 1.49s/batch, N=1.3630, E=0.8152,
KL=0.0510, wKL=1.0000]
Train E72: 28%|██▊ | 7/25 [00:09<00:26, 1.46s/batch, N=1.3630, E=0.8152,
KL=0.0510, wKL=1.0000]
Train E72: 28%|██▊ | 7/25 [00:11<00:26, 1.46s/batch, N=1.5034, E=0.8159,
KL=0.0514, wKL=1.0000]
Train E72: 32%|███▏ | 8/25 [00:11<00:24, 1.43s/batch, N=1.5034, E=0.8159,
KL=0.0514, wKL=1.0000]
Train E72: 32%|███▏ | 8/25 [00:12<00:24, 1.43s/batch, N=1.4933, E=0.8216,
KL=0.0519, wKL=1.0000]
Train E72: 36%|███▌ | 9/25 [00:12<00:22, 1.41s/batch, N=1.4933, E=0.8216,
KL=0.0519, wKL=1.0000]
Train E72: 36%|███▌ | 9/25 [00:14<00:22, 1.41s/batch, N=1.4813, E=0.8274,
KL=0.0518, wKL=1.0000]
Train E72: 40%|████ | 10/25 [00:14<00:20, 1.39s/batch, N=1.4813, E=0.8274,
KL=0.0518, wKL=1.0000]
Train E72: 40%|████ | 10/25 [00:15<00:20, 1.39s/batch, N=1.4800, E=0.8232,
KL=0.0513, wKL=1.0000]
Train E72: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.4800, E=0.8232,
KL=0.0513, wKL=1.0000]
Train E72: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.4841, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E72: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4841, E=0.8212,
KL=0.0516, wKL=1.0000]
Train E72: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4525, E=0.8244,
KL=0.0521, wKL=1.0000]
Train E72: 52%|█████▏ | 13/25 [00:18<00:16, 1.41s/batch, N=1.4525, E=0.8244,
KL=0.0521, wKL=1.0000]
Train E72: 52%|█████▏ | 13/25 [00:19<00:16, 1.41s/batch, N=1.5348, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E72: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5348, E=0.8223,
KL=0.0546, wKL=1.0000]
Train E72: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4985, E=0.8216,
KL=0.0500, wKL=1.0000]
Train E72: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4985, E=0.8216,
KL=0.0500, wKL=1.0000]
Train E72: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.4263, E=0.8124,
KL=0.0503, wKL=1.0000]
Train E72: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.4263, E=0.8124,
KL=0.0503, wKL=1.0000]
Train E72: 64%|██████▍ | 16/25 [00:23<00:12, 1.40s/batch, N=1.5210, E=0.8230,
KL=0.0511, wKL=1.0000]
Train E72: 68%|██████▊ | 17/25 [00:23<00:11, 1.39s/batch, N=1.5210, E=0.8230,
KL=0.0511, wKL=1.0000]
Train E72: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.3812, E=0.8220,
KL=0.0502, wKL=1.0000]
Train E72: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.3812, E=0.8220,
KL=0.0502, wKL=1.0000]
Train E72: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.5435, E=0.8246,
KL=0.0517, wKL=1.0000]
Train E72: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.5435, E=0.8246,
KL=0.0517, wKL=1.0000]
Train E72: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4436, E=0.8201,
KL=0.0509, wKL=1.0000]
Train E72: 80%|████████ | 20/25 [00:28<00:07, 1.57s/batch, N=1.4436, E=0.8201,
KL=0.0509, wKL=1.0000]
Train E72: 80%|████████ | 20/25 [00:30<00:07, 1.57s/batch, N=1.4697, E=0.8236,
KL=0.0513, wKL=1.0000]
Train E72: 84%|████████▍ | 21/25 [00:30<00:06, 1.53s/batch, N=1.4697, E=0.8236,
KL=0.0513, wKL=1.0000]
Train E72: 84%|████████▍ | 21/25 [00:31<00:06, 1.53s/batch, N=1.4481, E=0.8239,
KL=0.0509, wKL=1.0000]
Train E72: 88%|████████▊ | 22/25 [00:31<00:04, 1.50s/batch, N=1.4481, E=0.8239,
KL=0.0509, wKL=1.0000]
Train E72: 88%|████████▊ | 22/25 [00:32<00:04, 1.50s/batch, N=1.5138, E=0.8246,
KL=0.0506, wKL=1.0000]
Train E72: 92%|█████████▏| 23/25 [00:32<00:02, 1.47s/batch, N=1.5138, E=0.8246,
KL=0.0506, wKL=1.0000]
Train E72: 92%|█████████▏| 23/25 [00:34<00:02, 1.47s/batch, N=1.4484, E=0.8187,
KL=0.0503, wKL=1.0000]
Train E72: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4484, E=0.8187,
KL=0.0503, wKL=1.0000]
Train E72: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.5191, E=0.8198,
KL=0.0500, wKL=1.0000]
Train E72: 100%|██████████| 25/25 [00:34<00:00, 1.20s/batch, N=1.5191, E=0.8198,
KL=0.0500, wKL=1.0000]
Train E72: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.5191, E=0.8198,
KL=0.0500, wKL=1.0000]
2566.6s 222 [Epoch 072] Total: 2.3278 | N: 1.4806 | E: 0.8215 | KL(1.00×0.5):
0.0513
2600.9s 223 Train E73: 0%| | 0/25 [00:00<?, ?batch/s]
Train E73: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5606, E=0.8234, KL=0.0511,
wKL=1.0000]
Train E73: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.5606, E=0.8234,
KL=0.0511, wKL=1.0000]
Train E73: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5031, E=0.8254,
KL=0.0506, wKL=1.0000]
Train E73: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5031, E=0.8254,
KL=0.0506, wKL=1.0000]
Train E73: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4293, E=0.8217,
KL=0.0494, wKL=1.0000]
Train E73: 12%|█▏ | 3/25 [00:04<00:31, 1.44s/batch, N=1.4293, E=0.8217,
KL=0.0494, wKL=1.0000]
Train E73: 12%|█▏ | 3/25 [00:05<00:31, 1.44s/batch, N=1.4637, E=0.8216,
KL=0.0514, wKL=1.0000]
Train E73: 16%|█▌ | 4/25 [00:05<00:30, 1.47s/batch, N=1.4637, E=0.8216,
KL=0.0514, wKL=1.0000]
Train E73: 16%|█▌ | 4/25 [00:07<00:30, 1.47s/batch, N=1.4619, E=0.8200,
KL=0.0503, wKL=1.0000]
Train E73: 20%|██ | 5/25 [00:07<00:28, 1.44s/batch, N=1.4619, E=0.8200,
KL=0.0503, wKL=1.0000]
Train E73: 20%|██ | 5/25 [00:08<00:28, 1.44s/batch, N=1.4228, E=0.8201,
KL=0.0505, wKL=1.0000]
Train E73: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.4228, E=0.8201,
KL=0.0505, wKL=1.0000]
Train E73: 24%|██▍ | 6/25 [00:09<00:26, 1.42s/batch, N=1.4939, E=0.8231,
KL=0.0539, wKL=1.0000]
Train E73: 28%|██▊ | 7/25 [00:09<00:25, 1.41s/batch, N=1.4939, E=0.8231,
KL=0.0539, wKL=1.0000]
Train E73: 28%|██▊ | 7/25 [00:11<00:25, 1.41s/batch, N=1.5489, E=0.8213,
KL=0.0510, wKL=1.0000]
Train E73: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.5489, E=0.8213,
KL=0.0510, wKL=1.0000]
Train E73: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4093, E=0.8269,
KL=0.0517, wKL=1.0000]
Train E73: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4093, E=0.8269,
KL=0.0517, wKL=1.0000]
Train E73: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.4586, E=0.8234,
KL=0.0511, wKL=1.0000]
Train E73: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4586, E=0.8234,
KL=0.0511, wKL=1.0000]
Train E73: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.5106, E=0.8188,
KL=0.0524, wKL=1.0000]
Train E73: 44%|████▍ | 11/25 [00:15<00:19, 1.41s/batch, N=1.5106, E=0.8188,
KL=0.0524, wKL=1.0000]
Train E73: 44%|████▍ | 11/25 [00:16<00:19, 1.41s/batch, N=1.5593, E=0.8169,
KL=0.0517, wKL=1.0000]
Train E73: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.5593, E=0.8169,
KL=0.0517, wKL=1.0000]
Train E73: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.5169, E=0.8190,
KL=0.0506, wKL=1.0000]
Train E73: 52%|█████▏ | 13/25 [00:18<00:16, 1.40s/batch, N=1.5169, E=0.8190,
KL=0.0506, wKL=1.0000]
Train E73: 52%|█████▏ | 13/25 [00:19<00:16, 1.40s/batch, N=1.4921, E=0.8224,
KL=0.0503, wKL=1.0000]
Train E73: 56%|█████▌ | 14/25 [00:19<00:15, 1.39s/batch, N=1.4921, E=0.8224,
KL=0.0503, wKL=1.0000]
Train E73: 56%|█████▌ | 14/25 [00:21<00:15, 1.39s/batch, N=1.4549, E=0.8210,
KL=0.0503, wKL=1.0000]
Train E73: 60%|██████ | 15/25 [00:21<00:13, 1.39s/batch, N=1.4549, E=0.8210,
KL=0.0503, wKL=1.0000]
Train E73: 60%|██████ | 15/25 [00:22<00:13, 1.39s/batch, N=1.5237, E=0.8199,
KL=0.0500, wKL=1.0000]
Train E73: 64%|██████▍ | 16/25 [00:22<00:12, 1.38s/batch, N=1.5237, E=0.8199,
KL=0.0500, wKL=1.0000]
Train E73: 64%|██████▍ | 16/25 [00:23<00:12, 1.38s/batch, N=1.4984, E=0.8209,
KL=0.0505, wKL=1.0000]
Train E73: 68%|██████▊ | 17/25 [00:23<00:11, 1.38s/batch, N=1.4984, E=0.8209,
KL=0.0505, wKL=1.0000]
Train E73: 68%|██████▊ | 17/25 [00:25<00:11, 1.38s/batch, N=1.4418, E=0.8245,
KL=0.0503, wKL=1.0000]
Train E73: 72%|███████▏ | 18/25 [00:25<00:09, 1.39s/batch, N=1.4418, E=0.8245,
KL=0.0503, wKL=1.0000]
Train E73: 72%|███████▏ | 18/25 [00:26<00:09, 1.39s/batch, N=1.4649, E=0.8229,
KL=0.0504, wKL=1.0000]
Train E73: 76%|███████▌ | 19/25 [00:26<00:08, 1.39s/batch, N=1.4649, E=0.8229,
KL=0.0504, wKL=1.0000]
Train E73: 76%|███████▌ | 19/25 [00:28<00:08, 1.39s/batch, N=1.4549, E=0.8197,
KL=0.0524, wKL=1.0000]
Train E73: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4549, E=0.8197,
KL=0.0524, wKL=1.0000]
Train E73: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.4323, E=0.8183,
KL=0.0504, wKL=1.0000]
Train E73: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4323, E=0.8183,
KL=0.0504, wKL=1.0000]
Train E73: 84%|████████▍ | 21/25 [00:30<00:05, 1.40s/batch, N=1.4617, E=0.8210,
KL=0.0512, wKL=1.0000]
Train E73: 88%|████████▊ | 22/25 [00:30<00:04, 1.39s/batch, N=1.4617, E=0.8210,
KL=0.0512, wKL=1.0000]
Train E73: 88%|████████▊ | 22/25 [00:32<00:04, 1.39s/batch, N=1.4627, E=0.8226,
KL=0.0499, wKL=1.0000]
Train E73: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4627, E=0.8226,
KL=0.0499, wKL=1.0000]
Train E73: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.4611, E=0.8196,
KL=0.0508, wKL=1.0000]
Train E73: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.4611, E=0.8196,
KL=0.0508, wKL=1.0000]
Train E73: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.5872, E=0.8274,
KL=0.0519, wKL=1.0000]
Train E73: 100%|██████████| 25/25 [00:34<00:00, 1.17s/batch, N=1.5872, E=0.8274,
KL=0.0519, wKL=1.0000]
Train E73: 100%|██████████| 25/25 [00:34<00:00, 1.37s/batch, N=1.5872, E=0.8274,
KL=0.0519, wKL=1.0000]
2600.9s 224 [Epoch 073] Total: 2.3275 | N: 1.4805 | E: 0.8215 | KL(1.00×0.5):
0.0510
2635.8s 225 Train E74: 0%| | 0/25 [00:00<?, ?batch/s]
Train E74: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4255, E=0.8204, KL=0.0494,
wKL=1.0000]
Train E74: 4%|▍ | 1/25 [00:01<00:33, 1.41s/batch, N=1.4255, E=0.8204,
KL=0.0494, wKL=1.0000]
Train E74: 4%|▍ | 1/25 [00:02<00:33, 1.41s/batch, N=1.4596, E=0.8215,
KL=0.0499, wKL=1.0000]
Train E74: 8%|▊ | 2/25 [00:02<00:33, 1.45s/batch, N=1.4596, E=0.8215,
KL=0.0499, wKL=1.0000]
Train E74: 8%|▊ | 2/25 [00:04<00:33, 1.45s/batch, N=1.4867, E=0.8251,
KL=0.0512, wKL=1.0000]
Train E74: 12%|█▏ | 3/25 [00:04<00:30, 1.41s/batch, N=1.4867, E=0.8251,
KL=0.0512, wKL=1.0000]
Train E74: 12%|█▏ | 3/25 [00:05<00:30, 1.41s/batch, N=1.4231, E=0.8200,
KL=0.0516, wKL=1.0000]
Train E74: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.4231, E=0.8200,
KL=0.0516, wKL=1.0000]
Train E74: 16%|█▌ | 4/25 [00:07<00:29, 1.39s/batch, N=1.4953, E=0.8210,
KL=0.0507, wKL=1.0000]
Train E74: 20%|██ | 5/25 [00:07<00:32, 1.61s/batch, N=1.4953, E=0.8210,
KL=0.0507, wKL=1.0000]
Train E74: 20%|██ | 5/25 [00:09<00:32, 1.61s/batch, N=1.4717, E=0.8190,
KL=0.0514, wKL=1.0000]
Train E74: 24%|██▍ | 6/25 [00:09<00:29, 1.54s/batch, N=1.4717, E=0.8190,
KL=0.0514, wKL=1.0000]
Train E74: 24%|██▍ | 6/25 [00:10<00:29, 1.54s/batch, N=1.4962, E=0.8263,
KL=0.0507, wKL=1.0000]
Train E74: 28%|██▊ | 7/25 [00:10<00:27, 1.51s/batch, N=1.4962, E=0.8263,
KL=0.0507, wKL=1.0000]
Train E74: 28%|██▊ | 7/25 [00:11<00:27, 1.51s/batch, N=1.4533, E=0.8196,
KL=0.0510, wKL=1.0000]
Train E74: 32%|███▏ | 8/25 [00:11<00:24, 1.46s/batch, N=1.4533, E=0.8196,
KL=0.0510, wKL=1.0000]
Train E74: 32%|███▏ | 8/25 [00:13<00:24, 1.46s/batch, N=1.5435, E=0.8206,
KL=0.0518, wKL=1.0000]
Train E74: 36%|███▌ | 9/25 [00:13<00:23, 1.44s/batch, N=1.5435, E=0.8206,
KL=0.0518, wKL=1.0000]
Train E74: 36%|███▌ | 9/25 [00:14<00:23, 1.44s/batch, N=1.4780, E=0.8170,
KL=0.0509, wKL=1.0000]
Train E74: 40%|████ | 10/25 [00:14<00:21, 1.41s/batch, N=1.4780, E=0.8170,
KL=0.0509, wKL=1.0000]
Train E74: 40%|████ | 10/25 [00:15<00:21, 1.41s/batch, N=1.4446, E=0.8214,
KL=0.0501, wKL=1.0000]
Train E74: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4446, E=0.8214,
KL=0.0501, wKL=1.0000]
Train E74: 44%|████▍ | 11/25 [00:17<00:19, 1.40s/batch, N=1.4319, E=0.8190,
KL=0.0500, wKL=1.0000]
Train E74: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4319, E=0.8190,
KL=0.0500, wKL=1.0000]
Train E74: 48%|████▊ | 12/25 [00:18<00:18, 1.39s/batch, N=1.5470, E=0.8186,
KL=0.0508, wKL=1.0000]
Train E74: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.5470, E=0.8186,
KL=0.0508, wKL=1.0000]
Train E74: 52%|█████▏ | 13/25 [00:20<00:16, 1.39s/batch, N=1.4169, E=0.8249,
KL=0.0500, wKL=1.0000]
Train E74: 56%|█████▌ | 14/25 [00:20<00:15, 1.41s/batch, N=1.4169, E=0.8249,
KL=0.0500, wKL=1.0000]
Train E74: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.5818, E=0.8164,
KL=0.0507, wKL=1.0000]
Train E74: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.5818, E=0.8164,
KL=0.0507, wKL=1.0000]
Train E74: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.3786, E=0.8226,
KL=0.0491, wKL=1.0000]
Train E74: 64%|██████▍ | 16/25 [00:22<00:12, 1.40s/batch, N=1.3786, E=0.8226,
KL=0.0491, wKL=1.0000]
Train E74: 64%|██████▍ | 16/25 [00:24<00:12, 1.40s/batch, N=1.4696, E=0.8205,
KL=0.0497, wKL=1.0000]
Train E74: 68%|██████▊ | 17/25 [00:24<00:11, 1.40s/batch, N=1.4696, E=0.8205,
KL=0.0497, wKL=1.0000]
Train E74: 68%|██████▊ | 17/25 [00:25<00:11, 1.40s/batch, N=1.5118, E=0.8256,
KL=0.0519, wKL=1.0000]
Train E74: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.5118, E=0.8256,
KL=0.0519, wKL=1.0000]
Train E74: 72%|███████▏ | 18/25 [00:27<00:09, 1.40s/batch, N=1.5294, E=0.8264,
KL=0.0526, wKL=1.0000]
Train E74: 76%|███████▌ | 19/25 [00:27<00:08, 1.40s/batch, N=1.5294, E=0.8264,
KL=0.0526, wKL=1.0000]
Train E74: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4659, E=0.8189,
KL=0.0497, wKL=1.0000]
Train E74: 80%|████████ | 20/25 [00:28<00:07, 1.41s/batch, N=1.4659, E=0.8189,
KL=0.0497, wKL=1.0000]
Train E74: 80%|████████ | 20/25 [00:29<00:07, 1.41s/batch, N=1.5021, E=0.8262,
KL=0.0498, wKL=1.0000]
Train E74: 84%|████████▍ | 21/25 [00:29<00:05, 1.41s/batch, N=1.5021, E=0.8262,
KL=0.0498, wKL=1.0000]
Train E74: 84%|████████▍ | 21/25 [00:31<00:05, 1.41s/batch, N=1.4769, E=0.8176,
KL=0.0514, wKL=1.0000]
Train E74: 88%|████████▊ | 22/25 [00:31<00:04, 1.41s/batch, N=1.4769, E=0.8176,
KL=0.0514, wKL=1.0000]
Train E74: 88%|████████▊ | 22/25 [00:32<00:04, 1.41s/batch, N=1.5423, E=0.8241,
KL=0.0504, wKL=1.0000]
Train E74: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5423, E=0.8241,
KL=0.0504, wKL=1.0000]
Train E74: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4966, E=0.8250,
KL=0.0527, wKL=1.0000]
Train E74: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4966, E=0.8250,
KL=0.0527, wKL=1.0000]
Train E74: 96%|█████████▌| 24/25 [00:34<00:01, 1.45s/batch, N=1.4734, E=0.8194,
KL=0.0516, wKL=1.0000]
Train E74: 100%|██████████| 25/25 [00:34<00:00, 1.19s/batch, N=1.4734, E=0.8194,
KL=0.0516, wKL=1.0000]
Train E74: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4734, E=0.8194,
KL=0.0516, wKL=1.0000]
2635.8s 226 [Epoch 074] Total: 2.3271 | N: 1.4802 | E: 0.8215 | KL(1.00×0.5):
0.0507
2670.4s 227 Train E75: 0%| | 0/25 [00:00<?, ?batch/s]
Train E75: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4984, E=0.8229, KL=0.0501,
wKL=1.0000]
Train E75: 4%|▍ | 1/25 [00:01<00:32, 1.34s/batch, N=1.4984, E=0.8229,
KL=0.0501, wKL=1.0000]
Train E75: 4%|▍ | 1/25 [00:02<00:32, 1.34s/batch, N=1.5112, E=0.8231,
KL=0.0519, wKL=1.0000]
Train E75: 8%|▊ | 2/25 [00:02<00:31, 1.36s/batch, N=1.5112, E=0.8231,
KL=0.0519, wKL=1.0000]
Train E75: 8%|▊ | 2/25 [00:04<00:31, 1.36s/batch, N=1.4278, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E75: 12%|█▏ | 3/25 [00:04<00:29, 1.35s/batch, N=1.4278, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E75: 12%|█▏ | 3/25 [00:05<00:29, 1.35s/batch, N=1.5089, E=0.8180,
KL=0.0519, wKL=1.0000]
Train E75: 16%|█▌ | 4/25 [00:05<00:29, 1.39s/batch, N=1.5089, E=0.8180,
KL=0.0519, wKL=1.0000]
Train E75: 16%|█▌ | 4/25 [00:06<00:29, 1.39s/batch, N=1.5336, E=0.8226,
KL=0.0509, wKL=1.0000]
Train E75: 20%|██ | 5/25 [00:06<00:27, 1.39s/batch, N=1.5336, E=0.8226,
KL=0.0509, wKL=1.0000]
Train E75: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.5005, E=0.8233,
KL=0.0508, wKL=1.0000]
Train E75: 24%|██▍ | 6/25 [00:08<00:26, 1.39s/batch, N=1.5005, E=0.8233,
KL=0.0508, wKL=1.0000]
Train E75: 24%|██▍ | 6/25 [00:09<00:26, 1.39s/batch, N=1.4836, E=0.8254,
KL=0.0513, wKL=1.0000]
Train E75: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4836, E=0.8254,
KL=0.0513, wKL=1.0000]
Train E75: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.3761, E=0.8194,
KL=0.0495, wKL=1.0000]
Train E75: 32%|███▏ | 8/25 [00:11<00:26, 1.58s/batch, N=1.3761, E=0.8194,
KL=0.0495, wKL=1.0000]
Train E75: 32%|███▏ | 8/25 [00:12<00:26, 1.58s/batch, N=1.4250, E=0.8231,
KL=0.0502, wKL=1.0000]
Train E75: 36%|███▌ | 9/25 [00:12<00:24, 1.51s/batch, N=1.4250, E=0.8231,
KL=0.0502, wKL=1.0000]
Train E75: 36%|███▌ | 9/25 [00:14<00:24, 1.51s/batch, N=1.6067, E=0.8184,
KL=0.0509, wKL=1.0000]
Train E75: 40%|████ | 10/25 [00:14<00:21, 1.46s/batch, N=1.6067, E=0.8184,
KL=0.0509, wKL=1.0000]
Train E75: 40%|████ | 10/25 [00:15<00:21, 1.46s/batch, N=1.4531, E=0.8261,
KL=0.0503, wKL=1.0000]
Train E75: 44%|████▍ | 11/25 [00:15<00:20, 1.44s/batch, N=1.4531, E=0.8261,
KL=0.0503, wKL=1.0000]
Train E75: 44%|████▍ | 11/25 [00:17<00:20, 1.44s/batch, N=1.4294, E=0.8223,
KL=0.0500, wKL=1.0000]
Train E75: 48%|████▊ | 12/25 [00:17<00:18, 1.43s/batch, N=1.4294, E=0.8223,
KL=0.0500, wKL=1.0000]
Train E75: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.5157, E=0.8268,
KL=0.0504, wKL=1.0000]
Train E75: 52%|█████▏ | 13/25 [00:18<00:16, 1.42s/batch, N=1.5157, E=0.8268,
KL=0.0504, wKL=1.0000]
Train E75: 52%|█████▏ | 13/25 [00:19<00:16, 1.42s/batch, N=1.4675, E=0.8186,
KL=0.0502, wKL=1.0000]
Train E75: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.4675, E=0.8186,
KL=0.0502, wKL=1.0000]
Train E75: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4542, E=0.8238,
KL=0.0522, wKL=1.0000]
Train E75: 60%|██████ | 15/25 [00:21<00:14, 1.40s/batch, N=1.4542, E=0.8238,
KL=0.0522, wKL=1.0000]
Train E75: 60%|██████ | 15/25 [00:22<00:14, 1.40s/batch, N=1.5485, E=0.8220,
KL=0.0505, wKL=1.0000]
Train E75: 64%|██████▍ | 16/25 [00:22<00:12, 1.39s/batch, N=1.5485, E=0.8220,
KL=0.0505, wKL=1.0000]
Train E75: 64%|██████▍ | 16/25 [00:24<00:12, 1.39s/batch, N=1.4422, E=0.8217,
KL=0.0500, wKL=1.0000]
Train E75: 68%|██████▊ | 17/25 [00:24<00:11, 1.39s/batch, N=1.4422, E=0.8217,
KL=0.0500, wKL=1.0000]
Train E75: 68%|██████▊ | 17/25 [00:25<00:11, 1.39s/batch, N=1.4407, E=0.8212,
KL=0.0504, wKL=1.0000]
Train E75: 72%|███████▏ | 18/25 [00:25<00:09, 1.40s/batch, N=1.4407, E=0.8212,
KL=0.0504, wKL=1.0000]
Train E75: 72%|███████▏ | 18/25 [00:26<00:09, 1.40s/batch, N=1.5007, E=0.8179,
KL=0.0536, wKL=1.0000]
Train E75: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.5007, E=0.8179,
KL=0.0536, wKL=1.0000]
Train E75: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.5921, E=0.8197,
KL=0.0515, wKL=1.0000]
Train E75: 80%|████████ | 20/25 [00:28<00:06, 1.39s/batch, N=1.5921, E=0.8197,
KL=0.0515, wKL=1.0000]
Train E75: 80%|████████ | 20/25 [00:29<00:06, 1.39s/batch, N=1.4277, E=0.8196,
KL=0.0496, wKL=1.0000]
Train E75: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.4277, E=0.8196,
KL=0.0496, wKL=1.0000]
Train E75: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.4748, E=0.8243,
KL=0.0501, wKL=1.0000]
Train E75: 88%|████████▊ | 22/25 [00:31<00:04, 1.43s/batch, N=1.4748, E=0.8243,
KL=0.0501, wKL=1.0000]
Train E75: 88%|████████▊ | 22/25 [00:32<00:04, 1.43s/batch, N=1.5231, E=0.8189,
KL=0.0505, wKL=1.0000]
Train E75: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.5231, E=0.8189,
KL=0.0505, wKL=1.0000]
Train E75: 92%|█████████▏| 23/25 [00:33<00:02, 1.41s/batch, N=1.3832, E=0.8174,
KL=0.0500, wKL=1.0000]
Train E75: 96%|█████████▌| 24/25 [00:33<00:01, 1.40s/batch, N=1.3832, E=0.8174,
KL=0.0500, wKL=1.0000]
Train E75: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4909, E=0.8258,
KL=0.0498, wKL=1.0000]
Train E75: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4909, E=0.8258,
KL=0.0498, wKL=1.0000]
Train E75: 100%|██████████| 25/25 [00:34<00:00, 1.38s/batch, N=1.4909, E=0.8258,
KL=0.0498, wKL=1.0000]
2670.4s 228 [Epoch 075] Total: 2.3274 | N: 1.4804 | E: 0.8217 | KL(1.00×0.5):
0.0507
2705.2s 229 Train E76: 0%| | 0/25 [00:00<?, ?batch/s]
Train E76: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5499, E=0.8181, KL=0.0509,
wKL=1.0000]
Train E76: 4%|▍ | 1/25 [00:01<00:32, 1.37s/batch, N=1.5499, E=0.8181,
KL=0.0509, wKL=1.0000]
Train E76: 4%|▍ | 1/25 [00:02<00:32, 1.37s/batch, N=1.4101, E=0.8147,
KL=0.0522, wKL=1.0000]
Train E76: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4101, E=0.8147,
KL=0.0522, wKL=1.0000]
Train E76: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.3817, E=0.8211,
KL=0.0489, wKL=1.0000]
Train E76: 12%|█▏ | 3/25 [00:04<00:30, 1.38s/batch, N=1.3817, E=0.8211,
KL=0.0489, wKL=1.0000]
Train E76: 12%|█▏ | 3/25 [00:05<00:30, 1.38s/batch, N=1.5799, E=0.8213,
KL=0.0503, wKL=1.0000]
Train E76: 16%|█▌ | 4/25 [00:05<00:28, 1.38s/batch, N=1.5799, E=0.8213,
KL=0.0503, wKL=1.0000]
Train E76: 16%|█▌ | 4/25 [00:06<00:28, 1.38s/batch, N=1.4443, E=0.8210,
KL=0.0489, wKL=1.0000]
Train E76: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.4443, E=0.8210,
KL=0.0489, wKL=1.0000]
Train E76: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4719, E=0.8234,
KL=0.0490, wKL=1.0000]
Train E76: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4719, E=0.8234,
KL=0.0490, wKL=1.0000]
Train E76: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.4775, E=0.8224,
KL=0.0498, wKL=1.0000]
Train E76: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.4775, E=0.8224,
KL=0.0498, wKL=1.0000]
Train E76: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4095, E=0.8186,
KL=0.0503, wKL=1.0000]
Train E76: 32%|███▏ | 8/25 [00:11<00:24, 1.41s/batch, N=1.4095, E=0.8186,
KL=0.0503, wKL=1.0000]
Train E76: 32%|███▏ | 8/25 [00:12<00:24, 1.41s/batch, N=1.4750, E=0.8236,
KL=0.0497, wKL=1.0000]
Train E76: 36%|███▌ | 9/25 [00:12<00:22, 1.42s/batch, N=1.4750, E=0.8236,
KL=0.0497, wKL=1.0000]
Train E76: 36%|███▌ | 9/25 [00:13<00:22, 1.42s/batch, N=1.5015, E=0.8166,
KL=0.0512, wKL=1.0000]
Train E76: 40%|████ | 10/25 [00:13<00:21, 1.40s/batch, N=1.5015, E=0.8166,
KL=0.0512, wKL=1.0000]
Train E76: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.4204, E=0.8121,
KL=0.0505, wKL=1.0000]
Train E76: 44%|████▍ | 11/25 [00:15<00:22, 1.60s/batch, N=1.4204, E=0.8121,
KL=0.0505, wKL=1.0000]
Train E76: 44%|████▍ | 11/25 [00:17<00:22, 1.60s/batch, N=1.5312, E=0.8230,
KL=0.0522, wKL=1.0000]
Train E76: 48%|████▊ | 12/25 [00:17<00:19, 1.52s/batch, N=1.5312, E=0.8230,
KL=0.0522, wKL=1.0000]
Train E76: 48%|████▊ | 12/25 [00:18<00:19, 1.52s/batch, N=1.4733, E=0.8218,
KL=0.0507, wKL=1.0000]
Train E76: 52%|█████▏ | 13/25 [00:18<00:17, 1.48s/batch, N=1.4733, E=0.8218,
KL=0.0507, wKL=1.0000]
Train E76: 52%|█████▏ | 13/25 [00:20<00:17, 1.48s/batch, N=1.4294, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E76: 56%|█████▌ | 14/25 [00:20<00:16, 1.47s/batch, N=1.4294, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E76: 56%|█████▌ | 14/25 [00:21<00:16, 1.47s/batch, N=1.4731, E=0.8239,
KL=0.0492, wKL=1.0000]
Train E76: 60%|██████ | 15/25 [00:21<00:14, 1.45s/batch, N=1.4731, E=0.8239,
KL=0.0492, wKL=1.0000]
Train E76: 60%|██████ | 15/25 [00:22<00:14, 1.45s/batch, N=1.5662, E=0.8267,
KL=0.0522, wKL=1.0000]
Train E76: 64%|██████▍ | 16/25 [00:22<00:12, 1.43s/batch, N=1.5662, E=0.8267,
KL=0.0522, wKL=1.0000]
Train E76: 64%|██████▍ | 16/25 [00:24<00:12, 1.43s/batch, N=1.4209, E=0.8257,
KL=0.0494, wKL=1.0000]
Train E76: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.4209, E=0.8257,
KL=0.0494, wKL=1.0000]
Train E76: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4601, E=0.8292,
KL=0.0495, wKL=1.0000]
Train E76: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4601, E=0.8292,
KL=0.0495, wKL=1.0000]
Train E76: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5706, E=0.8185,
KL=0.0517, wKL=1.0000]
Train E76: 76%|███████▌ | 19/25 [00:27<00:08, 1.44s/batch, N=1.5706, E=0.8185,
KL=0.0517, wKL=1.0000]
Train E76: 76%|███████▌ | 19/25 [00:28<00:08, 1.44s/batch, N=1.5314, E=0.8275,
KL=0.0500, wKL=1.0000]
Train E76: 80%|████████ | 20/25 [00:28<00:07, 1.45s/batch, N=1.5314, E=0.8275,
KL=0.0500, wKL=1.0000]
Train E76: 80%|████████ | 20/25 [00:30<00:07, 1.45s/batch, N=1.4314, E=0.8211,
KL=0.0508, wKL=1.0000]
Train E76: 84%|████████▍ | 21/25 [00:30<00:05, 1.43s/batch, N=1.4314, E=0.8211,
KL=0.0508, wKL=1.0000]
Train E76: 84%|████████▍ | 21/25 [00:31<00:05, 1.43s/batch, N=1.5584, E=0.8225,
KL=0.0524, wKL=1.0000]
Train E76: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.5584, E=0.8225,
KL=0.0524, wKL=1.0000]
Train E76: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.4465, E=0.8243,
KL=0.0499, wKL=1.0000]
Train E76: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4465, E=0.8243,
KL=0.0499, wKL=1.0000]
Train E76: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.5302, E=0.8239,
KL=0.0511, wKL=1.0000]
Train E76: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.5302, E=0.8239,
KL=0.0511, wKL=1.0000]
Train E76: 96%|█████████▌| 24/25 [00:34<00:01, 1.39s/batch, N=1.4286, E=0.8236,
KL=0.0516, wKL=1.0000]
Train E76: 100%|██████████| 25/25 [00:34<00:00, 1.15s/batch, N=1.4286, E=0.8236,
KL=0.0516, wKL=1.0000]
Train E76: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4286, E=0.8236,
KL=0.0516, wKL=1.0000]
2705.2s 230 [Epoch 076] Total: 2.3271 | N: 1.4801 | E: 0.8218 | KL(1.00×0.5):
0.0505
2740.2s 231 Train E77: 0%| | 0/25 [00:00<?, ?batch/s]
Train E77: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4402, E=0.8251, KL=0.0501,
wKL=1.0000]
Train E77: 4%|▍ | 1/25 [00:01<00:33, 1.38s/batch, N=1.4402, E=0.8251,
KL=0.0501, wKL=1.0000]
Train E77: 4%|▍ | 1/25 [00:02<00:33, 1.38s/batch, N=1.4384, E=0.8166,
KL=0.0504, wKL=1.0000]
Train E77: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4384, E=0.8166,
KL=0.0504, wKL=1.0000]
Train E77: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.4111, E=0.8184,
KL=0.0512, wKL=1.0000]
Train E77: 12%|█▏ | 3/25 [00:04<00:30, 1.37s/batch, N=1.4111, E=0.8184,
KL=0.0512, wKL=1.0000]
Train E77: 12%|█▏ | 3/25 [00:05<00:30, 1.37s/batch, N=1.5541, E=0.8270,
KL=0.0502, wKL=1.0000]
Train E77: 16%|█▌ | 4/25 [00:05<00:29, 1.38s/batch, N=1.5541, E=0.8270,
KL=0.0502, wKL=1.0000]
Train E77: 16%|█▌ | 4/25 [00:06<00:29, 1.38s/batch, N=1.5219, E=0.8230,
KL=0.0510, wKL=1.0000]
Train E77: 20%|██ | 5/25 [00:06<00:27, 1.38s/batch, N=1.5219, E=0.8230,
KL=0.0510, wKL=1.0000]
Train E77: 20%|██ | 5/25 [00:08<00:27, 1.38s/batch, N=1.4935, E=0.8219,
KL=0.0499, wKL=1.0000]
Train E77: 24%|██▍ | 6/25 [00:08<00:26, 1.42s/batch, N=1.4935, E=0.8219,
KL=0.0499, wKL=1.0000]
Train E77: 24%|██▍ | 6/25 [00:09<00:26, 1.42s/batch, N=1.4855, E=0.8237,
KL=0.0501, wKL=1.0000]
Train E77: 28%|██▊ | 7/25 [00:09<00:25, 1.40s/batch, N=1.4855, E=0.8237,
KL=0.0501, wKL=1.0000]
Train E77: 28%|██▊ | 7/25 [00:11<00:25, 1.40s/batch, N=1.4879, E=0.8198,
KL=0.0498, wKL=1.0000]
Train E77: 32%|███▏ | 8/25 [00:11<00:23, 1.40s/batch, N=1.4879, E=0.8198,
KL=0.0498, wKL=1.0000]
Train E77: 32%|███▏ | 8/25 [00:12<00:23, 1.40s/batch, N=1.4957, E=0.8204,
KL=0.0507, wKL=1.0000]
Train E77: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4957, E=0.8204,
KL=0.0507, wKL=1.0000]
Train E77: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.4915, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E77: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.4915, E=0.8226,
KL=0.0502, wKL=1.0000]
Train E77: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4456, E=0.8176,
KL=0.0496, wKL=1.0000]
Train E77: 44%|████▍ | 11/25 [00:15<00:19, 1.40s/batch, N=1.4456, E=0.8176,
KL=0.0496, wKL=1.0000]
Train E77: 44%|████▍ | 11/25 [00:16<00:19, 1.40s/batch, N=1.4377, E=0.8213,
KL=0.0509, wKL=1.0000]
Train E77: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4377, E=0.8213,
KL=0.0509, wKL=1.0000]
Train E77: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4169, E=0.8195,
KL=0.0499, wKL=1.0000]
Train E77: 52%|█████▏ | 13/25 [00:18<00:16, 1.39s/batch, N=1.4169, E=0.8195,
KL=0.0499, wKL=1.0000]
Train E77: 52%|█████▏ | 13/25 [00:20<00:16, 1.39s/batch, N=1.5260, E=0.8213,
KL=0.0511, wKL=1.0000]
Train E77: 56%|█████▌ | 14/25 [00:20<00:17, 1.58s/batch, N=1.5260, E=0.8213,
KL=0.0511, wKL=1.0000]
Train E77: 56%|█████▌ | 14/25 [00:21<00:17, 1.58s/batch, N=1.5324, E=0.8215,
KL=0.0511, wKL=1.0000]
Train E77: 60%|██████ | 15/25 [00:21<00:15, 1.52s/batch, N=1.5324, E=0.8215,
KL=0.0511, wKL=1.0000]
Train E77: 60%|██████ | 15/25 [00:22<00:15, 1.52s/batch, N=1.4643, E=0.8220,
KL=0.0504, wKL=1.0000]
Train E77: 64%|██████▍ | 16/25 [00:22<00:13, 1.47s/batch, N=1.4643, E=0.8220,
KL=0.0504, wKL=1.0000]
Train E77: 64%|██████▍ | 16/25 [00:24<00:13, 1.47s/batch, N=1.5139, E=0.8208,
KL=0.0509, wKL=1.0000]
Train E77: 68%|██████▊ | 17/25 [00:24<00:12, 1.54s/batch, N=1.5139, E=0.8208,
KL=0.0509, wKL=1.0000]
Train E77: 68%|██████▊ | 17/25 [00:26<00:12, 1.54s/batch, N=1.4211, E=0.8217,
KL=0.0515, wKL=1.0000]
Train E77: 72%|███████▏ | 18/25 [00:26<00:10, 1.52s/batch, N=1.4211, E=0.8217,
KL=0.0515, wKL=1.0000]
Train E77: 72%|███████▏ | 18/25 [00:27<00:10, 1.52s/batch, N=1.5577, E=0.8197,
KL=0.0503, wKL=1.0000]
Train E77: 76%|███████▌ | 19/25 [00:27<00:08, 1.47s/batch, N=1.5577, E=0.8197,
KL=0.0503, wKL=1.0000]
Train E77: 76%|███████▌ | 19/25 [00:28<00:08, 1.47s/batch, N=1.4170, E=0.8211,
KL=0.0517, wKL=1.0000]
Train E77: 80%|████████ | 20/25 [00:28<00:07, 1.46s/batch, N=1.4170, E=0.8211,
KL=0.0517, wKL=1.0000]
Train E77: 80%|████████ | 20/25 [00:30<00:07, 1.46s/batch, N=1.4397, E=0.8257,
KL=0.0501, wKL=1.0000]
Train E77: 84%|████████▍ | 21/25 [00:30<00:05, 1.45s/batch, N=1.4397, E=0.8257,
KL=0.0501, wKL=1.0000]
Train E77: 84%|████████▍ | 21/25 [00:31<00:05, 1.45s/batch, N=1.5025, E=0.8231,
KL=0.0496, wKL=1.0000]
Train E77: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.5025, E=0.8231,
KL=0.0496, wKL=1.0000]
Train E77: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.5516, E=0.8212,
KL=0.0503, wKL=1.0000]
Train E77: 92%|█████████▏| 23/25 [00:32<00:02, 1.40s/batch, N=1.5516, E=0.8212,
KL=0.0503, wKL=1.0000]
Train E77: 92%|█████████▏| 23/25 [00:34<00:02, 1.40s/batch, N=1.4884, E=0.8194,
KL=0.0519, wKL=1.0000]
Train E77: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4884, E=0.8194,
KL=0.0519, wKL=1.0000]
Train E77: 96%|█████████▌| 24/25 [00:34<00:01, 1.40s/batch, N=1.4567, E=0.8260,
KL=0.0481, wKL=1.0000]
Train E77: 100%|██████████| 25/25 [00:34<00:00, 1.16s/batch, N=1.4567, E=0.8260,
KL=0.0481, wKL=1.0000]
Train E77: 100%|██████████| 25/25 [00:34<00:00, 1.40s/batch, N=1.4567, E=0.8260,
KL=0.0481, wKL=1.0000]
2740.2s 232 [Epoch 077] Total: 2.3270 | N: 1.4802 | E: 0.8215 | KL(1.00×0.5):
0.0505
2774.9s 233 Train E78: 0%| | 0/25 [00:00<?, ?batch/s]
Train E78: 0%| | 0/25 [00:01<?, ?batch/s, N=1.5463, E=0.8264, KL=0.0496,
wKL=1.0000]
Train E78: 4%|▍ | 1/25 [00:01<00:32, 1.35s/batch, N=1.5463, E=0.8264,
KL=0.0496, wKL=1.0000]
Train E78: 4%|▍ | 1/25 [00:02<00:32, 1.35s/batch, N=1.4583, E=0.8186,
KL=0.0511, wKL=1.0000]
Train E78: 8%|▊ | 2/25 [00:02<00:31, 1.38s/batch, N=1.4583, E=0.8186,
KL=0.0511, wKL=1.0000]
Train E78: 8%|▊ | 2/25 [00:04<00:31, 1.38s/batch, N=1.5744, E=0.8274,
KL=0.0504, wKL=1.0000]
Train E78: 12%|█▏ | 3/25 [00:04<00:30, 1.36s/batch, N=1.5744, E=0.8274,
KL=0.0504, wKL=1.0000]
Train E78: 12%|█▏ | 3/25 [00:05<00:30, 1.36s/batch, N=1.4490, E=0.8217,
KL=0.0498, wKL=1.0000]
Train E78: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4490, E=0.8217,
KL=0.0498, wKL=1.0000]
Train E78: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.5076, E=0.8250,
KL=0.0500, wKL=1.0000]
Train E78: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.5076, E=0.8250,
KL=0.0500, wKL=1.0000]
Train E78: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4512, E=0.8222,
KL=0.0518, wKL=1.0000]
Train E78: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4512, E=0.8222,
KL=0.0518, wKL=1.0000]
Train E78: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5049, E=0.8197,
KL=0.0497, wKL=1.0000]
Train E78: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5049, E=0.8197,
KL=0.0497, wKL=1.0000]
Train E78: 28%|██▊ | 7/25 [00:11<00:24, 1.38s/batch, N=1.4420, E=0.8207,
KL=0.0530, wKL=1.0000]
Train E78: 32%|███▏ | 8/25 [00:11<00:23, 1.39s/batch, N=1.4420, E=0.8207,
KL=0.0530, wKL=1.0000]
Train E78: 32%|███▏ | 8/25 [00:12<00:23, 1.39s/batch, N=1.4918, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 36%|███▌ | 9/25 [00:12<00:22, 1.40s/batch, N=1.4918, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 36%|███▌ | 9/25 [00:13<00:22, 1.40s/batch, N=1.5346, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 40%|████ | 10/25 [00:13<00:20, 1.40s/batch, N=1.5346, E=0.8206,
KL=0.0505, wKL=1.0000]
Train E78: 40%|████ | 10/25 [00:15<00:20, 1.40s/batch, N=1.4990, E=0.8259,
KL=0.0492, wKL=1.0000]
Train E78: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.4990, E=0.8259,
KL=0.0492, wKL=1.0000]
Train E78: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.5064, E=0.8189,
KL=0.0500, wKL=1.0000]
Train E78: 48%|████▊ | 12/25 [00:16<00:18, 1.39s/batch, N=1.5064, E=0.8189,
KL=0.0500, wKL=1.0000]
Train E78: 48%|████▊ | 12/25 [00:17<00:18, 1.39s/batch, N=1.4709, E=0.8208,
KL=0.0512, wKL=1.0000]
Train E78: 52%|█████▏ | 13/25 [00:17<00:16, 1.38s/batch, N=1.4709, E=0.8208,
KL=0.0512, wKL=1.0000]
Train E78: 52%|█████▏ | 13/25 [00:19<00:16, 1.38s/batch, N=1.5565, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E78: 56%|█████▌ | 14/25 [00:19<00:15, 1.38s/batch, N=1.5565, E=0.8188,
KL=0.0534, wKL=1.0000]
Train E78: 56%|█████▌ | 14/25 [00:20<00:15, 1.38s/batch, N=1.4205, E=0.8260,
KL=0.0507, wKL=1.0000]
Train E78: 60%|██████ | 15/25 [00:20<00:14, 1.42s/batch, N=1.4205, E=0.8260,
KL=0.0507, wKL=1.0000]
Train E78: 60%|██████ | 15/25 [00:22<00:14, 1.42s/batch, N=1.4347, E=0.8199,
KL=0.0495, wKL=1.0000]
Train E78: 64%|██████▍ | 16/25 [00:22<00:13, 1.45s/batch, N=1.4347, E=0.8199,
KL=0.0495, wKL=1.0000]
Train E78: 64%|██████▍ | 16/25 [00:23<00:13, 1.45s/batch, N=1.4745, E=0.8217,
KL=0.0490, wKL=1.0000]
Train E78: 68%|██████▊ | 17/25 [00:23<00:11, 1.44s/batch, N=1.4745, E=0.8217,
KL=0.0490, wKL=1.0000]
Train E78: 68%|██████▊ | 17/25 [00:25<00:11, 1.44s/batch, N=1.4519, E=0.8218,
KL=0.0496, wKL=1.0000]
Train E78: 72%|███████▏ | 18/25 [00:25<00:09, 1.42s/batch, N=1.4519, E=0.8218,
KL=0.0496, wKL=1.0000]
Train E78: 72%|███████▏ | 18/25 [00:27<00:09, 1.42s/batch, N=1.5019, E=0.8206,
KL=0.0499, wKL=1.0000]
Train E78: 76%|███████▌ | 19/25 [00:27<00:09, 1.60s/batch, N=1.5019, E=0.8206,
KL=0.0499, wKL=1.0000]
Train E78: 76%|███████▌ | 19/25 [00:28<00:09, 1.60s/batch, N=1.4589, E=0.8231,
KL=0.0501, wKL=1.0000]
Train E78: 80%|████████ | 20/25 [00:28<00:07, 1.53s/batch, N=1.4589, E=0.8231,
KL=0.0501, wKL=1.0000]
Train E78: 80%|████████ | 20/25 [00:29<00:07, 1.53s/batch, N=1.4548, E=0.8233,
KL=0.0495, wKL=1.0000]
Train E78: 84%|████████▍ | 21/25 [00:29<00:05, 1.48s/batch, N=1.4548, E=0.8233,
KL=0.0495, wKL=1.0000]
Train E78: 84%|████████▍ | 21/25 [00:31<00:05, 1.48s/batch, N=1.4272, E=0.8163,
KL=0.0499, wKL=1.0000]
Train E78: 88%|████████▊ | 22/25 [00:31<00:04, 1.46s/batch, N=1.4272, E=0.8163,
KL=0.0499, wKL=1.0000]
Train E78: 88%|████████▊ | 22/25 [00:32<00:04, 1.46s/batch, N=1.4642, E=0.8257,
KL=0.0502, wKL=1.0000]
Train E78: 92%|█████████▏| 23/25 [00:32<00:02, 1.44s/batch, N=1.4642, E=0.8257,
KL=0.0502, wKL=1.0000]
Train E78: 92%|█████████▏| 23/25 [00:34<00:02, 1.44s/batch, N=1.4457, E=0.8170,
KL=0.0506, wKL=1.0000]
Train E78: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4457, E=0.8170,
KL=0.0506, wKL=1.0000]
Train E78: 96%|█████████▌| 24/25 [00:34<00:01, 1.42s/batch, N=1.4779, E=0.8169,
KL=0.0514, wKL=1.0000]
Train E78: 100%|██████████| 25/25 [00:34<00:00, 1.18s/batch, N=1.4779, E=0.8169,
KL=0.0514, wKL=1.0000]
Train E78: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.4779, E=0.8169,
KL=0.0514, wKL=1.0000]
2774.9s 234 [Epoch 078] Total: 2.3271 | N: 1.4803 | E: 0.8217 | KL(1.00×0.5):
0.0504
2809.7s 235 Train E79: 0%| | 0/25 [00:00<?, ?batch/s]
Train E79: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4136, E=0.8232, KL=0.0514,
wKL=1.0000]
Train E79: 4%|▍ | 1/25 [00:01<00:32, 1.36s/batch, N=1.4136, E=0.8232,
KL=0.0514, wKL=1.0000]
Train E79: 4%|▍ | 1/25 [00:02<00:32, 1.36s/batch, N=1.5013, E=0.8214,
KL=0.0517, wKL=1.0000]
Train E79: 8%|▊ | 2/25 [00:02<00:31, 1.35s/batch, N=1.5013, E=0.8214,
KL=0.0517, wKL=1.0000]
Train E79: 8%|▊ | 2/25 [00:04<00:31, 1.35s/batch, N=1.4968, E=0.8208,
KL=0.0500, wKL=1.0000]
Train E79: 12%|█▏ | 3/25 [00:04<00:29, 1.36s/batch, N=1.4968, E=0.8208,
KL=0.0500, wKL=1.0000]
Train E79: 12%|█▏ | 3/25 [00:05<00:29, 1.36s/batch, N=1.4443, E=0.8207,
KL=0.0495, wKL=1.0000]
Train E79: 16%|█▌ | 4/25 [00:05<00:28, 1.36s/batch, N=1.4443, E=0.8207,
KL=0.0495, wKL=1.0000]
Train E79: 16%|█▌ | 4/25 [00:06<00:28, 1.36s/batch, N=1.3959, E=0.8221,
KL=0.0484, wKL=1.0000]
Train E79: 20%|██ | 5/25 [00:06<00:27, 1.37s/batch, N=1.3959, E=0.8221,
KL=0.0484, wKL=1.0000]
Train E79: 20%|██ | 5/25 [00:08<00:27, 1.37s/batch, N=1.4460, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E79: 24%|██▍ | 6/25 [00:08<00:26, 1.38s/batch, N=1.4460, E=0.8205,
KL=0.0503, wKL=1.0000]
Train E79: 24%|██▍ | 6/25 [00:09<00:26, 1.38s/batch, N=1.5543, E=0.8239,
KL=0.0501, wKL=1.0000]
Train E79: 28%|██▊ | 7/25 [00:09<00:24, 1.38s/batch, N=1.5543, E=0.8239,
KL=0.0501, wKL=1.0000]
Train E79: 28%|██▊ | 7/25 [00:10<00:24, 1.38s/batch, N=1.5059, E=0.8186,
KL=0.0489, wKL=1.0000]
Train E79: 32%|███▏ | 8/25 [00:10<00:23, 1.38s/batch, N=1.5059, E=0.8186,
KL=0.0489, wKL=1.0000]
Train E79: 32%|███▏ | 8/25 [00:12<00:23, 1.38s/batch, N=1.4985, E=0.8177,
KL=0.0497, wKL=1.0000]
Train E79: 36%|███▌ | 9/25 [00:12<00:21, 1.37s/batch, N=1.4985, E=0.8177,
KL=0.0497, wKL=1.0000]
Train E79: 36%|███▌ | 9/25 [00:13<00:21, 1.37s/batch, N=1.4382, E=0.8216,
KL=0.0495, wKL=1.0000]
Train E79: 40%|████ | 10/25 [00:13<00:20, 1.38s/batch, N=1.4382, E=0.8216,
KL=0.0495, wKL=1.0000]
Train E79: 40%|████ | 10/25 [00:15<00:20, 1.38s/batch, N=1.5302, E=0.8216,
KL=0.0527, wKL=1.0000]
Train E79: 44%|████▍ | 11/25 [00:15<00:19, 1.39s/batch, N=1.5302, E=0.8216,
KL=0.0527, wKL=1.0000]
Train E79: 44%|████▍ | 11/25 [00:16<00:19, 1.39s/batch, N=1.4568, E=0.8188,
KL=0.0506, wKL=1.0000]
Train E79: 48%|████▊ | 12/25 [00:16<00:18, 1.40s/batch, N=1.4568, E=0.8188,
KL=0.0506, wKL=1.0000]
Train E79: 48%|████▊ | 12/25 [00:18<00:18, 1.40s/batch, N=1.4980, E=0.8250,
KL=0.0511, wKL=1.0000]
Train E79: 52%|█████▏ | 13/25 [00:18<00:17, 1.45s/batch, N=1.4980, E=0.8250,
KL=0.0511, wKL=1.0000]
Train E79: 52%|█████▏ | 13/25 [00:19<00:17, 1.45s/batch, N=1.4282, E=0.8166,
KL=0.0507, wKL=1.0000]
Train E79: 56%|█████▌ | 14/25 [00:19<00:16, 1.48s/batch, N=1.4282, E=0.8166,
KL=0.0507, wKL=1.0000]
Train E79: 56%|█████▌ | 14/25 [00:21<00:16, 1.48s/batch, N=1.5073, E=0.8244,
KL=0.0500, wKL=1.0000]
Train E79: 60%|██████ | 15/25 [00:21<00:14, 1.46s/batch, N=1.5073, E=0.8244,
KL=0.0500, wKL=1.0000]
Train E79: 60%|██████ | 15/25 [00:22<00:14, 1.46s/batch, N=1.5327, E=0.8258,
KL=0.0499, wKL=1.0000]
Train E79: 64%|██████▍ | 16/25 [00:22<00:12, 1.44s/batch, N=1.5327, E=0.8258,
KL=0.0499, wKL=1.0000]
Train E79: 64%|██████▍ | 16/25 [00:23<00:12, 1.44s/batch, N=1.5054, E=0.8209,
KL=0.0494, wKL=1.0000]
Train E79: 68%|██████▊ | 17/25 [00:23<00:11, 1.43s/batch, N=1.5054, E=0.8209,
KL=0.0494, wKL=1.0000]
Train E79: 68%|██████▊ | 17/25 [00:25<00:11, 1.43s/batch, N=1.4753, E=0.8193,
KL=0.0501, wKL=1.0000]
Train E79: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4753, E=0.8193,
KL=0.0501, wKL=1.0000]
Train E79: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.4056, E=0.8216,
KL=0.0496, wKL=1.0000]
Train E79: 76%|███████▌ | 19/25 [00:26<00:08, 1.41s/batch, N=1.4056, E=0.8216,
KL=0.0496, wKL=1.0000]
Train E79: 76%|███████▌ | 19/25 [00:28<00:08, 1.41s/batch, N=1.4801, E=0.8217,
KL=0.0519, wKL=1.0000]
Train E79: 80%|████████ | 20/25 [00:28<00:06, 1.40s/batch, N=1.4801, E=0.8217,
KL=0.0519, wKL=1.0000]
Train E79: 80%|████████ | 20/25 [00:29<00:06, 1.40s/batch, N=1.5106, E=0.8235,
KL=0.0512, wKL=1.0000]
Train E79: 84%|████████▍ | 21/25 [00:29<00:05, 1.39s/batch, N=1.5106, E=0.8235,
KL=0.0512, wKL=1.0000]
Train E79: 84%|████████▍ | 21/25 [00:31<00:05, 1.39s/batch, N=1.5250, E=0.8225,
KL=0.0507, wKL=1.0000]
Train E79: 88%|████████▊ | 22/25 [00:31<00:04, 1.57s/batch, N=1.5250, E=0.8225,
KL=0.0507, wKL=1.0000]
Train E79: 88%|████████▊ | 22/25 [00:32<00:04, 1.57s/batch, N=1.5053, E=0.8187,
KL=0.0509, wKL=1.0000]
Train E79: 92%|█████████▏| 23/25 [00:32<00:03, 1.51s/batch, N=1.5053, E=0.8187,
KL=0.0509, wKL=1.0000]
Train E79: 92%|█████████▏| 23/25 [00:34<00:03, 1.51s/batch, N=1.4286, E=0.8237,
KL=0.0495, wKL=1.0000]
Train E79: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.4286, E=0.8237,
KL=0.0495, wKL=1.0000]
Train E79: 96%|█████████▌| 24/25 [00:34<00:01, 1.48s/batch, N=1.5679, E=0.8251,
KL=0.0498, wKL=1.0000]
Train E79: 100%|██████████| 25/25 [00:34<00:00, 1.22s/batch, N=1.5679, E=0.8251,
KL=0.0498, wKL=1.0000]
Train E79: 100%|██████████| 25/25 [00:34<00:00, 1.39s/batch, N=1.5679, E=0.8251,
KL=0.0498, wKL=1.0000]
2809.7s 236 [Epoch 079] Total: 2.3267 | N: 1.4800 | E: 0.8215 | KL(1.00×0.5):
0.0503
2844.9s 237 [Epoch 080] Total: 2.3266 | N: 1.4798 | E: 0.8216 | KL(1.00×0.5):
0.0503
2844.9s 238 Saved checkpoint: /kaggle/working/checkpoints/gvae_80_epoch080.pt
2844.9s 239 Train E80: 0%| | 0/25 [00:00<?, ?batch/s]
Train E80: 0%| | 0/25 [00:01<?, ?batch/s, N=1.4574, E=0.8178, KL=0.0497,
wKL=1.0000]
Train E80: 4%|▍ | 1/25 [00:01<00:34, 1.43s/batch, N=1.4574, E=0.8178,
KL=0.0497, wKL=1.0000]
Train E80: 4%|▍ | 1/25 [00:02<00:34, 1.43s/batch, N=1.3826, E=0.8243,
KL=0.0490, wKL=1.0000]
Train E80: 8%|▊ | 2/25 [00:02<00:33, 1.44s/batch, N=1.3826, E=0.8243,
KL=0.0490, wKL=1.0000]
Train E80: 8%|▊ | 2/25 [00:04<00:33, 1.44s/batch, N=1.5051, E=0.8204,
KL=0.0506, wKL=1.0000]
Train E80: 12%|█▏ | 3/25 [00:04<00:31, 1.41s/batch, N=1.5051, E=0.8204,
KL=0.0506, wKL=1.0000]
Train E80: 12%|█▏ | 3/25 [00:05<00:31, 1.41s/batch, N=1.4334, E=0.8193,
KL=0.0493, wKL=1.0000]
Train E80: 16%|█▌ | 4/25 [00:05<00:29, 1.41s/batch, N=1.4334, E=0.8193,
KL=0.0493, wKL=1.0000]
Train E80: 16%|█▌ | 4/25 [00:07<00:29, 1.41s/batch, N=1.5351, E=0.8275,
KL=0.0508, wKL=1.0000]
Train E80: 20%|██ | 5/25 [00:07<00:27, 1.39s/batch, N=1.5351, E=0.8275,
KL=0.0508, wKL=1.0000]
Train E80: 20%|██ | 5/25 [00:08<00:27, 1.39s/batch, N=1.4755, E=0.8143,
KL=0.0506, wKL=1.0000]
Train E80: 24%|██▍ | 6/25 [00:08<00:26, 1.40s/batch, N=1.4755, E=0.8143,
KL=0.0506, wKL=1.0000]
Train E80: 24%|██▍ | 6/25 [00:09<00:26, 1.40s/batch, N=1.4700, E=0.8232,
KL=0.0499, wKL=1.0000]
Train E80: 28%|██▊ | 7/25 [00:09<00:25, 1.39s/batch, N=1.4700, E=0.8232,
KL=0.0499, wKL=1.0000]
Train E80: 28%|██▊ | 7/25 [00:11<00:25, 1.39s/batch, N=1.5489, E=0.8210,
KL=0.0514, wKL=1.0000]
Train E80: 32%|███▏ | 8/25 [00:11<00:23, 1.41s/batch, N=1.5489, E=0.8210,
KL=0.0514, wKL=1.0000]
Train E80: 32%|███▏ | 8/25 [00:12<00:23, 1.41s/batch, N=1.4681, E=0.8261,
KL=0.0494, wKL=1.0000]
Train E80: 36%|███▌ | 9/25 [00:12<00:22, 1.39s/batch, N=1.4681, E=0.8261,
KL=0.0494, wKL=1.0000]
Train E80: 36%|███▌ | 9/25 [00:14<00:22, 1.39s/batch, N=1.4855, E=0.8211,
KL=0.0502, wKL=1.0000]
Train E80: 40%|████ | 10/25 [00:14<00:21, 1.40s/batch, N=1.4855, E=0.8211,
KL=0.0502, wKL=1.0000]
Train E80: 40%|████ | 10/25 [00:15<00:21, 1.40s/batch, N=1.4895, E=0.8198,
KL=0.0510, wKL=1.0000]
Train E80: 44%|████▍ | 11/25 [00:15<00:20, 1.46s/batch, N=1.4895, E=0.8198,
KL=0.0510, wKL=1.0000]
Train E80: 44%|████▍ | 11/25 [00:17<00:20, 1.46s/batch, N=1.4731, E=0.8225,
KL=0.0498, wKL=1.0000]
Train E80: 48%|████▊ | 12/25 [00:17<00:18, 1.43s/batch, N=1.4731, E=0.8225,
KL=0.0498, wKL=1.0000]
Train E80: 48%|████▊ | 12/25 [00:18<00:18, 1.43s/batch, N=1.5002, E=0.8224,
KL=0.0494, wKL=1.0000]
Train E80: 52%|█████▏ | 13/25 [00:18<00:17, 1.42s/batch, N=1.5002, E=0.8224,
KL=0.0494, wKL=1.0000]
Train E80: 52%|█████▏ | 13/25 [00:19<00:17, 1.42s/batch, N=1.5430, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E80: 56%|█████▌ | 14/25 [00:19<00:15, 1.41s/batch, N=1.5430, E=0.8227,
KL=0.0526, wKL=1.0000]
Train E80: 56%|█████▌ | 14/25 [00:21<00:15, 1.41s/batch, N=1.4453, E=0.8180,
KL=0.0504, wKL=1.0000]
Train E80: 60%|██████ | 15/25 [00:21<00:14, 1.43s/batch, N=1.4453, E=0.8180,
KL=0.0504, wKL=1.0000]
Train E80: 60%|██████ | 15/25 [00:22<00:14, 1.43s/batch, N=1.4463, E=0.8236,
KL=0.0489, wKL=1.0000]
Train E80: 64%|██████▍ | 16/25 [00:22<00:12, 1.42s/batch, N=1.4463, E=0.8236,
KL=0.0489, wKL=1.0000]
Train E80: 64%|██████▍ | 16/25 [00:24<00:12, 1.42s/batch, N=1.5152, E=0.8207,
KL=0.0513, wKL=1.0000]
Train E80: 68%|██████▊ | 17/25 [00:24<00:11, 1.42s/batch, N=1.5152, E=0.8207,
KL=0.0513, wKL=1.0000]
Train E80: 68%|██████▊ | 17/25 [00:25<00:11, 1.42s/batch, N=1.4201, E=0.8185,
KL=0.0498, wKL=1.0000]
Train E80: 72%|███████▏ | 18/25 [00:25<00:09, 1.41s/batch, N=1.4201, E=0.8185,
KL=0.0498, wKL=1.0000]
Train E80: 72%|███████▏ | 18/25 [00:26<00:09, 1.41s/batch, N=1.6035, E=0.8281,
KL=0.0501, wKL=1.0000]
Train E80: 76%|███████▌ | 19/25 [00:26<00:08, 1.40s/batch, N=1.6035, E=0.8281,
KL=0.0501, wKL=1.0000]
Train E80: 76%|███████▌ | 19/25 [00:28<00:08, 1.40s/batch, N=1.4994, E=0.8254,
KL=0.0507, wKL=1.0000]
Train E80: 80%|████████ | 20/25 [00:28<00:07, 1.40s/batch, N=1.4994, E=0.8254,
KL=0.0507, wKL=1.0000]
Train E80: 80%|████████ | 20/25 [00:29<00:07, 1.40s/batch, N=1.4604, E=0.8212,
KL=0.0498, wKL=1.0000]
Train E80: 84%|████████▍ | 21/25 [00:29<00:05, 1.40s/batch, N=1.4604, E=0.8212,
KL=0.0498, wKL=1.0000]
Train E80: 84%|████████▍ | 21/25 [00:31<00:05, 1.40s/batch, N=1.3982, E=0.8171,
KL=0.0502, wKL=1.0000]
Train E80: 88%|████████▊ | 22/25 [00:31<00:04, 1.42s/batch, N=1.3982, E=0.8171,
KL=0.0502, wKL=1.0000]
Train E80: 88%|████████▊ | 22/25 [00:32<00:04, 1.42s/batch, N=1.4754, E=0.8217,
KL=0.0520, wKL=1.0000]
Train E80: 92%|█████████▏| 23/25 [00:32<00:02, 1.41s/batch, N=1.4754, E=0.8217,
KL=0.0520, wKL=1.0000]
Train E80: 92%|█████████▏| 23/25 [00:34<00:02, 1.41s/batch, N=1.4833, E=0.8209,
KL=0.0503, wKL=1.0000]
Train E80: 96%|█████████▌| 24/25 [00:34<00:01, 1.59s/batch, N=1.4833, E=0.8209,
KL=0.0503, wKL=1.0000]
Train E80: 96%|█████████▌| 24/25 [00:35<00:01, 1.59s/batch, N=1.4833, E=0.8231,
KL=0.0486, wKL=1.0000]
Train E80: 100%|██████████| 25/25 [00:35<00:00, 1.31s/batch, N=1.4833, E=0.8231,
KL=0.0486, wKL=1.0000]
Train E80: 100%|██████████| 25/25 [00:35<00:00, 1.41s/batch, N=1.4833, E=0.8231,
KL=0.0486, wKL=1.0000]
2850.7s 240 /usr/local/lib/python3.11/dist-packages/traitlets/traitlets.py:2915:
FutureWarning: --
Exporter.preprocessors=["remove_papermill_header.RemovePapermillHeader"] for
containers is deprecated in traitlets 5.0. You can pass `--Exporter.preprocessors
item` ... multiple times to add items to a list.
2850.7s 241 warn(
2850.7s 242 [NbConvertApp] Converting notebook __notebook__.ipynb to notebook
2851.5s 243 [NbConvertApp] Writing 67931 bytes to __notebook__.ipynb
2852.8s 244 /usr/local/lib/python3.11/dist-packages/traitlets/traitlets.py:2915:
FutureWarning: --
Exporter.preprocessors=["nbconvert.preprocessors.ExtractOutputPreprocessor"] for
containers is deprecated in traitlets 5.0. You can pass `--Exporter.preprocessors
item` ... multiple times to add items to a list.
2852.8s 245 warn(
2852.8s 246 [NbConvertApp] Converting notebook __notebook__.ipynb to html
2853.6s 247 [NbConvertApp] Writing 409106 bytes to __results__.html

You might also like