Why my train loss after introducing sync loss? #140

Marskly · 2024-05-01T13:04:40Z

After introducin at Step 250000, the L1 Loss, Vgg Loss, Percep are all increasing.
It is because taht the loss of sync is too big? And it influences the weights of model?

see2run · 2024-05-07T09:39:02Z

Hey, can you share what you do from dataset preparation to running the script train_syncnet_sam.py? Because I've been trying and the output result is just stuck like this without any progress:

(w2l_cek) vian:~/wav2lip_288x288$ python3 train_syncnet_sam.py
use_cuda: True
total trainable params 65054464
Training From Scratch !!!
Starting Epoch: 0

Marskly · 2024-05-09T00:47:16Z

Hey, can you share what you do from dataset preparation to running the script train_syncnet_sam.py? Because I've been trying and the output result is just stuck like this without any progress:

(w2l_cek) vian:~/wav2lip_288x288$ python3 train_syncnet_sam.py use_cuda: True total trainable params 65054464 Training From Scratch !!! Starting Epoch: 0

Maybe your CPU loads data too slowly. You can monitor your CPU utilization and GPU memory.
Try smaller batch size.

Liming-belief · 2024-05-17T08:54:46Z

Hello, I have encountered the same problem as you. Have you resolved it @Marskly

see2run · 2024-05-20T03:18:44Z

Hey, can you share what you do from dataset preparation to running the script train_syncnet_sam.py? Because I've been trying and the output result is just stuck like this without any progress:
(w2l_cek) vian:~/wav2lip_288x288$ python3 train_syncnet_sam.py use_cuda: True total trainable params 65054464 Training From Scratch !!! Starting Epoch: 0

Maybe your CPU loads data too slowly. You can monitor your CPU utilization and GPU memory. Try smaller batch size.

Okay, I have solved it, thank you, and now when training, the results are as follows:

Step 259 | L1: 0.08976 | Vgg: 0.3718 | SW: 0.03 | Sync: 0.0 | DW: 0.0 | Percep: 0.0 | Fake: 0.0, Real: 0.0 | Load: 0.01096, Train: 1.225

where Percep, Fake, and Real are always 0.0.
Can you provide any suggestions? I am training with 1725 videos

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why my train loss after introducing sync loss? #140

Why my train loss after introducing sync loss? #140

Marskly commented May 1, 2024

see2run commented May 7, 2024

Marskly commented May 9, 2024

Liming-belief commented May 17, 2024

see2run commented May 20, 2024

Why my train loss after introducing sync loss? #140

Why my train loss after introducing sync loss? #140

Comments

Marskly commented May 1, 2024

see2run commented May 7, 2024

Marskly commented May 9, 2024

Liming-belief commented May 17, 2024

see2run commented May 20, 2024