推理出来的视频，整体来说还不错。但是嘴型有时候会突然很快 #102

wanghx1121 · 2024-05-09T04:15:16Z

@ZiqiaoPeng 作者你好，在使用hubert推理后，嘴型在某个阶段会突然很快。麻烦看一下。谢谢！~
在nerf中，使用同一个视频同样的参数推理，没有复现该问题

5.9.5.mp4

ZiqiaoPeng · 2024-05-09T04:17:51Z

看起来不是推理的问题，是视频拼接的问题，在那一帧应该是用的别的帧的图片，然后导致突然的抖动，可以检查一下特定帧对应的图片。

wanghx1121 · 2024-05-09T05:40:56Z

看起来不是推理的问题，是视频拼接的问题，在那一帧应该是用的别的帧的图片，然后导致突然的抖动，可以检查一下特定帧对应的图片。

这个问题，应该怎么去定位，麻烦告知一下 @ZiqiaoPeng

wanghx1121 · 2024-05-09T06:22:10Z

看起来不是推理的问题，是视频拼接的问题，在那一帧应该是用的别的帧的图片，然后导致突然的抖动，可以检查一下特定帧对应的图片。

5.9.7.mp4

上述视频是推理完成后，自动生成的测试结果。也出现了这个问题！~

wanghx1121 · 2024-05-09T06:23:26Z

看起来不是推理的问题，是视频拼接的问题，在那一帧应该是用的别的帧的图片，然后导致突然的抖动，可以检查一下特定帧对应的图片。

5.9.7.mp4
上述视频是推理完成后，自动生成的测试结果。也出现了这个问题！~

@ZiqiaoPeng

ZiqiaoPeng · 2024-05-09T06:27:45Z

如果方便的话可以把原视频发送到我的邮箱pengziqiao@ruc.edu.cn，以对问题进行定位。

wanghx1121 · 2024-05-09T06:36:25Z

如果方便的话可以把原视频发送到我的邮箱pengziqiao@ruc.edu.cn，以对问题进行定位。

可以的。下面是nerf训练出来的结果：

5.9.8.mp4

wanghx1121 · 2024-05-09T06:50:36Z

pengziqiao@ruc.edu.cn

已发送邮件，请查收！~

wanghx1121 · 2024-05-09T09:42:55Z

如果方便的话可以把原视频发送到我的邮箱pengziqiao@ruc.edu.cn，以对问题进行定位。

请问你本地复现了吗？ @ZiqiaoPeng

jinqiupeter · 2024-05-09T16:01:13Z

Most likely it's because of your source video. Here is my result:

qs_cn_half.mp4

ZiqiaoPeng · 2024-05-09T16:13:49Z

wf_test.mp4

我使用hubert训练的结果没有问题，头部稳定，唇形同步，眼睛正常眨眼。

wanghx1121 · 2024-05-10T01:21:52Z

我使用hubert训练的结果没有问题，头部稳定，唇形同步，眼睛正常眨眼。

@ZiqiaoPeng 请问素材你做了特殊处理吗？能详细说一下你的预训练过程吗？谢谢！~

StephanPan · 2024-05-10T03:02:24Z

当推理帧和原始帧差异较大，结果会有双下巴，是需要训一下torso吗？还是贴脸的逻辑不太对？

test_result.mp4

ZiqiaoPeng · 2024-05-10T05:09:03Z

我使用hubert训练的结果没有问题，头部稳定，唇形同步，眼睛正常眨眼。

@ZiqiaoPeng 请问素材你做了特殊处理吗？能详细说一下你的预训练过程吗？谢谢！~

没有特殊处理，step1训练6w步，step2训练到10w步。

wning13 · 2024-05-10T05:47:28Z

当推理帧和原始帧差异较大，结果会有双下巴，是需要训一下torso吗？还是贴脸的逻辑不太对？

test_result.mp4

我之前遇到过这个问题，当时尝试了在预处理数据的时候把靠上一部分的脖子区域标记成脸部，生成效果会好一些。

wning13 · 2024-05-10T05:48:29Z

也可以试试用类似柏松融合的方案修复

wanghx1121 · 2024-05-10T07:41:04Z

我使用hubert训练的结果没有问题，头部稳定，唇形同步，眼睛正常眨眼。

@ZiqiaoPeng 请问素材你做了特殊处理吗？能详细说一下你的预训练过程吗？谢谢！~

没有特殊处理，step1训练6w步，step2训练到10w步。

我step1 训练20W步，step2 训练到40W步。复现了该问题。因为我的素材时长为5分钟，步数太少，像楼上所说，会出现双下巴 @ZiqiaoPeng

HinaAnwar04 · 2024-05-10T11:11:25Z

Most likely it's because of your source video. Here is my result:

qs_cn_half.mp4

can you please share how you achieved such good results, you followed the same repo code for preprocessing or made any additional changes ? For training which asr_model you used hubert, ave or deepspeech and no of training iterations please?

StephanPan · 2024-05-11T07:28:35Z

有什么tricks可以提高清晰度吗，感觉预测的图像清晰度相较训练集有所下降？

flysky126 · 2024-05-13T06:23:03Z

Most likely it's because of your source video. Here is my result:

qs_cn_half.mp4

这个是用May使用的模式训练出来的吗？我训了新的id 嘴型对的没有那么好，是数据不够吗？

alexcazacu · 2024-05-13T07:09:11Z

有什么tricks可以提高清晰度吗，感觉预测的图像清晰度相较训练集有所下降？

@StephanPan When the output mp4 is merged with the target audio, the video is re-encoded, leading to a substantial loss in quality. To solve this, you can add "-c:v copy" to this ffmpeg command: https://github.com/ZiqiaoPeng/SyncTalk/blob/main/nerf_triplane/utils.py#L1101.

StephanPan · 2024-05-13T07:18:32Z

@alexcazacu thx for your suggestion. It's true that the ffmpeg may reduce the quality of image, but i found that the raw output of the model is of lower quality than the training images.

samggggflynn · 2024-05-13T08:00:48Z

Most likely it's because of your source video. Here is my result:

qs_cn_half.mp4

nice job. 请问一下，你这个训练素材多少时长？ step1 和 step2 各训练了多少？

samggggflynn · 2024-05-13T08:06:12Z

有什么tricks可以提高清晰度吗，感觉预测的图像清晰度相较训练集有所下降？

遇到同样的问题，请问你找到原因或者解决了吗？增加数据和训练step有作用吗

huyppppppp · 2024-05-13T09:32:15Z

Most likely it's because of your source video. Here is my result:

qs_cn_half.mp4
请问这个使用多长的视频训练的呀，我训练出来，嘴部有抖动，你这个效果很好

jinqiupeter · 2024-05-13T13:00:40Z

Most likely it's because of your source video. Here is my result:
qs_cn_half.mp4

nice job. 请问一下，你这个训练素材多少时长？ step1 和 step2 各训练了多少？

I trained with a 2-minute-long video, using the default steps (60k and 100k steps)

samggggflynn · 2024-05-13T13:04:34Z

may i ask more details, what does the face cropping region look like in your training data? Peter ***@***.***>于2024年5月13日周一下午9:01写道：

…

Most likely it's because of your source video. Here is my result: qs_cn_half.mp4 nice job. 请问一下，你这个训练素材多少时长？ step1 和 step2 各训练了多少？ I trained with a 2-minute-long video, using the default steps (60k and 100k steps) — Reply to this email directly, view it on GitHub <#102 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AJ4MHLWPON5LVVKDFTDZ5C3ZCC2QZAVCNFSM6AAAAABHODXBMSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMBXGUYTMNRWGQ> . You are receiving this because you commented.Message ID: ***@***.***>

StephanPan · 2024-05-14T06:54:51Z

模型直出的效果下巴会有像素拉伸的效果，有人可以解答一下吗？

ngp_ep0015.mp4

schxnhxlz · 2024-05-14T09:12:46Z

模型直出的效果下巴会有像素拉伸的效果，有人可以解答一下吗？

ngp_ep0015.mp4

I have the same problem :/

CSZHK · 2024-05-17T12:02:58Z

Most likely it's because of your source video. Here is my result:很可能是因为你的源视频。这是我的结果：

qs_cn_half.mp4

这个有做什么修改么，效果真不错

CSZHK · 2024-05-19T13:25:31Z

https://github.com/ZiqiaoPeng/SyncTalk/assets/5602838/504b44a7-0a0d-4a9a-bc75-851bd968de4a
没有特殊处理，step1训练6w步，step2训练到10w步。为啥我的边框和抖动这么厉害，求帮忙看下

schxnhxlz · 2024-05-22T09:33:59Z

https://github.com/ZiqiaoPeng/SyncTalk/assets/5602838/504b44a7-0a0d-4a9a-bc75-851bd968de4a 没有特殊处理，step1训练6w步，step2训练到10w步。为啥我的边框和抖动这么厉害，求帮忙看下

I had a similiar issue with a person with long hair covering parts of the face. maybe also the glasses are decreasing the quality.

zhouzhenneng · 2024-05-23T11:29:20Z

当推理帧和原始帧差异较大，结果会有双下巴，是需要训一下torso吗？还是贴脸的逻辑不太对？
test_result.mp4

我之前遇到过这个问题，当时尝试了在预处理数据的时候把靠上一部分的脖子区域标记成脸部，生成效果会好一些。

请问有具体点的操作步骤吗，需要修改哪些文件呢

ZiqiaoPeng closed this as completed May 9, 2024

ZiqiaoPeng reopened this May 9, 2024

schxnhxlz mentioned this issue May 14, 2024

Questions About Enhancing Video Quality #78

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

推理出来的视频，整体来说还不错。但是嘴型有时候会突然很快 #102

推理出来的视频，整体来说还不错。但是嘴型有时候会突然很快 #102

wanghx1121 commented May 9, 2024

ZiqiaoPeng commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

ZiqiaoPeng commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

jinqiupeter commented May 9, 2024 •

edited

ZiqiaoPeng commented May 9, 2024

wanghx1121 commented May 10, 2024

StephanPan commented May 10, 2024

ZiqiaoPeng commented May 10, 2024

wning13 commented May 10, 2024

wning13 commented May 10, 2024

wanghx1121 commented May 10, 2024

HinaAnwar04 commented May 10, 2024

StephanPan commented May 11, 2024

flysky126 commented May 13, 2024

alexcazacu commented May 13, 2024

StephanPan commented May 13, 2024

samggggflynn commented May 13, 2024

samggggflynn commented May 13, 2024

huyppppppp commented May 13, 2024

jinqiupeter commented May 13, 2024

samggggflynn commented May 13, 2024 via email

StephanPan commented May 14, 2024

schxnhxlz commented May 14, 2024

CSZHK commented May 17, 2024

CSZHK commented May 19, 2024 •

edited

schxnhxlz commented May 22, 2024

zhouzhenneng commented May 23, 2024

推理出来的视频，整体来说还不错。但是嘴型有时候会突然很快 #102

推理出来的视频，整体来说还不错。但是嘴型有时候会突然很快 #102

Comments

wanghx1121 commented May 9, 2024

ZiqiaoPeng commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

ZiqiaoPeng commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

wanghx1121 commented May 9, 2024

jinqiupeter commented May 9, 2024 • edited

ZiqiaoPeng commented May 9, 2024

wanghx1121 commented May 10, 2024

StephanPan commented May 10, 2024

ZiqiaoPeng commented May 10, 2024

wning13 commented May 10, 2024

wning13 commented May 10, 2024

wanghx1121 commented May 10, 2024

HinaAnwar04 commented May 10, 2024

StephanPan commented May 11, 2024

flysky126 commented May 13, 2024

alexcazacu commented May 13, 2024

StephanPan commented May 13, 2024

samggggflynn commented May 13, 2024

samggggflynn commented May 13, 2024

huyppppppp commented May 13, 2024

jinqiupeter commented May 13, 2024

samggggflynn commented May 13, 2024 via email

StephanPan commented May 14, 2024

schxnhxlz commented May 14, 2024

CSZHK commented May 17, 2024

CSZHK commented May 19, 2024 • edited

schxnhxlz commented May 22, 2024

zhouzhenneng commented May 23, 2024

jinqiupeter commented May 9, 2024 •

edited

CSZHK commented May 19, 2024 •

edited