Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion (IJCAI 2021)

Paper | Demo

Requirements

Python 3.6, PyTorch >= 1.6, and ffmpeg
Other requirements are listed in requirements.txt.
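
Before running inference, it can help to confirm that the listed requirements are present. The following is a minimal sketch and not part of this repository: `version_at_least` and `check_environment` are hypothetical helper names, and treating Python 3.6 as a minimum (rather than an exact version) is an assumption.

```python
import shutil
import sys


def version_at_least(version, minimum):
    """Compare the major.minor parts of two dotted version strings numerically."""
    def parse(v):
        # Drop local build suffixes such as "+cu101" and keep major.minor only.
        return tuple(int(p) for p in v.split("+")[0].split(".")[:2])
    return parse(version) >= parse(minimum)


def check_environment():
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    # Assumption: Python 3.6 is treated as a lower bound.
    if sys.version_info < (3, 6):
        problems.append("Python 3.6 or newer is expected")
    # ffmpeg must be discoverable on PATH.
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")
    try:
        import torch
    except ImportError:
        problems.append("PyTorch is not installed")
    else:
        if not version_at_least(torch.__version__, "1.6"):
            problems.append("PyTorch >= 1.6 is required, found " + torch.__version__)
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

The version comparison is numeric rather than string-based, so that a release such as 1.10 is correctly treated as newer than 1.6.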

Pretrained Checkpoint

Please download the pretrained checkpoint from Google Drive and place it in the /checkpoints folder.

Generate Demo Results

python inference.py --audio_path xxx.wav --img_path xxx.jpg

Note that the input image must be square (equal height and width), and the face should be appropriately cropped, as in the examples in /demo/img.
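
If a source image is not square, one option is to center-crop it before inference. The sketch below is an illustration only and not part of this repository: `center_square_box` and `make_square` are hypothetical helper names, and Pillow is assumed to be installed. A center crop only enforces equal height and width; it does not guarantee that the face is framed as in /demo/img, so the result should still be checked by eye.

```python
def center_square_box(width, height):
    """Return the (left, upper, right, lower) box of the largest centered square."""
    side = min(width, height)
    left = (width - side) // 2
    upper = (height - side) // 2
    return (left, upper, left + side, upper + side)


def make_square(src_path, dst_path):
    """Center-crop the image at src_path to a square and save it to dst_path."""
    from PIL import Image  # Pillow; assumed to be installed separately

    with Image.open(src_path) as img:
        # img.size is (width, height)
        box = center_square_box(*img.size)
        img.crop(box).save(dst_path)
```

For example, `make_square("portrait.jpg", "portrait_square.jpg")` (hypothetical file names) writes a square copy that can then be passed to `--img_path`.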

License and Citation

@InProceedings{wang2021audio2head,
author = {Suzhen Wang and Lincheng Li and Yu Ding and Changjie Fan and Xin Yu},
title = {Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion},
booktitle = {the 30th International Joint Conference on Artificial Intelligence (IJCAI-21)},
year = {2021},
}

Acknowledgement

This codebase is based on the First Order Motion Model. We thank its authors for their contribution.
