MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions


ICCV 2023


🕊️ Description

MODA is a unified system for multi-person, diverse, and high-fidelity talking portrait generation.

🎊 News

  • 2023/08/31 Training code has been released.
  • 2023/08/31 Pretrained models have been released.
  • 2023/08/13 Inference code has been released.
  • 2023/08/13 Data preprocessing scripts have been released.

🛠️ Installation

After cloning the repository, set up the environment by running the install.sh script; it prepares MODA for use.

git clone https://github.com/DreamtaleCore/MODA.git
cd MODA
bash ./install.sh

🚀 Usage

Quick run

python inference.py

After a few minutes ☕, the results will be generated in results/.

Parameters:

usage: Inference entrance for MODA. [-h] [--audio_fp_or_dir AUDIO_FP_OR_DIR] [--person_config PERSON_CONFIG]
                                    [--output_dir OUTPUT_DIR] [--n_sample N_SAMPLE]

optional arguments:
  -h, --help            show this help message and exit
  --audio_fp_or_dir AUDIO_FP_OR_DIR
  --person_config PERSON_CONFIG
  --output_dir OUTPUT_DIR
  --n_sample N_SAMPLE
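
For example, a full invocation might look like the following sketch; the audio file, person config path, and sample count are placeholders, not files shipped with a particular checkout:

# all paths and values below are illustrative (point them at your own files)
python inference.py \
    --audio_fp_or_dir assets/audio/demo.wav \
    --person_config assets/configs/Cathy.yaml \
    --n_sample 1 \
    --output_dir results/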

🍏 Dataset preparation

cd data_prepare

python process.py -i your/video/dir -o your/output/dir

For more information, please refer to here.
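
As a concrete sketch, processing a folder of talking-head videos looks like the following (both directories are hypothetical; substitute your own):

python process.py -i ~/datasets/my_videos -o ~/datasets/my_videos_processed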

🏃 Train

Train the MODA and FaCo models

python train.py --config configs/train/moda.yaml
python train.py --config configs/train/faco.yaml

Train the renderer for new avatar

python train_renderer.py --config configs/train/renderer/Cathy.yaml
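
To train a renderer for your own avatar, a reasonable pattern (the MyAvatar name is a placeholder) is to copy the provided config, edit its paths, and train from the copy:

cp configs/train/renderer/Cathy.yaml configs/train/renderer/MyAvatar.yaml
# edit the dataset and output paths inside MyAvatar.yaml, then:
python train_renderer.py --config configs/train/renderer/MyAvatar.yaml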

Link your models

ln -s your_absolute_dir/TrainMODAVel/Audio2FeatureVertices/best_MODA.pkl assets/ckpts/MODA.pkl
ln -s your_absolute_dir/TrainFaCoModel/Audio2FeatureVertices/best_FaCo_G.pkl assets/ckpts/FaCo.pkl
ln -s your_absolute_dir/Render/TrainRenderCathy/Render/best_Render_G.pkl assets/ckpts/renderer/Cathy.pth

Then update the ckpt filepath in your config files.
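
Note that ln -s does not verify its target, so a mistyped your_absolute_dir silently produces broken links. A quick sanity check that the links resolve, using the target names from the example above:

# -L dereferences the symlinks; a broken link is reported as an error
ls -lL assets/ckpts/MODA.pkl assets/ckpts/FaCo.pkl assets/ckpts/renderer/Cathy.pth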

🚧 TODO

  • Release the inference code
  • Release the data preprocessing scripts
  • Prepare the pretrained weights
  • Release the training code
  • Prepare the Hugging Face 🤗 demo
  • Release the processed HDTF data

🛎 Citation

If you find our work useful in your research, please consider citing:

@inproceedings{liu2023MODA,
  title={MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions},
  author={Liu, Yunfei and Lin, Lijian and Yu, Fei and Zhou, Changyin and Li, Yu},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year={2023}
}

🥂 Acknowledgement

Our code is based on LiveSpeechPortrait and FaceFormer.
