ComfyUI_Sonic

Sonic is a portrait animation method introduced in the paper 'Sonic: Shifting Focus to Global Audio Perception in Portrait Animation'. This repository lets you use it in ComfyUI.

Update

Model loading now uses a single, monolithic SVD model.
Added a frame number option to control the length of the output video (if it is set very large, the length follows the audio).
Non-square output images are now supported, but they can easily cause OOM.
image_size controls the minimum size of the output image. If you hit OOM, reduce this value.
Thanks to @civen-cn for the PR.
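
The frame number and image_size options can be read as simple arithmetic. The sketch below only illustrates that reading and is not the node's actual code: the function names, the fps argument, treating image_size as the shorter side, and rounding to a multiple of 64 are all assumptions made for this example.

```python
import math
from typing import Optional, Tuple


def planned_frame_count(audio_seconds: float, fps: float,
                        frame_number: Optional[int]) -> int:
    """Frames to render: the full audio length, capped by frame_number if set.

    Hypothetical helper; None or a very large frame_number means
    "follow the audio".
    """
    audio_frames = math.ceil(audio_seconds * fps)
    if frame_number is None or frame_number >= audio_frames:
        return audio_frames
    return max(frame_number, 0)


def scale_to_min_side(width: int, height: int, image_size: int,
                      multiple: int = 64) -> Tuple[int, int]:
    """Scale so the shorter side is about image_size, keeping the aspect ratio.

    Hypothetical helper; snapping to a multiple of 64 is an assumption.
    """
    scale = image_size / min(width, height)

    def snap(value: int) -> int:
        return max(multiple, round(value * scale / multiple) * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    # 10 s of audio at an assumed 25 fps, capped at 100 frames.
    print(planned_frame_count(10.0, 25, 100))
    # A 1280x720 input with image_size=512.
    print(scale_to_min_side(1280, 720, 512))
```

Under this reading, a non-square input keeps its longer side above image_size, which is why lowering image_size is the first thing to try when memory runs out.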

1. Installation

In the ./ComfyUI/custom_nodes directory, run the following:

git clone https://github.com/smthemex/ComfyUI_Sonic.git

2. Requirements

pip install -r requirements.txt

3. Model

3.1.1 Download the required checkpoints from Google. The expected file structure is shown below.
3.1.2 Download openai/whisper-tiny.

--  ComfyUI/models/sonic/
    |-- audio2bucket.pth
    |-- audio2token.pth
    |-- unet.pth
    |-- yoloface_v5m.pt
    |-- whisper-tiny/
        |-- config.json
        |-- model.safetensors
        |-- preprocessor_config.json
    |-- RIFE/
        |-- flownet.pkl
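
A missing file in this tree is an easy mistake to make, so a quick check before starting ComfyUI can save a failed run. The following is a minimal sketch, not part of this repository: the helper name missing_sonic_files is hypothetical, and the file list is copied from the tree above.

```python
from pathlib import Path

# Relative paths copied from the tree above.
EXPECTED_FILES = [
    "audio2bucket.pth",
    "audio2token.pth",
    "unet.pth",
    "yoloface_v5m.pt",
    "whisper-tiny/config.json",
    "whisper-tiny/model.safetensors",
    "whisper-tiny/preprocessor_config.json",
    "RIFE/flownet.pkl",
]


def missing_sonic_files(sonic_dir):
    """Return the expected files that are not present under sonic_dir."""
    root = Path(sonic_dir)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    # Assumes the script is run from the directory that contains ComfyUI/.
    missing = missing_sonic_files("ComfyUI/models/sonic")
    if missing:
        print("Missing files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected Sonic model files are in place.")
```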

3.2 Download an SVD checkpoint: either svd_xt.safetensors or svd_xt_1_1.safetensors

--   ComfyUI/models/checkpoints
    ├── svd_xt.safetensors  or  svd_xt_1_1.safetensors
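
Since either file name is accepted, a small lookup can confirm which one is actually installed. This is a sketch under stated assumptions, not the node's own loading code: find_svd_checkpoint is a hypothetical helper, and the order in which the two names are tried is arbitrary.

```python
from pathlib import Path

# The two file names listed above; the order is an arbitrary choice.
SVD_CANDIDATES = ("svd_xt.safetensors", "svd_xt_1_1.safetensors")


def find_svd_checkpoint(checkpoints_dir):
    """Return the path of the first SVD checkpoint found, or None."""
    root = Path(checkpoints_dir)
    for name in SVD_CANDIDATES:
        path = root / name
        if path.is_file():
            return path
    return None


if __name__ == "__main__":
    # Assumes the script is run from the directory that contains ComfyUI/.
    found = find_svd_checkpoint("ComfyUI/models/checkpoints")
    print(found if found else "No SVD checkpoint found.")
```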

Example

Citation

@misc{ji2024sonicshiftingfocusglobal,
      title={Sonic: Shifting Focus to Global Audio Perception in Portrait Animation}, 
      author={Xiaozhong Ji and Xiaobin Hu and Zhihong Xu and Junwei Zhu and Chuming Lin and Qingdong He and Jiangning Zhang and Donghao Luo and Yi Chen and Qin Lin and Qinglin Lu and Chengjie Wang},
      year={2024},
      eprint={2411.16331},
      archivePrefix={arXiv},
      primaryClass={cs.MM},
      url={https://arxiv.org/abs/2411.16331}, 
}
