ECCV 2026

OmniDance

Multimodal Driven Dance Video Generation with Large-scale Internet Data

Kaixing Yang1 Jiashu Zhu2,* Xulong Tang5 Ziqiao Peng1 Xiangyue Zhang4 Chubin Chen3 Puwei Wang1,† Jiahong Wu2,† Xiangxiang Chu2 Hongyan Liu3 Jun He1,†

1Renmin University of China 2AMap, Alibaba 3Tsinghua University 4Wuhan University 5Malou Tech Inc

*Project Leader   Corresponding Authors

Abstract

Music-driven dance video generation aims to synthesize expressive human motion that is temporally aligned with music while maintaining high visual fidelity. Existing methods still struggle to generate dance videos that simultaneously exhibit expressive motion and high visual quality, largely due to limited dance-specific data and the difficulty of integrating music into video generation foundation models.

We introduce CIPE-Dance, a large-scale Internet-sourced dance video dataset with choreography-informed text annotations, and propose OmniDance, a framework-level recipe for integrating music into a TI2V foundation model without sacrificing controllability or visual fidelity. OmniDance supports unified TI2V, MI2V, and MTI2V generation through depth-aware specialization, anchored curriculum learning, and modality-specialized time-dependent CFG.

Overview Video

Video

Dataset

CIPE-Dance

CIPE-Dance data collection and statistics
300K high-quality clips
400+ video hours
30+ dance genres
5 annotation aspects

Method

OmniDance Framework

OmniDance framework overview
01

Music-Text Progressive Specialization

02

Curriculum Learning Training Strategy

03

Modality-Specialized Inference Strategy

Results

Generation Results

TI2V Text + Image to Video
MI2V Music + Image to Video
MTI2V Music + Text + Image to Video

Citation

BibTeX

@article{omnidance2026,
  title={OmniDance: Multimodal Driven Dance Video Generation with Large-scale Internet Data},
  author={Yang, Kaixing and Zhu, Jiashu and Tang, Xulong and Peng, Ziqiao and Zhang, Xiangyue and Chen, Chubin and Wang, Puwei and Wu, Jiahong and Chu, Xiangxiang and Liu, Hongyan and He, Jun},
  journal={ECCV},
  year={2026}
}