dreamvideo-t2v

我要开发同款
匿名用户2024年07月31日
31阅读
所属分类aiPytorch
开源地址https://modelscope.cn/models/iic/dreamvideo-t2v

作品详情

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion (CVPR2024)

figure1

Customized generation using diffusion models has made impressive progress in image generation, but remains unsatisfactory in the challenging video generation task, as it requires the controllability of both subjects and motions. To that end, we present DreamVideo, a novel approach to generating personalized videos from a few static images of the desired subject and a few videos of target motion. DreamVideo decouples this task into two stages, subject learning and motion learning, by leveraging a pre-trained video diffusion model. The subject learning aims to accurately capture the fine appearance of the subject from provided images, which is achieved by combining textual inversion and fine-tuning of our carefully designed identity adapter. In motion learning, we architect a motion adapter and fine-tune it on the given videos to effectively model the target motion pattern. Combining these two lightweight and efficient adapters allows for flexible customization of any subject with any motion. Extensive experimental results demonstrate the superior performance of our DreamVideo over the state-of-the-art methods for customized video generation. Our project page is at https://dreamvideo-t2v.github.io.

?News!!!

  • [2024.04] We release the models of DreamVideo and ModelScopeT2V V1.5!!! ModelScopeT2V V1.5 is further fine-tuned on ModelScopeT2V for 365k iterations with more data.
  • [2024.03] We release the training and inference code of DreamVideo!

Models

ModelScopeT2V V1.5

ModelScopeT2V V1.5 is further fine-tuned on ModelScopeT2V for 365k iterations with more data. The weight file of ModelScopeT2V V1.5 is model_scope_v1-5_0632000.pth.

Customized Adapters

We provide ID adapters for two subjects (dog2, wolf_plushie) and motion adapters for two motions (carTurn, playingGuitar). The adapter's weight files and text embeddings are located in the DreamVideo directory.

BibTeX

If this repo is useful to you, please cite our corresponding paper.

@inproceedings{dreamvideo,
  title={DreamVideo: Composing Your Dream Videos with Customized Subject and Motion},
  author={Wei, Yujie and Zhang, Shiwei and Qing, Zhiwu and Yuan, Hangjie and Liu, Zhiheng and Liu, Yu and Zhang, Yingya and Zhou, Jingren and Shan, Hongming},
  booktitle={CVPR},
  year={2024}
}
声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论