
One of our main innovations is a new conditional generation method for unconditional diffusion models. Our new conditioning method, which refer to as the gradient method, modifies the sampling procedure of the model to improve a conditioning loss on denoised data using gradient-based optimization. We find that the gradient method is more capable than existing methods in ensuring consistency of the generated samples with the conditioning information.
We use the gradient method to autoregressively extend our models to more timesteps and higher resolutions.
Frames from our gradient method (left) and a baseline "replacement" method (right) for autoregressive extension. Videos sampled using the gradient method attain superior temporal coherence compared to the baseline method.
We show that high quality videos can be generated by essentially the standard formulation of the Gaussian diffusion model, with little modification other than straightforward architectural changes to accommodate video data within memory constraints of deep learning accelerators. We train models that generate a block of a fixed number of frames of a video, and to generate videos longer than that number of frames, we additionally show how to repurpose a trained model to act as a model which is block-autoregressive over frames. We test our methods on an unconditional video generation benchmark, where we achieve state-of-the-art sample quality scores, and we also show promising results on text-conditioned video generation.
数据统计
数据评估
关于Video Diffusion Models特别声明
本站鸟瑞导航提供的Video Diffusion Models数据都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由鸟瑞导航实际控制,在2025年9月10日 下午6:59收录时,该网页上的内容,都属于合法合规,后期网页的内容如出现违规,请联系本站网站管理员进行举报,我们将进行删除,鸟瑞导航不承担任何责任。
相关导航

Chaos develops visualization technologies that empower artists & designers to create photorealistic imagery and animation across all creative industries

Auth0: Secure access for everyone. But not just anyone.
Rapidly integrate authentication and authorization for web, mobile, and legacy applications so you can focus on your core business.

SceneXplain
SceneXplain - Leading AI Solution for Image Captions and Video Summaries

InstantStyle
InstantStyle.

SuperCraft
SuperCraft helps teams design great physical products

智谱AI绘画
中国版对话语言模型,与GLM大模型进行对话。

#1 AI Manga Translator
Upload manga images and translate into multiple languages with one click, preserving original artwork. Fast, accurate AI translation for comics and manga fans.

Civitai社区
Explore thousands of high-quality Stable Diffusion & Flux models, share your AI-generated art, and engage with a vibrant community of creators
暂无评论...


