
One of our main innovations is a new conditional generation method for unconditional diffusion models. Our new conditioning method, which we refer to as the gradient method, modifies the sampling procedure of the model to improve a conditioning loss on denoised data using gradient-based optimization. We find that the gradient method is better than existing methods at ensuring that the generated samples are consistent with the conditioning information.
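As a rough illustration of what "improving a conditioning loss on denoised data" might look like, the sketch below adjusts the denoised estimate of the unknown part of a sample by a gradient step on a squared-error loss over the known part. Everything here is an assumption made for illustration: `toy_denoiser` is a hypothetical linear stand-in for a trained model, the squared-error loss and the `0.5 * weight * alpha_t` step scale are one plausible choice rather than the authors' stated formula, and the gradient is taken by finite differences where a real implementation would use automatic differentiation.

```python
def toy_denoiser(z_a, z_b):
    """Hypothetical stand-in for a trained denoiser.

    z_a: noisy latents for the conditioning frames.
    z_b: noisy latents for the frames being generated.
    Returns denoised estimates (x_hat_a, x_hat_b). The estimate for the
    conditioning part depends on z_b, which is what makes guidance possible.
    """
    x_hat_a = [0.5 * (a + b) for a, b in zip(z_a, z_b)]
    x_hat_b = [0.9 * b for b in z_b]
    return x_hat_a, x_hat_b


def conditioning_loss(x_a, z_a, z_b):
    """Squared error between the known data x_a and its denoised estimate."""
    x_hat_a, _ = toy_denoiser(z_a, z_b)
    return sum((t - p) ** 2 for t, p in zip(x_a, x_hat_a))


def grad_wrt_z_b(x_a, z_a, z_b, eps=1e-5):
    """Central finite-difference gradient of the loss with respect to z_b."""
    grad = []
    for i in range(len(z_b)):
        up = list(z_b)
        down = list(z_b)
        up[i] += eps
        down[i] -= eps
        diff = conditioning_loss(x_a, z_a, up) - conditioning_loss(x_a, z_a, down)
        grad.append(diff / (2 * eps))
    return grad


def guided_denoise_b(x_a, z_a, z_b, weight=1.0, alpha_t=1.0):
    """Denoised estimate for the generated part, shifted down the loss gradient."""
    _, x_hat_b = toy_denoiser(z_a, z_b)
    grad = grad_wrt_z_b(x_a, z_a, z_b)
    return [xb - 0.5 * weight * alpha_t * g for xb, g in zip(x_hat_b, grad)]


if __name__ == "__main__":
    # Known conditioning data and current noisy latents (toy values).
    print(guided_denoise_b(x_a=[1.0], z_a=[0.0], z_b=[0.0]))
```

In a sampler, a guided estimate of this kind would replace the plain denoised estimate at each step, with `weight` controlling how strongly the conditioning is enforced.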
We use the gradient method to autoregressively extend our models to more timesteps and higher resolutions.
Frames from our gradient method (left) and a baseline "replacement" method (right) for autoregressive extension. Videos sampled using the gradient method attain superior temporal coherence compared to the baseline method.
We show that high quality videos can be generated by essentially the standard formulation of the Gaussian diffusion model, with little modification other than straightforward architectural changes to accommodate video data within memory constraints of deep learning accelerators. We train models that generate a block of a fixed number of frames of a video, and to generate videos longer than that number of frames, we additionally show how to repurpose a trained model to act as a model which is block-autoregressive over frames. We test our methods on an unconditional video generation benchmark, where we achieve state-of-the-art sample quality scores, and we also show promising results on text-conditioned video generation.
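The block-autoregressive extension described above can be pictured with a small loop: sample one fixed-length block, then repeatedly sample a new block conditioned on the last few frames already generated and keep only the new frames. The sketch below is a toy under stated assumptions: `sample_block` is a hypothetical placeholder for a conditional sampler (here frames are just integers that continue counting), and `block_len` and `overlap` are illustrative parameters, not values from the source.

```python
def sample_block(cond_frames, block_len):
    """Hypothetical stand-in for a conditional video sampler.

    Returns block_len frames whose first len(cond_frames) entries are the
    conditioning frames. In this toy, each frame is an integer and the new
    frames simply continue counting from the last conditioning frame.
    """
    start = cond_frames[-1] + 1 if cond_frames else 0
    new_frames = [start + i for i in range(block_len - len(cond_frames))]
    return list(cond_frames) + new_frames


def extend_video(total_frames, block_len=4, overlap=2):
    """Generate total_frames frames with a model that only emits block_len at a time."""
    if not 1 <= overlap < block_len:
        raise ValueError("overlap must satisfy 1 <= overlap < block_len")
    # The first block is sampled without any conditioning frames.
    video = sample_block([], block_len)
    while len(video) < total_frames:
        cond = video[-overlap:]                # most recent frames as conditioning
        block = sample_block(cond, block_len)  # conditioning frames plus new frames
        video.extend(block[len(cond):])        # keep only the newly generated frames
    return video[:total_frames]


if __name__ == "__main__":
    print(extend_video(10, block_len=4, overlap=2))
```

A larger `overlap` gives each new block more context at the cost of producing fewer new frames per sampling call.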
