Google Scholar

Diffusion models in vision: A survey

FA Croitoru, V Hondru, RT Ionescu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Denoising diffusion models represent a recent emerging topic in computer vision,
demonstrating remarkable results in the area of generative modeling. A diffusion model is a …

Cited by 741

Align your latents: High-resolution video synthesis with latent diffusion models

A Blattmann, R Rombach, H Ling… - Proceedings of the …, 2023 - openaccess.thecvf.com

Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding
excessive compute demands by training a diffusion model in a compressed lower …

Cited by 494

A survey on video diffusion models

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - arXiv preprint arXiv …, 2023 - arxiv.org

The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

Cited by 35

Preserve your own correlation: A noise prior for video diffusion models

S Ge, S Nah, G Liu, T Poon, A Tao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Despite tremendous progress in generating high-quality images using diffusion models,
synthesizing a sequence of animated frames that are both photorealistic and temporally …

Cited by 130

Phenaki: Variable length video generation from open domain textual descriptions

R Villegas, M Babaeizadeh, PJ Kindermans… - International …, 2022 - openreview.net

We present Phenaki, a model capable of realistic video synthesis given a sequence of
textual prompts. Generating videos from text is particularly challenging due to the …

Cited by 257

A survey on generative diffusion models

H Cao, C Tan, Z Gao, Y Xu, G Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Deep generative models have unlocked another profound realm of human creativity. By
capturing and generalizing patterns within data, we have entered the epoch of all …

Cited by 209

Video probabilistic diffusion models in projected latent space

S Yu, K Sohn, S Kim, J Shin - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Despite the remarkable progress in deep generative models, synthesizing high-resolution
and temporally coherent videos still remains a challenge due to their high-dimensionality …

Cited by 104

Magvit: Masked generative video transformer

L Yu, Y Cheng, K Sohn, J Lezama… - Proceedings of the …, 2023 - openaccess.thecvf.com

We introduce the MAsked Generative VIdeo Transformer, MAGVIT, to tackle various
video synthesis tasks with a single model. We introduce a 3D tokenizer to quantize a video …

Cited by 96

Modelscope text-to-video technical report

J Wang, H Yuan, D Chen, Y Zhang, X Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

This paper introduces ModelScopeT2V, a text-to-video synthesis model that evolves from a
text-to-image synthesis model (i.e., Stable Diffusion). ModelScopeT2V incorporates spatio …

Cited by 151

Simda: Simple diffusion adapter for efficient video generation

Z Xing, Q Dai, H Hu, Z Wu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

The recent wave of AI-generated content has witnessed the great development and success
of Text-to-Image (T2I) technologies. By contrast, Text-to-Video (T2V) still falls short of …

Cited by 34

Diffusion models for video prediction and infilling

Diffusion models in vision: A survey
