Skip to content

Latest commit

 

History

History
354 lines (242 loc) · 33.9 KB

video_generation.md

File metadata and controls

354 lines (242 loc) · 33.9 KB

Video Generation

Survey

Generation

Animation

  • DisPose: Disentangling Pose Guidance for Controllable Human Image Animation, arXiv, 2412.09349, arxiv, pdf, cication: -1

    Hongxiang Li, Yaowei Li, Yuhang Yang, ..., Xuxin Cheng, Long Chen · (lihxxx.github) · (DisPose - lihxxx) Star

  • An image-to-video model by CreateAI. 🤗

    · (Ruyi-Models - IamCreateAI) Star

  • DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses, arXiv, 2412.00397, arxiv, pdf, cication: -1

    Yatian Pang, Bin Zhu, Bin Lin, ..., Harry Yang, Li Yuan

  • 🌟 StableAnimator: High-Quality Identity-Preserving Human Image Animation, arXiv, 2411.17697, arxiv, pdf, cication: -1

    Shuyuan Tu, Zhen Xing, Xintong Han, ..., Chong Luo, Zuxuan Wu · (StableAnimator - Francis-Rings) Star

  • Trajectory Attention for Fine-grained Video Motion Control, arXiv, 2411.19324, arxiv, pdf, cication: -1

    Zeqi Xiao, Wenqi Ouyang, Yifan Zhou, ..., Jianlou Si, Xingang Pan

  • AnimateAnything: Consistent and Controllable Animation for Video Generation, arXiv, 2411.10836, arxiv, pdf, cication: -1

    Guojun Lei, Chi Wang, Hong Li, ..., Yikai Wang, Weiwei Xu · (yu-shaonian.github)

  • FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations, arXiv, 2411.10818, arxiv, pdf, cication: -1

    Hmrishav Bandyopadhyay, Yi-Zhe Song · (hmrishavbandy.github)

  • EasyAnimate - aigc-apps Star

  • SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation, arXiv, 2411.04989, arxiv, pdf, cication: -1

    Koichi Namekata, Sherwin Bahmani, Ziyi Wu, ..., Igor Gilitschenski, David B. Lindell · (kmcode1.github)

  • HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models, arXiv, 2410.22901, arxiv, pdf, cication: -1

    Shengkai Zhang, Nianhong Jiao, Tian Li, ..., Boya Niu, Jun Gao · (HelloMeme - HelloVision) Star · (songkey.github)

  • CamI2V: Camera-Controlled Image-to-Video Diffusion Model, arXiv, 2410.15957, arxiv, pdf, cication: -1

    Guangcong Zheng, Teng Li, Rui Jiang, ..., Tao Wu, Xi Li · (zgctroy.github) · (CamI2V - ZGCTroy) Star

  • FrameBridge: Improving Image-to-Video Generation with Bridge Models

  • Animate-X: Universal Character Image Animation with Enhanced Motion Representation, arXiv, 2410.10306, arxiv, pdf, cication: -1

    Shuai Tan, Biao Gong, Xiang Wang, ..., Jingdong Chen, Ming Yang · (lucaria-academy.github)

  • DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control, arXiv, 2410.13830, arxiv, pdf, cication: -1

    Yujie Wei, Shiwei Zhang, Hangjie Yuan, ..., Yingya Zhang, Hongming Shan · (dreamvideo2.github)

Evaluation

  • 🌟 Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models, arXiv, 2412.09645, arxiv, pdf, cication: -1

    Fan Zhang, Shulin Tian, Ziqi Huang, ..., Yu Qiao, Ziwei Liu

  • VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models, arXiv, 2411.13503, arxiv, pdf, cication: -1

    Ziqi Huang, Fan Zhang, Xiaojie Xu, ..., Yu Qiao, Ziwei Liu · (huggingface)

  • How Far is Video Generation from World Model: A Physical Law Perspective, arXiv, 2411.02385, arxiv, pdf, cication: -1

    Bingyi Kang, Yang Yue, Rui Lu, ..., Gao Huang, Jiashi Feng · (phyworld.github)

  • Artificial Analysis Video Generation Arena Leaderboard

Detection

  • Video Seal: Open and Efficient Video Watermarking, arXiv, 2412.09492, arxiv, pdf, cication: -1

    Pierre Fernandez, Hady Elsahar, I. Zeki Yalniz, ..., Alexandre Mourachko · (videoseal - facebookresearch) Star

Alignment

  • VideoDPO: Omni-Preference Alignment for Video Diffusion Generation, arXiv, 2412.14167, arxiv, pdf, cication: -1

    Runtao Liu, Haoyu Wu, Zheng Ziqiang, ..., Renjie Pi, Qifeng Chen · (videodpo.github)

  • LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment, arXiv, 2412.04814, arxiv, pdf, cication: -1

    Yibin Wang, Zhiyu Tan, Junyan Wang, ..., Cheng Jin, Hao Li · (codegoat24.github) · (LiFT - CodeGoat24) Star

Auto Regressive

  • From Slow Bidirectional to Fast Causal Video Generators, arXiv, 2412.07772, arxiv, pdf, cication: -1

    Tianwei Yin, Qiang Zhang, Richard Zhang, ..., Eli Shechtman, Xun Huang · (causvid.github)

  • Progressive Autoregressive Video Diffusion Models, arXiv, 2410.08151, arxiv, pdf, cication: -1

    Desai Xie, Zhan Xu, Yicong Hong, ..., Arie Kaufman, Yang Zhou

    · (desaixie.github)

Editting

  • 🌟 STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution, arXiv, 2501.02976, arxiv, pdf, cication: -1

    Rui Xie, Yinhong Liu, Penghao Zhou, ..., Zhenheng Yang, Ying Tai · (STAR - NJU-PCALab) Star · (arxiv) · (nju-pcalab.github)

  • SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration, arXiv, 2501.01320, arxiv, pdf, cication: -1

    Jianyi Wang, Zhijie Lin, Meng Wei, ..., Chen Change Loy, Lu Jiang · (iceclear.github)

  • Generative Video Propagation, arXiv, 2412.19761, arxiv, pdf, cication: -1

    Shaoteng Liu, Tianyu Wang, Jui-Hsien Wang, ..., Soo Ye Kim, Jiaya Jia

  • MoViE: Mobile Diffusion for Video Editing, arXiv, 2412.06578, arxiv, pdf, cication: -1

    Adil Karjauv, Noor Fathima, Ioannis Lelekas, ..., Amir Ghodrati, Amirhossein Habibian

  • DIVE: Taming DINO for Subject-Driven Video Editing, arXiv, 2412.03347, arxiv, pdf, cication: -1

    Yi Huang, Wei Xiong, He Zhang, ..., Mingfu Yan, Shifeng Chen · (dino-video-editing.github)

  • MyTimeMachine: Personalized Facial Age Transformation, arXiv, 2411.14521, arxiv, pdf, cication: -1

    Luchao Qi, Jiaye Wu, Bang Gong, ..., David W. Jacobs, Roni Sengupta · (mytimemachine.github)

  • 🌟 Generative Omnimatte Learning to Decompose Video into Layers

    · (𝕏)

  • StableV2V: Stablizing Shape Consistency in Video-to-Video Editing, arXiv, 2411.11045, arxiv, pdf, cication: -1

    Chang Liu, Rui Li, Kaidong Zhang, ..., Yunwei Lan, Dong Liu

  • Fashion-VDM: Video Diffusion Model for Virtual Try-On

  • AutoVFX: Physically Realistic Video Editing from Natural Language Instructions, arXiv, 2411.02394, arxiv, pdf, cication: -1

    Hao-Yu Hsu, Zhi-Hao Lin, Albert Zhai, ..., Hongchi Xia, Shenlong Wang · (haoyuhsu.github) · (autovfx - haoyuhsu) Star

  • 🌟 ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning, arXiv, 2411.05003, arxiv, pdf, cication: -1

    David Junhao Zhang, Roni Paiss, Shiran Zada, ..., Neal Wadhwa, Nataniel Ruiz · (generative-video-camera-controls.github)

  • Fashion-VDM: Video Diffusion Model for Virtual Try-On, arXiv, 2411.00225, arxiv, pdf, cication: -1

    Johanna Karras, Yingwei Li, Nan Liu, ..., Chris Lee, Ira Kemelmacher-Shlizerman · (johannakarras.github) · (arxiv)

  • InvokeAI - invoke-ai Star

  • ComfyUI-MochiEdit - logtd Star

  • Framer: Interactive Frame Interpolation, arXiv, 2410.18978, arxiv, pdf, cication: -1

    Wen Wang, Qiuyu Wang, Kecheng Zheng, ..., Yujun Shen, Chunhua Shen

Datasets

  • InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption, arXiv, 2412.09283, arxiv, pdf, cication: -1

    Tiehan Fan, Kepan Nan, Rui Xie, ..., Jian Yang, Ying Tai · (InstanceCap - NJU-PCALab) Star · (arxiv)

  • EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation, arXiv, 2411.08380, arxiv, pdf, cication: -1

    Xiaofeng Wang, Kang Zhao, Feng Liu, ..., Yingya Zhang, Xingang Wang · (egovid.github)

  • TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation, arXiv, 2411.04709, arxiv, pdf, cication: -1

    Wenhao Wang, Yi Yang · (tip-i2v.github.io)

Toolkits

Tutorials

Blog

Products

Misc