Paper Notes

Home

❯

Multimodal Generation

❯

Video & Audio Video Generation

文件夹: Multimodal-Generation/Video--and--Audio-Video-Generation

此文件夹下有13条笔记。

  • 2026年5月

    Stream-R1

  • 2026年5月

    UniVidX

  • 2026年5月

    Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

  • 2026年4月

    Seeing Fast and Slow - Learning the Flow of Time in Videos

  • 2026年3月

    CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

  • 2026年3月

    Improving Joint Audio-Video Generation with Cross-Modal Context Learning

  • 2026年3月

    Kling-MotionControl Technical Report

  • 2026年3月

    ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

  • 2026年1月

    LTX-2: Efficient Joint Audio-Visual Foundation Model

  • 2025年12月

    Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

  • 2025年5月

    OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

  • MOVA - Scalable and Synchronized Video-Audio Generation

  • ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation


Created with Quartz v4.5.2 © 2026

  • Source