Paper Notes

标签: paper/llm-vlm/evaluation-analysis

此标签下有15条笔记。

  • 2026年4月

    Image Generators are Generalist Vision Learners

  • 2026年3月

    Attend Before Attention - Efficient and Scalable Video Understanding via Autoregressive Gazing

  • 2026年2月

    Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory

  • 2025年12月

    ThinkGen: Generalized Thinking for Visual Generation

  • 2025年5月

    LLMs Get Lost In Multi-Turn Conversation

  • 2025年3月

    Streaming Video Question-Answering with In-context Video KV-Cache Retrieval

  • E-VRAG: Enhancing Long Video Understanding with Resource-Efficient Retrieval Augmented Generation

  • Is Your Video Language Model a Reliable Judge?

  • LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval

  • Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model

  • RIVER: A Real-Time Interaction Benchmark for Video LLMs

  • StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression

  • UNIVERSE: Adapting Vision-Language Models for Evaluating World Models

  • V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval

  • Video Panels for Long Video Understanding


Created with Quartz v4.5.2 © 2026

  • Source