Paper Notes

Tag: text-to-image

4 notes with this tag.

  • May 2026

    Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria

  • May 2026

    Reinforce Adjoint Matching: Scaling RL Post-Training of Diffusion and Flow-Matching Models

  • April 2026

    FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

  • April 2026

    V-GRPO - Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think

