Paper Notes
Tag: text-to-image
There are 4 notes under this tag.
May 2026
Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria
May 2026
Reinforce Adjoint Matching: Scaling RL Post-Training of Diffusion and Flow-Matching Models
April 2026
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling
April 2026
V-GRPO - Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think