论文来源 | 梯度提升竟是扩散训练最优解

重点关注

When Do Diffusion Models learn to Generate Multiple Objects? score 8
入选 HF Daily Papers；HF 热度: 6 upvotes (+2)；有代码实现；关键词(1): text-to-image
Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance score 6
入选 HF Daily Papers；HF 热度: 15 upvotes (+3)
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models score 6
入选 HF Daily Papers；HF 热度: 6 upvotes (+2)；关键词(1): distillation
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning score 7
入选 HF Daily Papers；HF 热度: 14 upvotes (+3)；关键词(8): scaling, lightweight, fine-tuning, PPO, GRPO

Online Self-Calibration Against Hallucination in Vision-Language Models score 5
入选 HF Daily Papers；HF 热度: 2 upvotes (+1)；关键词(1): vision-language