AI论文简报
搜索
方法论
公众号
EN
梯度提升竟是扩散训练最优解
从192篇论文中选出5篇
重点关注
When Do Diffusion Models learn to Generate Multiple Objects?
score 8
入选 HF Daily Papers;HF 热度: 6 upvotes (+2);有代码实现;关键词(1): text-to-image
Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance
score 6
入选 HF Daily Papers;HF 热度: 15 upvotes (+3)
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
score 6
入选 HF Daily Papers;HF 热度: 6 upvotes (+2);关键词(1): distillation
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning
score 7
入选 HF Daily Papers;HF 热度: 14 upvotes (+3);关键词(8): scaling, lightweight, fine-tuning, PPO, GRPO
也值得关注
Online Self-Calibration Against Hallucination in Vision-Language Models
score 5
入选 HF Daily Papers;HF 热度: 2 upvotes (+1);关键词(1): vision-language