Paper sources | Two rounds of iteration lift SWE-bench from 43 to 64

Highlights

OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation score 10
Featured in HF Daily Papers; HF popularity: 26 upvotes (+4); code available; keywords (1): distillation
GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine? score 10
Featured in HF Daily Papers; HF popularity: 42 upvotes (+4); code available; keywords (1): coding
Variable-Width Transformers score 8
Featured in HF Daily Papers; HF popularity: 5 upvotes (+2); code available; keywords (2): scaling, MoE
Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification score 9
Featured in HF Daily Papers; HF popularity: 11 upvotes (+3); code available; keywords (4): scaling, quantization, fine-tuning, pre-training
ActWorld: From Explorable to Interactive World Model via Action-Aware Memory score 6
Featured in HF Daily Papers; HF popularity: 6 upvotes (+2); keywords (3): compression, real-time, reasoning
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients score 8
Featured in HF Daily Papers; HF popularity: 48 upvotes (+4); keywords (3): distillation, GRPO, vision-language