Sources | Two Loops Take SWE-bench From 43 to 64

Featured

OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation score 10
Selected for HF Daily Papers; HF popularity: 26 upvotes (+4); code available; keywords (1): distillation
GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine? score 10
Selected for HF Daily Papers; HF popularity: 42 upvotes (+4); code available; keywords (1): coding
Variable-Width Transformers score 8
Selected for HF Daily Papers; HF popularity: 5 upvotes (+2); code available; keywords (2): scaling, MoE
Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification score 9
Selected for HF Daily Papers; HF popularity: 11 upvotes (+3); code available; keywords (4): scaling, quantization, fine-tuning, pre-training
ActWorld: From Explorable to Interactive World Model via Action-Aware Memory score 6
Selected for HF Daily Papers; HF popularity: 6 upvotes (+2); keywords (3): compression, real-time, reasoning
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients score 8
Selected for HF Daily Papers; HF popularity: 48 upvotes (+4); keywords (3): distillation, GRPO, vision-language