-
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
score 10
入选 HF Daily Papers;HF 热度: 135 upvotes (+4);有代码实现;关键词(2): scaling, reasoning
-
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
score 10
入选 HF Daily Papers;HF 热度: 48 upvotes (+4);有代码实现;关键词(1): reasoning
-
Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video
score 10
入选 HF Daily Papers;HF 热度: 34 upvotes (+4);有代码实现;关键词(3): lightweight, finetuning, post-training
-
Self-Distilled Agentic Reinforcement Learning
score 10
入选 HF Daily Papers;HF 热度: 75 upvotes (+4);有代码实现;关键词(4): distillation, GRPO, post-training, agentic
-
Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems
score 10
入选 HF Daily Papers;HF 热度: 42 upvotes (+4);有代码实现;关键词(2): tool use, reasoning
-
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn
score 8
入选 HF Daily Papers;HF 热度: 28 upvotes (+4);关键词(3): scaling, fine-tuning, reasoning
-
Retrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented Generation
score 8
入选 HF Daily Papers;HF 热度: 8 upvotes (+2);有代码实现;关键词(3): retrieval-augmented, RAG, reasoning
-
Orchard: An Open-Source Agentic Modeling Framework
score 9
入选 HF Daily Papers;HF 热度: 12 upvotes (+3);有代码实现;关键词(7): lightweight, agentic, tool use, coding, reasoning
-
Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion
score 9
入选 HF Daily Papers;HF 热度: 10 upvotes (+3);有代码实现;关键词(2): lightweight, throughput
-
RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation
score 8
入选 HF Daily Papers;HF 热度: 7 upvotes (+2);有代码实现;关键词(1): reasoning