-
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?
score 10
入选 HF Daily Papers;HF 热度: 37 upvotes (+4);有代码实现;关键词(3): distillation, post-training, reasoning
-
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents
score 8
入选 HF Daily Papers;HF 热度: 84 upvotes (+4);关键词(2): scaling, reasoning
-
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
score 10
入选 HF Daily Papers;HF 热度: 35 upvotes (+4);有代码实现;关键词(2): distillation, fine-tuning
-
Understanding the Challenges in Iterative Generative Optimization with LLMs
score 8
入选 HF Daily Papers;HF 热度: 18 upvotes (+3);有代码实现
-
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare
score 8
入选 HF Daily Papers;HF 热度: 8 upvotes (+2);有代码实现;关键词(4): agentic, reasoning, vision-language, open-source
-
GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents
score 7
入选 HF Daily Papers;HF 热度: 17 upvotes (+3);关键词(4): agentic, reasoning, robotics, embodied
-
SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision
score 7
入选 HF Daily Papers;HF 热度: 11 upvotes (+3);关键词(1): real-time
-
OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning
score 5
入选 HF Daily Papers;HF 热度: 4 upvotes (+1);关键词(3): pretraining, reasoning, open-source