-
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition
score 9
Featured in HF Daily Papers; HF popularity: 29 upvotes (+4); Keywords (4): fine-tuning, preference, multimodal, benchmark
-
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
score 11
Featured in HF Daily Papers; HF popularity: 57 upvotes (+4); Keywords (8): scaling, efficiency, fast, alignment, diffusion; widely discussed on Reddit r/ML
-
WorldCompass: Reinforcement Learning for Long-Horizon World Models
score 11
Institution: Tencent; Featured in HF Daily Papers; HF popularity: 18 upvotes (+3); Keywords (5): efficient, efficiency, fine-tuning, post-training, open-source
-
iGRPO: Self-Feedback-Driven LLM Reasoning
score 8
Featured in HF Daily Papers; HF popularity: 13 upvotes (+3); Keywords (4): efficient, PPO, GRPO, reasoning
-
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
score 9
Featured in HF Daily Papers; HF popularity: 235 upvotes (+4); Keywords (6): inference, post-training, agents, code generation, reasoning
-
UI-Venus-1.5 Technical Report
score 9
Featured in HF Daily Papers; HF popularity: 147 upvotes (+4); Keywords (2): agent, agents
-
MOVA: Towards Scalable and Synchronized Video-Audio Generation
score 9
Featured in HF Daily Papers; HF popularity: 144 upvotes (+4); Keywords (9): efficient, inference, fine-tuning, MoE, multimodal
-
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
score 9
Featured in HF Daily Papers; HF popularity: 61 upvotes (+4); Keywords (2): agentic, reasoning
-
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
score 9
Featured in HF Daily Papers; HF popularity: 61 upvotes (+4); Keywords (5): distillation, agent, agents, reasoning, search
-
Improving Data and Reward Design for Scientific Reasoning in Large Language Models
score 9
Featured in HF Daily Papers; HF popularity: 37 upvotes (+4); Keywords (4): post-training, reasoning, evaluation, open-source