-
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning
score 10
入选 HF Daily Papers;HF 热度: 52 upvotes (+4);有代码实现;关键词(6): lightweight, serving, latency, throughput, agentic
-
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models
score 10
入选 HF Daily Papers;HF 热度: 40 upvotes (+4);有代码实现;关键词(1): compression
-
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
score 10
入选 HF Daily Papers;HF 热度: 34 upvotes (+4);有代码实现;关键词(3): fine-tuning, GRPO, reasoning
-
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
score 10
入选 HF Daily Papers;HF 热度: 21 upvotes (+4);有代码实现;关键词(3): GRPO, reasoning, vision-language
-
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG
score 9
入选 HF Daily Papers;HF 热度: 77 upvotes (+4);有代码实现
-
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM
score 8
入选 HF Daily Papers;HF 热度: 35 upvotes (+4);关键词(1): embodied
-
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation
score 8
入选 HF Daily Papers;HF 热度: 30 upvotes (+4);关键词(4): scaling, GRPO, post-training, reasoning
-
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions
score 8
入选 HF Daily Papers;HF 热度: 3 upvotes (+1);关键词(3): lightweight, reasoning, vision-language;顶会接收: CVPR
-
RealMaster: Lifting Rendered Scenes into Photorealistic Video
score 7
入选 HF Daily Papers;HF 热度: 24 upvotes (+4)
-
Reconstruction-Guided Slot Curriculum: Addressing Object Over-Fragmentation in Video Object-Centric Learning
score 7
入选 HF Daily Papers;HF 热度: 2 upvotes (+1);有代码实现;关键词(1): edge