-
Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening
score 9
Featured in HF Daily Papers; HF popularity: 63 upvotes (+4); keywords (9): efficient, efficiency, lightweight, latency, agent
-
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR
score 9
Featured in HF Daily Papers; HF popularity: 45 upvotes (+4); keywords (4): GRPO, reasoning, multimodal, vision-language
-
Context Forcing: Consistent Autoregressive Video Generation with Long Context
score 9
Featured in HF Daily Papers; HF popularity: 29 upvotes (+4); keywords (3): fast, real-time, evaluation
-
DFlash: Block Diffusion for Flash Speculative Decoding
score 9
Featured in HF Daily Papers; HF popularity: 27 upvotes (+4); keywords (6): efficient, fast, lightweight, inference, latency
-
RISE-Video: Can Video Generators Decode Implicit World Rules?
score 9
Featured in HF Daily Papers; HF popularity: 26 upvotes (+4); keywords (5): alignment, reasoning, multimodal, benchmark, evaluation
-
ProAct: Agentic Lookahead in Interactive Environments
score 9
Featured in HF Daily Papers; HF popularity: 22 upvotes (+4); keywords (13): lightweight, distillation, inference, fine-tuning, PPO
-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
score 9
Featured in HF Daily Papers; HF popularity: 20 upvotes (+4); keywords (2): scaling, GRPO
-
InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions
score 8
Featured in HF Daily Papers; HF popularity: 18 upvotes (+3); keywords (6): scaling, deployment, finetuning, post-training, pretraining
-
Semantic Search over 9 Million Mathematical Theorems
score 8
Featured in HF Daily Papers; HF popularity: 17 upvotes (+3); keywords (3): agents, search, evaluation
-
Reinforcement World Model Learning for LLM-based Agents
score 8
Featured in HF Daily Papers; HF popularity: 17 upvotes (+3); keywords (2): agents, agentic