-
MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling
score 7
入选 HF Daily Papers;HF 热度: 5 upvotes (+2);关键词(6): efficient, efficiency, inference, transformer, attention
-
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
score 9
入选 HF Daily Papers;HF 热度: 72 upvotes (+4);关键词(9): efficient, lightweight, deployment, fine-tuning, GRPO
-
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
score 9
入选 HF Daily Papers;HF 热度: 55 upvotes (+4);关键词(4): scaling, distillation, code generation, reasoning
-
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
score 9
入选 HF Daily Papers;HF 热度: 45 upvotes (+4);关键词(4): deployment, reasoning, vision-language, benchmark
-
Thinking with Drafting: Optical Decompression via Logical Reconstruction
score 9
入选 HF Daily Papers;HF 热度: 31 upvotes (+4);关键词(3): reasoning, multimodal, benchmark
-
LawThinker: A Deep Research Legal Agent in Dynamic Environments
score 9
入选 HF Daily Papers;HF 热度: 31 upvotes (+4);关键词(3): agent, reasoning, benchmark
-
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
score 9
入选 HF Daily Papers;HF 热度: 26 upvotes (+4);关键词(2): scaling, reasoning
-
Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching
score 9
入选 HF Daily Papers;HF 热度: 26 upvotes (+4);关键词(2): distillation, serving
-
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
score 8
入选 HF Daily Papers;HF 热度: 86 upvotes (+4);关键词(1): reasoning
-
dVoting: Fast Voting for dLLMs
score 8
入选 HF Daily Papers;HF 热度: 19 upvotes (+3);关键词(4): scaling, fast, diffusion, reasoning