-
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training
score 10
入选 HF Daily Papers;HF 热度: 24 upvotes (+4);有代码实现;关键词(2): fine-tuning, post-training
-
Make Each Token Count: Towards Improving Long-Context Performance with KV Cache Eviction
score 9
入选 HF Daily Papers;HF 热度: 10 upvotes (+3);有代码实现;关键词(4): lightweight, compression, reasoning, vision-language
-
Metal-Sci: A Scientific Compute Benchmark for Evolutionary LLM Kernel Search on Apple Silicon
score 7
入选 HF Daily Papers;HF 热度: 3 upvotes (+1);有代码实现;关键词(1): lightweight
-
Shaping Schema via Language Representation as the Next Frontier for LLM Intelligence Expanding
score 6
入选 HF Daily Papers;HF 热度: 5 upvotes (+2);关键词(1): scaling
-
Crosslingual On-Policy Self-Distillation for Multilingual Reasoning
score 6
入选 HF Daily Papers;有代码实现;关键词(4): scaling, distillation, GRPO, reasoning
-
Reinforcing Multimodal Reasoning Against Visual Degradation
score 5
入选 HF Daily Papers;HF 热度: 4 upvotes (+1);关键词(4): compression, fine-tuning, GRPO, reasoning
-
DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification
score 5
入选 HF Daily Papers;HF 热度: 4 upvotes (+1);关键词(1): reasoning