- Mar 1, 2026: Latent Reasoning's Gains Aren't From Reasoning [Daily]
- Feb 25, 2026: Token Probabilities as Zero-Shot Rewards Hit 0.95 Correlation [Daily]
- Feb 21, 2026: Agents Score Higher but Fail the Same Way [Daily]
- Feb 20, 2026: Example Pairs Replace Prompts, Agents Play Favorites [Daily]
- Feb 12, 2026: Text Diffusion Hits Practical Speed, RL Spreads Everywhere [Daily]
- Feb 10, 2026: LinkedIn Ships LLM-Powered Search Ranking at Scale [Daily]
- Feb 5, 2026: Kimi K2.5 Open-Sources Agent Swarm, CoT Plans Only 2-3 Steps Ahead [Daily]
- Feb 4, 2026: Better SFT Makes Worse RL, Distillation Waste, Reward Circuits [Daily]
- Feb 3, 2026: Zero-Cost Data Mix Search, Guided RLVR, Selective SFT [Daily]
- Feb 1, 2026: Open-Source Deep Research Beats GPT-5, Embedding Scaling Outshines Experts [Daily]