- Mar 1, 2026 Latent Reasoning's Gains Aren't From Reasoning Daily
- Feb 28, 2026 Tri-Modal Training From Scratch, Agentic RL Gets a Stability Fix Daily
- Feb 27, 2026 TTT Is Linear Attention, Terminal Agent Data Recipe Goes Open Daily
- Feb 26, 2026 11 Agent Failure Modes From Red-Teaming, Step-Level Routing Cuts Cost 700x Daily
- Feb 25, 2026 Token Probabilities as Zero-Shot Rewards Hit 0.95 Correlation Daily
- Feb 24, 2026 74% of Agent Coordination May Be Wasted Effort Daily
- Feb 23, 2026 Model Folding Beats Pruning, XR Gets Hand-Level Control Daily
- Feb 21, 2026 Agents Score Higher but Fail the Same Way Daily
- Feb 20, 2026 Example Pairs Replace Prompts, Agents Play Favorites Daily
- Feb 19, 2026 Spectral Decay Recovers 7% Accuracy in W4A4 Quantization Daily
- Feb 18, 2026 Binary Tokens Make Image Gen 30x Faster, RL Training Learns to Reflect Daily
- Feb 17, 2026 Online RL Cracks Web Agents, Reward Models Learn to Look Backward Daily
- Feb 16, 2026 Vertical AI Is Winning: Medical, Robotics, and Science Agents Daily
- Feb 15, 2026 Running Out of RL Training Data? Just Combine the Easy Problems Daily
- Feb 14, 2026 11B Active Parameters Hit Frontier-Level Agent Intelligence Daily
- Feb 12, 2026 Text Diffusion Hits Practical Speed, RL Spreads Everywhere Daily
- Feb 11, 2026 Agent Bottlenecks Are Shifting From Models to Systems Daily
- Feb 10, 2026 LinkedIn Ships LLM-Powered Search Ranking at Scale Daily
- Feb 9, 2026 Medical LLMs Should Ask Questions, Not Just Answer Them Daily
- Feb 7, 2026 Trillion-Parameter Multimodal, 4B Agents Match 671B, PPO Exposed Daily
- Feb 5, 2026 Kimi K2.5 Open-Sources Agent Swarm, CoT Plans Only 2-3 Steps Ahead Daily
- Feb 3, 2026 Zero-Cost Data Mix Search, Guided RLVR, Selective SFT Daily
- Feb 1, 2026 Open-Source Deep Research Beats GPT-5, Embedding Scaling Outshines Experts Daily