- Apr 15, 2026 dLLMs Hallucinate Differently, PRM Labeling Cost Drops 100x Daily
- Apr 12, 2026 Scrambled Media Boosts Reasoning; 6B Model Tops GPT-4o Daily
- Apr 10, 2026 Entropy Is Lying to You, Implicit Reasoning Tops Out at 7 Steps Daily
- Apr 9, 2026 120B on One GPU, and 40% of Video Benchmarks Are Guessable Daily
- Apr 6, 2026 Open-Source 32B Cracks Hardware Code, Agents Score Just 23% Daily
- Apr 4, 2026 Single Neurons Remember Entities, Reusable Routines Boost 19% Daily
- Apr 3, 2026 Minimalist Agents Match MCP, Code Models Think Mid-Stream Daily
- Apr 2, 2026 Data Mixing Becomes Post-Training, Surface Cues Hijack Reasoning 38x Daily
- Mar 29, 2026 Mistral Ships TTS, Diffusion LLMs Get 4.7x Faster Daily
- Mar 25, 2026 PDEs Beat Attention 2x, Local RL Saves 3/4 Compute Daily
- Mar 24, 2026 Seed1.8 Goes Agent-Native, Language Training Erodes Vision Daily
- Mar 23, 2026 12B Beats GPT-4, Distilled Students Surpass Teachers Daily
- Mar 19, 2026 Open-Source Search Agent Wins With 12K Samples, Agent Skills Mostly Fail Daily
- Mar 18, 2026 700K Paper Pairs Distill Taste, Null Spaces Expose Blind Spots Daily
- Mar 17, 2026 Expert Reasoning Structure for CoT, +13% on Novel Class Discovery Daily
- Mar 16, 2026 Budget-Aware Agents Beat 4x Brute-Force Sampling Daily
- Mar 14, 2026 Encode the Answer, Not the Question — Embeddings Gain 9% Daily
- Mar 11, 2026 4-Step Diffusion Beats 100-Step Baselines, Layer Skipping Saves 18% Daily
- Mar 10, 2026 12k Samples Beat Finance SOTA, CUDA Optimization 35% Faster Daily
- Mar 9, 2026 Drop CLIP, Gain Performance: VLMs Work Better Without It Daily
- Mar 7, 2026 14B Video Model Runs Real-Time on a Single GPU Daily
- Mar 3, 2026 Spectral Conditions Unify μP Scaling, Data Curation Leaks Privacy Daily
- Mar 1, 2026 Latent Reasoning's Gains Aren't From Reasoning Daily
- Feb 28, 2026 Tri-Modal Training From Scratch, Agentic RL Gets a Stability Fix Daily
- Feb 27, 2026 TTT Is Linear Attention, Terminal Agent Data Recipe Goes Open Daily
- Feb 24, 2026 74% of Agent Coordination May Be Wasted Effort Daily
- Feb 23, 2026 Model Folding Beats Pruning, XR Gets Hand-Level Control Daily
- Feb 20, 2026 Example Pairs Replace Prompts, Agents Play Favorites Daily
- Feb 19, 2026 Spectral Decay Recovers 7% Accuracy in W4A4 Quantization Daily
- Feb 18, 2026 Binary Tokens Make Image Gen 30x Faster, RL Training Learns to Reflect Daily
- Feb 13, 2026 AI Solves Real Open Math Problems, World Models Everywhere Daily
- Feb 8, 2026 Diffusion Drafting Hits 6x Speedup, 14B Beats Claude at Kernels Daily
- Feb 7, 2026 Trillion-Parameter Multimodal, 4B Agents Match 671B, PPO Exposed Daily
- Feb 6, 2026 256 Tokens Match Full Attention, Agents That Build Agents Daily