- Apr 15, 2026 dLLMs Hallucinate Differently, PRM Labeling Cost Drops 100x Daily
- Apr 14, 2026 SFT Convergence Hides Failures, Attention Hijacking Hits 94% Daily
- Apr 11, 2026 1.7x Faster From Fine-Tuning Alone, Token Collapse Misdiagnosed Daily
- Apr 10, 2026 Entropy Is Lying to You, Implicit Reasoning Tops Out at 7 Steps Daily
- Apr 8, 2026 Streaming Video QA Hits 2 FPS, RLVR Shrugs Off Noisy Labels Daily
- Apr 6, 2026 Open-Source 32B Cracks Hardware Code, Agents Score Just 23% Daily
- Apr 5, 2026 4M Game Frames Train Rendering, Internalized Skills Beat Retrieval Daily
- Apr 3, 2026 Minimalist Agents Match MCP, Code Models Think Mid-Stream Daily
- Apr 2, 2026 Data Mixing Becomes Post-Training, Surface Cues Hijack Reasoning 38x Daily
- Mar 29, 2026 Mistral Ships TTS, Diffusion LLMs Get 4.7x Faster Daily
- Mar 27, 2026 Speculative Execution Hits Agent Loops, 3x Faster Daily
- Mar 26, 2026 Diffusion OCR Decodes 3.2x Faster, Single-Stream AV in 2 Seconds Daily
- Mar 25, 2026 PDEs Beat Attention 2x, Local RL Saves 3/4 Compute Daily
- Mar 24, 2026 Seed1.8 Goes Agent-Native, Language Training Erodes Vision Daily
- Mar 21, 2026 3D at 0.1% Tokens, Video Fine-Tuning's Hidden Spatial Cost Daily
- Mar 20, 2026 First 32B Industrial Code Model, War-Tested Reasoning Eval Daily
- Mar 19, 2026 Open-Source Search Agent Wins With 12K Samples, Agent Skills Mostly Fail Daily
- Mar 17, 2026 Expert Reasoning Structure for CoT, +13% on Novel Class Discovery Daily
- Mar 12, 2026 Write Code Before You Draw, Layouts Improve 68% Daily
- Mar 11, 2026 4-Step Diffusion Beats 100-Step Baselines, Layer Skipping Saves 18% Daily
- Mar 10, 2026 12k Samples Beat Finance SOTA, CUDA Optimization 35% Faster Daily
- Mar 8, 2026 "Be Concise" Halves Tokens, Lifts Accuracy by 16 Points Daily
- Mar 6, 2026 Code Agents Can't Cross Repo Boundaries, Under 45% Success Daily
- Mar 5, 2026 Direct Lottie Generation, DPO's Built-In Forgetting Defense Daily
- Mar 4, 2026 9K Samples Rival R1, Most RL Gains Trace Back to SFT Daily
- Mar 3, 2026 Spectral Conditions Unify μP Scaling, Data Curation Leaks Privacy Daily
- Feb 28, 2026 Tri-Modal Training From Scratch, Agentic RL Gets a Stability Fix Daily
- Feb 27, 2026 TTT Is Linear Attention, Terminal Agent Data Recipe Goes Open Daily
- Feb 20, 2026 Example Pairs Replace Prompts, Agents Play Favorites Daily
- Feb 18, 2026 Binary Tokens Make Image Gen 30x Faster, RL Training Learns to Reflect Daily
- Feb 16, 2026 Vertical AI Is Winning: Medical, Robotics, and Science Agents Daily
- Feb 15, 2026 Running Out of RL Training Data? Just Combine the Easy Problems Daily
- Feb 14, 2026 11B Active Parameters Hit Frontier-Level Agent Intelligence Daily
- Feb 12, 2026 Text Diffusion Hits Practical Speed, RL Spreads Everywhere Daily
- Feb 9, 2026 Medical LLMs Should Ask Questions, Not Just Answer Them Daily
- Feb 4, 2026 Better SFT Makes Worse RL, Distillation Waste, Reward Circuits Daily
- Feb 2, 2026 Unlimited RLVR Data From Web Text, FP4 Pretraining Matches BF16 Daily
- Feb 1, 2026 Open-Source Deep Research Beats GPT-5, Embedding Scaling Outshines Experts Daily