AI Paper Digest
ViT pre-trained with an LM objective instead of CLIP
14 papers selected from 218
Top picks
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
score 8
Featured in HF Daily Papers; HF popularity: 19 upvotes (+3); code available
Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring
score 6
Featured in HF Daily Papers; code available; keywords (4): scaling, post-training, code generation, open-source
Let ViT Speak: Generative Language-Image Pre-training
score 6
Featured in HF Daily Papers; code available; keywords (2): pre-training, pretraining
Also worth a look
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
score 4
Keywords (2): function calling, reasoning; accepted at top venue: ICML
Escaping Mode Collapse in LLM Generation via Geometric Regulation
score 4
Keywords (1): lightweight; accepted at top venue: ICML
A11y-Compressor: A Framework for Enhancing the Efficiency of GUI Agent Observations through Visual Context Reconstruction and Redundancy Reduction
score 4
Keywords (1): lightweight; accepted at top venue: ACL
Federated Distillation for Whole Slide Image via Gaussian-Mixture Feature Alignment and Curriculum Integration
score 4
Keywords (2): compression, distillation; accepted at top venue: ICML
Jailbreaking Vision-Language Models Through the Visual Modality
score 4
Keywords (3): post-training, vision-language, jailbreak; accepted at top venue: ICML
Group Cognition Learning: Making Everything Better Through Governed Two-Stage Agents Collaboration
score 3
Accepted at top venue: ICML
Mesh Field Theory: Port-Hamiltonian Formulation of Mesh-Based Physics
score 3
Accepted at top venue: ICML
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer
score 3
Featured in HF Daily Papers
Possibilistic Predictive Uncertainty for Deep Learning
score 3
Accepted at top venue: ICML
NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search
score 3
Accepted at top venue: ICML
Map2World: Segment Map Conditioned Text to 3D World Generation
score 3
Featured in HF Daily Papers