2025.06.27 | Reinforcement learning improves search efficiency; memory augmentation generates realistic driving scenes.

The 15 papers in this episode are:

00:25 🔍 MMSearch-R1: Incentivizing LMMs to Search

00:59 🚗 MADrive: Memory-Augmented Driving Scene Modeling

01:43 🤖 WorldVLA: Towards Autoregressive Action World Model

02:23 💡 Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

03:14 🤖 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

04:00 🚗 SAM4D: Segment Anything in Camera and LiDAR Streams

04:40 🎨 FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

05:16 🤖 Whole-Body Conditioned Egocentric Video Prediction

05:53 🧠 Arch-Router: Aligning LLM Routing with Human Preferences

06:35 🎨 FairyGen: Storied Cartoon Video from a Single Child-Drawn Character

07:12 🌐 DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster

07:55 🧬 An Agentic System for Rare Disease Diagnosis with Traceable Reasoning

08:35 🤖 HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

09:18 🦘 Learning to Skip the Middle Layers of Transformers

09:57 🎵 MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners

[Follow Us]

You can also find us on the following platforms for more information beyond the podcast:

Xiaohongshu (小红书): AI速递

Fish AI Reader