2025.05.01 | 阿拉伯语变音难题新解；深度推理模型能力增强

本期的 14 篇论文如下：

[00:21] 🗣 Sadeed: Advancing Arabic Diacritization Through Small Language Model（Sadeed：通过小型语言模型推进阿拉伯语变音）

[01:05] 🔎 WebThinker: Empowering Large Reasoning Models with Deep Research Capability（WebThinker：利用深度研究能力增强大型推理模型）

[01:43] 🧮 Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math（Phi-4-Mini-Reasoning：探索小型推理语言模型在数学方面的极限）

[02:20] 💡 Softpick: No Attention Sink, No Massive Activations with Rectified Softmax（Softpick：一种使用修正Softmax且无注意力陷阱、无大规模激活的方法）

[03:00] 🤔 Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think（超越最终答案：你的推理轨迹揭示了超乎你想象的信息）

[03:38] 🧠 Phi-4-reasoning Technical Report（Phi-4-reasoning 技术报告）

[04:21] 🧩 COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning（COMPACT：组合式的原子到复杂视觉能力调优）

[04:59] 💡 Taming the Titans: A Survey of Efficient LLM Inference Serving（驯服泰坦：高效LLM推理服务综述）

[05:34] 🤖 Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions（用于角色动画的生成式人工智能：技术、应用与未来方向的综合综述）

[06:09] 🤖 RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning（RoboVerse：面向可扩展和泛化机器人学习的统一平台、数据集和基准）

[06:49] 🎬 ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction（ReVision：基于显式3D物理建模的高质量、低成本复杂运动与交互视频生成）

[07:32] 🛡 Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report（Llama-3.1-FoundationAI-SecurityLLM-Base-8B 技术报告）

[08:08] 🩻 UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation（UniBiomed：用于Grounded生物医学图像解读的通用基础模型）

[08:53] 🗳 Selecting Optimal Candidate Profiles in Adversarial Environments Using Conjoint Analysis and Machine Learning（在对抗环境中利用联合分析和机器学习选择最优候选人形象）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

Fish AI Reader

FishAI

联系邮箱 441953276@qq.com

相关标签