热点
"优化方法" 相关文章
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces
cs.AI updates on arXiv.org 2025-07-25T04:28:38.000000Z
WhisperKit: On-device Real-time ASR with Billion-Scale Transformers
cs.AI updates on arXiv.org 2025-07-16T04:28:35.000000Z
Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training
cs.AI updates on arXiv.org 2025-07-15T04:26:55.000000Z
wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models
cs.AI updates on arXiv.org 2025-07-15T04:24:22.000000Z
Optimisation Is Not What You Need
cs.AI updates on arXiv.org 2025-07-08T05:54:00.000000Z
算力生意,风云突变!
特大号 2025-04-09T10:10:16.000000Z
DeepSeek-R1自写CUDA内核跑分屠榜!斯坦福学霸狂飙GPU编程自动化挑战人类
智源社区 2025-02-28T09:21:28.000000Z
Adaptive Inference Budget Management in Large Language Models through Constrained Policy Optimization
MarkTechPost@AI 2025-02-10T05:35:10.000000Z
o1也会「想太多」?腾讯AI Lab与上海交大揭秘o1模型过度思考问题
机器之心 2025-01-08T07:39:55.000000Z
From Scale to Density: A New AI Framework for Evaluating Large Language Models
MarkTechPost@AI 2024-12-10T05:34:56.000000Z
活动报名|LLM Alignment综述及RLHF、DPO、UNA的深入分析
智源社区 2024-09-19T08:38:16.000000Z