Optimization Methods_Fishai

Trending

Articles related to "Optimization Methods"

Zeroth-Order Fine-Tuning of LLMs in Random Subspaces

cs.AI updates on arXiv.org 2025-07-25T04:28:38.000000Z

WhisperKit: On-device Real-time ASR with Billion-Scale Transformers

cs.AI updates on arXiv.org 2025-07-16T04:28:35.000000Z

Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training

cs.AI updates on arXiv.org 2025-07-15T04:26:55.000000Z

wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models

cs.AI updates on arXiv.org 2025-07-15T04:24:22.000000Z

Optimisation Is Not What You Need

cs.AI updates on arXiv.org 2025-07-08T05:54:00.000000Z

The Compute Business: A Sudden Shift!

特大号 2025-04-09T10:10:16.000000Z

DeepSeek-R1 Writes Its Own CUDA Kernels and Tops the Benchmarks! Stanford Researchers Push GPU Programming Automation to Challenge Humans

智源社区 2025-02-28T09:21:28.000000Z

Adaptive Inference Budget Management in Large Language Models through Constrained Policy Optimization

MarkTechPost@AI 2025-02-10T05:35:10.000000Z

Does o1 Also "Overthink"? Tencent AI Lab and Shanghai Jiao Tong University Reveal the Overthinking Problem in o1 Models

机器之心 2025-01-08T07:39:55.000000Z

From Scale to Density: A New AI Framework for Evaluating Large Language Models

MarkTechPost@AI 2024-12-10T05:34:56.000000Z

Event Registration | A Survey of LLM Alignment and an In-Depth Analysis of RLHF, DPO, and UNA

智源社区 2024-09-19T08:38:16.000000Z

Copyright © 2019 FISHAI. All Rights Reserved