热点
"LLM训练" 相关文章
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
cs.AI updates on arXiv.org 2025-08-01T04:08:16.000000Z
Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training
cs.AI updates on arXiv.org 2025-07-30T04:46:08.000000Z
Interviewing Ross Taylor on the state of AI: Chinese open models, scaling reasoning, useful tools, and what comes next
Interconnects 2025-07-29T13:39:03.000000Z
LLM Data Selection and Utilization via Dynamic Bi-level Optimization
cs.AI updates on arXiv.org 2025-07-23T04:03:20.000000Z
Learning What Matters: Probabilistic Task Selection via Mutual Information for Model Finetuning
cs.AI updates on arXiv.org 2025-07-18T04:13:51.000000Z
AI-Compass LLM训练框架生态:整合ms-swift、Unsloth、Megatron-LM等核心框架,涵盖全参数/PEFT训练与分布式优化
掘金 人工智能 2025-07-16T09:53:11.000000Z
Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them
cs.AI updates on arXiv.org 2025-07-16T04:28:50.000000Z
WebSailor: Navigating Super-human Reasoning for Web Agent
cs.AI updates on arXiv.org 2025-07-04T04:08:44.000000Z
Beyond First-Order: Training LLMs with Stochastic Conjugate Subgradients and AdamW
cs.AI updates on arXiv.org 2025-07-03T04:07:24.000000Z
The Best Way to Align an LLM: Inner Alignment is Now a Solved Problem?
少点错误 2025-05-28T06:27:33.000000Z
Revelo’s LatAm talent network sees strong demand from US companies, thanks to AI
TechCrunch News 2025-05-04T15:03:26.000000Z
历时6个月,Hugging Face开源LLM「超大规模实战手册」!200页3万字4000次训练
智源社区 2025-03-04T07:07:13.000000Z
历时6个月,Hugging Face开源LLM「超大规模实战手册」!200页3万字4000次训练
新智元 2025-03-03T06:10:47.000000Z
Elevating AI Reasoning: The Art of Sampling for Learnability in LLM Training
MarkTechPost@AI 2025-02-28T04:03:07.000000Z
LeCun力荐!进化算法淘汰77%低质数据:RIP方法让模型性能狂飙60%
智源社区 2025-02-26T01:07:17.000000Z
LeCun力荐!进化算法淘汰77%低质数据:RIP方法让模型性能狂飙60%
新智元 2025-02-25T12:47:34.000000Z
Andrej Karpathy:我们需要让大模型“上学”,强化学习才刚开始
华尔街见闻 2025-01-31T10:57:31.000000Z
田渊栋:2024年年终总结
智源社区 2025-01-03T09:52:06.000000Z
Frenzy: A Memory-Aware Serverless Computing Method for Heterogeneous GPU Clusters
MarkTechPost@AI 2024-12-25T01:34:56.000000Z
Hugging Face Releases Picotron: A Tiny Framework that Solves LLM Training 4D Parallelization
MarkTechPost@AI 2024-12-19T15:46:33.000000Z