LLM训练_Fishai

热点

"LLM训练" 相关文章

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

cs.AI updates on arXiv.org 2025-08-01T04:08:16.000000Z

Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training

cs.AI updates on arXiv.org 2025-07-30T04:46:08.000000Z

Interviewing Ross Taylor on the state of AI: Chinese open models, scaling reasoning, useful tools, and what comes next

Interconnects 2025-07-29T13:39:03.000000Z

LLM Data Selection and Utilization via Dynamic Bi-level Optimization

cs.AI updates on arXiv.org 2025-07-23T04:03:20.000000Z

Learning What Matters: Probabilistic Task Selection via Mutual Information for Model Finetuning

cs.AI updates on arXiv.org 2025-07-18T04:13:51.000000Z

AI-Compass LLM训练框架生态：整合ms-swift、Unsloth、Megatron-LM等核心框架，涵盖全参数/PEFT训练与分布式优化

掘金人工智能 2025-07-16T09:53:11.000000Z

Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them

cs.AI updates on arXiv.org 2025-07-16T04:28:50.000000Z

WebSailor: Navigating Super-human Reasoning for Web Agent

cs.AI updates on arXiv.org 2025-07-04T04:08:44.000000Z

Beyond First-Order: Training LLMs with Stochastic Conjugate Subgradients and AdamW

cs.AI updates on arXiv.org 2025-07-03T04:07:24.000000Z

The Best Way to Align an LLM: Inner Alignment is Now a Solved Problem?

少点错误 2025-05-28T06:27:33.000000Z

Revelo’s LatAm talent network sees strong demand from US companies, thanks to AI

TechCrunch News 2025-05-04T15:03:26.000000Z

历时6个月，Hugging Face开源LLM「超大规模实战手册」！200页3万字4000次训练

智源社区 2025-03-04T07:07:13.000000Z

历时6个月，Hugging Face开源LLM「超大规模实战手册」！200页3万字4000次训练

新智元 2025-03-03T06:10:47.000000Z

Elevating AI Reasoning: The Art of Sampling for Learnability in LLM Training

MarkTechPost@AI 2025-02-28T04:03:07.000000Z

LeCun力荐！进化算法淘汰77%低质数据：RIP方法让模型性能狂飙60%

智源社区 2025-02-26T01:07:17.000000Z

LeCun力荐！进化算法淘汰77%低质数据：RIP方法让模型性能狂飙60%

新智元 2025-02-25T12:47:34.000000Z

Andrej Karpathy：我们需要让大模型“上学”，强化学习才刚开始

华尔街见闻 2025-01-31T10:57:31.000000Z

田渊栋：2024年年终总结

智源社区 2025-01-03T09:52:06.000000Z

Frenzy: A Memory-Aware Serverless Computing Method for Heterogeneous GPU Clusters

MarkTechPost@AI 2024-12-25T01:34:56.000000Z

Hugging Face Releases Picotron: A Tiny Framework that Solves LLM Training 4D Parallelization

MarkTechPost@AI 2024-12-19T15:46:33.000000Z

Copyright © 2019 FISHAI.All Rights Reserved