热点
关于我们
xx
xx
"
LLM训练
" 相关文章
The Best Way to Align an LLM: Inner Alignment is Now a Solved Problem?
少点错误
2025-05-28T06:27:33.000000Z
Revelo’s LatAm talent network sees strong demand from US companies, thanks to AI
TechCrunch News
2025-05-04T15:03:26.000000Z
历时6个月,Hugging Face开源LLM「超大规模实战手册」!200页3万字4000次训练
智源社区
2025-03-04T07:07:13.000000Z
历时6个月,Hugging Face开源LLM「超大规模实战手册」!200页3万字4000次训练
新智元
2025-03-03T06:10:47.000000Z
Elevating AI Reasoning: The Art of Sampling for Learnability in LLM Training
MarkTechPost@AI
2025-02-28T04:03:07.000000Z
LeCun力荐!进化算法淘汰77%低质数据:RIP方法让模型性能狂飙60%
智源社区
2025-02-26T01:07:17.000000Z
LeCun力荐!进化算法淘汰77%低质数据:RIP方法让模型性能狂飙60%
新智元
2025-02-25T12:47:34.000000Z
Andrej Karpathy:我们需要让大模型“上学”,强化学习才刚开始
华尔街见闻
2025-01-31T10:57:31.000000Z
田渊栋:2024年年终总结
智源社区
2025-01-03T09:52:06.000000Z
Frenzy: A Memory-Aware Serverless Computing Method for Heterogeneous GPU Clusters
MarkTechPost@AI
2024-12-25T01:34:56.000000Z
Hugging Face Releases Picotron: A Tiny Framework that Solves LLM Training 4D Parallelization
MarkTechPost@AI
2024-12-19T15:46:33.000000Z
An introduction to preparing your own dataset for LLM training
AWS Machine Learning Blog
2024-12-19T15:24:10.000000Z
How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model
AWS Machine Learning Blog
2024-12-18T15:32:23.000000Z
LASER: An Adaptive Method for Selecting Reward Models RMs and Iteratively Training LLMs Using Multiple Reward Models RMs
MarkTechPost@AI
2024-10-05T07:20:56.000000Z
Revisiting Weight Decay: Beyond Regularization in Modern Deep Learning
MarkTechPost@AI
2024-09-29T10:20:54.000000Z
instruction tuning and autoregressive distribution shift
少点错误
2024-09-05T17:07:16.000000Z
Why GPU Utilization Falls Short: Understanding Streaming Multiprocessor (SM) Efficiency for Better LLM Performance
MarkTechPost@AI
2024-09-03T15:35:22.000000Z
从裸机到700亿参数大模型,这里有份教程,还有现成可用的脚本
机器之心
2024-07-27T04:08:49.000000Z
Spectrum: An AI Method that Accelerates LLM Training by Selectively Targeting Layer Modules based on their Signal-to-Noise Ratio (SNR)
MarkTechPost@AI
2024-07-04T06:01:50.000000Z
Yandex Introduces YaFSDP: An Open-Source AI Tool that Promises to Revolutionize LLM Training by Cutting GPU Usage by 20%
MarkTechPost@AI
2024-06-14T17:31:36.000000Z