Trending
Articles related to "Cut Cross-Entropy"
Apple Researchers Propose Cut Cross-Entropy (CCE): A Machine Learning Method that Computes the Cross-Entropy Loss without Materializing the Logits for all Tokens into Global Memory
MarkTechPost@AI 2024-11-15T20:05:08Z