热点
"量化感知训练" 相关文章
征程 6EM 常见 QConfig 配置解读与示例
掘金 人工智能 2025-06-01T10:53:05.000000Z
27B 显存需求 54 → 14.1GB:谷歌发布 Gemma 3 QAT AI 模型,RTX 3090 显卡可运行
IT之家 2025-04-19T01:58:23.000000Z
Efficient Quantization-Aware Training (EfficientQAT): A Novel Machine Learning Quantization Technique for Compressing LLMs
MarkTechPost@AI 2024-07-20T11:03:48.000000Z
LLM-QFA Framework: A Once-for-All Quantization-Aware Training Approach to Reduce the Training Cost of Deploying Large Language Models (LLMs) Across Diverse Scenarios
MarkTechPost@AI 2024-06-03T06:31:03.000000Z