热点
"数据选择" 相关文章
不靠海量数据,如何精准喂养大模型?上交Data Whisperer:免训练数据选择法,10%数据逼近全量效果
机器之心 2025-07-29T13:27:15.000000Z
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
cs.AI updates on arXiv.org 2025-07-18T04:14:06.000000Z
Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
cs.AI updates on arXiv.org 2025-07-10T04:06:09.000000Z
TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection
cs.AI updates on arXiv.org 2025-07-08T06:58:12.000000Z
MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models
cs.AI updates on arXiv.org 2025-07-08T05:53:50.000000Z
ACL 2025 | 数据多不如风格齐?SCAR精选<1%样本,指令微调效果飙升
PaperWeekly 2025-06-17T09:22:41.000000Z
字节最新大模型秘籍:只挑能有推理潜力的数据训练!1.3B模型无需标签自动挑选
智源社区 2025-05-16T09:14:18.000000Z
Model Performance Begins with Data: Researchers from Ai2 Release DataDecide—A Benchmark Suite to Understand Pretraining Data Impact Across 30K LLM Checkpoints
MarkTechPost@AI 2025-04-17T06:30:36.000000Z
10篇R1相关的研究全面汇总,万字思考!
Datawhale 2025-03-20T16:32:17.000000Z
Enhancing Instruction Tuning in LLMs: A Diversity-Aware Data Selection Strategy Using Sparse Autoencoders
MarkTechPost@AI 2025-02-25T17:48:40.000000Z
Understanding World Politics through a Systematic Understanding of its Language
Blog on Text Analytics - Provalis Research 2024-11-27T09:06:58.000000Z
Task-Specific Data Selection: A Practical Approach to Enhance Fine-Tuning Efficiency and Performance
MarkTechPost@AI 2024-11-21T10:19:44.000000Z
大模型指令调优数据集万字评测!腾讯上交大联合出品
智源社区 2024-08-19T04:22:42.000000Z
This AI Paper by ByteDance Research Introduces G-DIG: A Gradient-Based Leap Forward in Machine Translation Data Selection
MarkTechPost@AI 2024-05-27T18:31:01.000000Z