热点
关于我们
xx
xx
"
直接偏好优化
" 相关文章
A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications
cs.AI updates on arXiv.org
2025-07-15T04:27:08.000000Z
Aligning Generative Speech Enhancement with Human Preferences via Direct Preference Optimization
cs.AI updates on arXiv.org
2025-07-15T04:26:59.000000Z
Principled Foundations for Preference Optimization
cs.AI updates on arXiv.org
2025-07-15T04:24:21.000000Z
Stable Preference Optimization for LLMs: A Bilevel Approach Beyond Direct Preference Optimization
cs.AI updates on arXiv.org
2025-07-11T04:03:58.000000Z
Evaluating the Effectiveness of Direct Preference Optimization for Personalizing German Automatic Text Simplifications for Persons with Intellectual Disabilities
cs.AI updates on arXiv.org
2025-07-03T04:07:31.000000Z
无需百卡集群!港科等开源LightGen: 极低成本文生图方案媲美SOTA模型
机器之心
2025-03-20T05:13:31.000000Z
LLM自学成才变身「预言家」!预测未来能力大幅提升
新智元
2025-02-25T12:47:34.000000Z
LLM自学成才变身「预言家」,预测未来能力大幅提升
36kr
2025-02-25T04:03:33.000000Z
Optimizing Protein Design with Reinforcement Learning-Enhanced pLMs: Introducing DPO_pLM for Efficient and Targeted Sequence Generation
MarkTechPost@AI
2024-12-20T18:08:06.000000Z
微软:两个AI相互纠错,数学再涨5分
36kr
2024-12-02T08:29:23.000000Z
Tackling Hallucinations, Boosting Reasoning Abilities, and New Insights into the Transformer Architecture
Ahead of AI
2024-10-22T06:07:40.000000Z