热点
"人类偏好" 相关文章
A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications
cs.AI updates on arXiv.org 2025-07-15T04:27:08.000000Z
Aligned Textual Scoring Rules
cs.AI updates on arXiv.org 2025-07-09T04:01:32.000000Z
Listener-Rewarded Thinking in VLMs for Image Preferences
cs.AI updates on arXiv.org 2025-07-02T22:33:35.000000Z
Advancing MLLM Alignment Through MM-RLHF: A Large-Scale Human Preference Dataset for Multimodal Tasks
MarkTechPost@AI 2025-02-19T18:33:56.000000Z
让 LLM 来评判 | 奖励模型相关内容
智源社区 2025-02-15T05:46:50.000000Z
Artificial Analysis Group Launches the Artificial Analysis Text to Image Leaderboard & Arena
MarkTechPost@AI 2024-06-26T03:31:40.000000Z
GenAI-Arena: An Open Platform for Community-Based Evaluation of Generative AI Models
MarkTechPost@AI 2024-06-13T05:01:50.000000Z