Trending
Articles related to "language model alignment"
Stable Preference Optimization for LLMs: A Bilevel Approach Beyond Direct Preference Optimization
cs.AI updates on arXiv.org 2025-07-11T04:03:58.000000Z
Google DeepMind Researchers Introduce InfAlign: A Machine Learning Framework for Inference-Aware Language Model Alignment
MarkTechPost@AI 2025-01-02T07:34:53.000000Z