热点
关于我们
xx
xx
"
语言模型对齐
" 相关文章
Stable Preference Optimization for LLMs: A Bilevel Approach Beyond Direct Preference Optimization
cs.AI updates on arXiv.org
2025-07-11T04:03:58.000000Z
Google DeepMind Researchers Introduce InfAlign: A Machine Learning Framework for Inference-Aware Language Model Alignment
MarkTechPost@AI
2025-01-02T07:34:53.000000Z