热点
"Generative Reward Models" 相关文章
Generative Reward Models (GenRM): A Hybrid Approach to Reinforcement Learning from Human and AI Feedback, Solving Task Generalization and Feedback Collection Challenges
MarkTechPost@AI 2024-10-23T02:22:08.000000Z