热点
"奖励塑造" 相关文章
Shaping Sparse Rewards in Reinforcement Learning: A Semi-supervised Approach
cs.AI updates on arXiv.org 2025-08-05T11:10:31.000000Z
Learning from Expert Factors: Trajectory-level Reward Shaping for Formulaic Alpha Mining
cs.AI updates on arXiv.org 2025-07-29T04:22:20.000000Z
Bootstrapped Reward Shaping
cs.AI updates on arXiv.org 2025-07-28T04:43:06.000000Z