Articles related to "reward shaping"
Shaping Sparse Rewards in Reinforcement Learning: A Semi-supervised Approach
cs.AI updates on arXiv.org
2025-08-05T11:10:31.000000Z
Learning from Expert Factors: Trajectory-level Reward Shaping for Formulaic Alpha Mining
cs.AI updates on arXiv.org
2025-07-29T04:22:20.000000Z
Bootstrapped Reward Shaping
cs.AI updates on arXiv.org
2025-07-28T04:43:06.000000Z