Confusion about the intention #14

zichunxx · 2023-07-11T02:04:19Z

I'm new to reinforcement learning and interested in your work.

After reading your article thoroughly, I'm confused about the motivation for solving long-horizon tasks with a goal-conditioned reward scheme.

As I understand it, a goal-conditioned reward is effectively a sparse reward, and sparse rewards tend to perform poorly on long-horizon tasks.

So why not use a dense reward built from differentiable functions, which can help the training process converge? Also, some tasks don't require many goals.
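
For concreteness, the two reward styles being contrasted can be sketched roughly as below. This is only an illustration under assumptions: the function names, the tolerance `tol`, and the 2-D toy states are hypothetical and are not taken from the repository or the article.

```python
import numpy as np

def sparse_goal_reward(achieved, goal, tol=0.05):
    """Goal-conditioned sparse reward (assumed form): 0 within `tol` of the goal, else -1."""
    dist = np.linalg.norm(np.asarray(achieved, dtype=float) - np.asarray(goal, dtype=float))
    return 0.0 if dist < tol else -1.0

def dense_distance_reward(achieved, goal):
    """Dense reward (assumed form): negative Euclidean distance to the goal."""
    dist = np.linalg.norm(np.asarray(achieved, dtype=float) - np.asarray(goal, dtype=float))
    return -float(dist)

# Toy 2-D example: the agent moves toward the goal along the x-axis.
goal = np.array([1.0, 0.0])
path = [np.array([0.0, 0.0]), np.array([0.5, 0.0]), np.array([0.99, 0.0])]
for state in path:
    print(state, sparse_goal_reward(state, goal), dense_distance_reward(state, goal))
```

In this toy case the sparse reward stays at -1 until the state is within `tol` of the goal, while the dense reward changes at every step as the distance shrinks.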

I'm not sure I'm on the right track, and this question may seem trivial to you, but I'd appreciate a response.

Thanks!
