For MOReL, do you truncate the uncertain rollouts instead of assigning a negative reward? #45

XuJing1022 · 2022-03-25T16:13:00Z

Thanks for your code. However, I find that the implementation differs from the MOReL paper.
This implementation truncates the uncertain rollouts instead of assigning them a negative reward.
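
To make the difference concrete, here is a minimal sketch of the two behaviors as I understand them. The function names, the `uncertain` flags, and the penalty `kappa` are hypothetical and only for illustration; they are not taken from this repository, and the second function reflects my reading of the paper rather than a definitive implementation.

```python
def rollout_truncate(rewards, uncertain):
    """Truncation: cut the rollout at the first uncertain step, with no penalty."""
    kept = []
    for r, u in zip(rewards, uncertain):
        if u:
            break  # drop this step and everything after it
        kept.append(r)
    return kept


def rollout_penalize(rewards, uncertain, kappa):
    """Penalty (assumed paper behavior): the first uncertain step ends the
    rollout and contributes a reward of -kappa."""
    kept = []
    for r, u in zip(rewards, uncertain):
        if u:
            kept.append(-kappa)  # negative reward for leaving the known region
            break
        kept.append(r)
    return kept


rewards = [1.0, 1.0, 1.0, 1.0]
uncertain = [False, False, True, False]  # hypothetical uncertainty flags per step

print(sum(rollout_truncate(rewards, uncertain)))         # 2.0
print(sum(rollout_penalize(rewards, uncertain, 100.0)))  # -98.0
```

With truncation the policy never sees a cost for reaching an uncertain state, whereas with the penalty the return of such a rollout drops sharply.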

If I have not misunderstood your code, could you explain why this difference exists? And could you release code that exactly follows the algorithm described in your paper?

I look forward to your reply. Thanks a lot.
