RelEnt uses trajectories of varying length #40

maxmdaniel · 2018-10-13T00:11:18Z

Our implementation of RelEnt currently works with trajectories of varying length. (This is because we rely on our collect_trajs util, which returns when an episode ends.)

By contrast, the RelEnt paper does all calculations under the assumption of a fixed trajectory length.

I'm not sure if this is problematic, but open this issue lest we forget to look into this.

JohannesHeidecke · 2018-10-31T17:58:07Z

RelEnt will have to be re-added with the new structure, we should take care of it then.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RelEnt uses trajectories of varying length #40

RelEnt uses trajectories of varying length #40

maxmdaniel commented Oct 13, 2018

JohannesHeidecke commented Oct 31, 2018

RelEnt uses trajectories of varying length #40

RelEnt uses trajectories of varying length #40

Comments

maxmdaniel commented Oct 13, 2018

JohannesHeidecke commented Oct 31, 2018