Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RelEnt uses trajectories of varying length #40

Open
maxmdaniel opened this issue Oct 13, 2018 · 1 comment
Open

RelEnt uses trajectories of varying length #40

maxmdaniel opened this issue Oct 13, 2018 · 1 comment

Comments

@maxmdaniel
Copy link
Collaborator

Our implementation of RelEnt currently works with trajectories of varying length. (This is because we rely on our collect_trajs util, which returns when an episode ends.)

By contrast, the RelEnt paper does all calculations under the assumption of a fixed trajectory length.

I'm not sure if this is problematic, but open this issue lest we forget to look into this.

@JohannesHeidecke
Copy link
Owner

RelEnt will have to be re-added with the new structure, we should take care of it then.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants