Run meta_test.py with ProMP-trained policy and get very bad result. #11

R-Ceph · 2020-09-27T09:35:35Z

I ran ppo_run.py and got a .pkl file for HopperRandParamsEnv, of which the average reward was about 200
But when I ran meta_test.py with ProMP-trained policy, the average reward dropped to around 10...
I don't understand where the problem is.
Can somebody help me?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run meta_test.py with ProMP-trained policy and get very bad result. #11

Run meta_test.py with ProMP-trained policy and get very bad result. #11

R-Ceph commented Sep 27, 2020

Run meta_test.py with ProMP-trained policy and get very bad result. #11

Run meta_test.py with ProMP-trained policy and get very bad result. #11

Comments

R-Ceph commented Sep 27, 2020