Possible bug: It may be a version problem but I can't fix it #7

yanglin-code · 2022-06-08T02:44:27Z

Hi, the first line referenced below raises an IndexError. It may just be a version problem with NumPy, or there may be an actual error in the code.

Second, in the second line referenced below, does s mean state and a mean action?

irl-imitation/mdp/value_iteration.py

Line 486 in bc9e9cd

expected_value = rewards_expanded[start:end, :] + gamma * values_tmp
irl-imitation/mdp/value_iteration.py

Line 333 in bc9e9cd

return sum([P_s1_s_a * (self.mdp.get_reward_sas(s, a, s1) + self.gamma * self.values[s1])
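For readers following the second question: the quoted line has the shape of a standard Bellman backup. Under the usual MDP convention (an assumption here, since the thread does not confirm it), s is the current state, a is the action taken, s1 is the next state, and P_s1_s_a is the probability of reaching s1 from s under a. A minimal sketch of that reading, with a toy MDP whose names (`transitions`, `reward`, `q_value`) are hypothetical and not taken from the repository:

```python
# Hypothetical toy MDP for illustration; none of these names come from the repository.
GAMMA = 0.9  # discount factor

# transitions[(s, a)] -> list of (s1, probability) pairs, i.e. P(s1 | s, a)
transitions = {
    (0, "right"): [(1, 0.8), (0, 0.2)],
}

# Current value estimate V(s) for each state
values = {0: 0.0, 1: 10.0}


def reward(s, a, s1):
    """Hypothetical reward R(s, a, s1): 1.0 for landing in state 1, else 0.0."""
    return 1.0 if s1 == 1 else 0.0


def q_value(s, a):
    """Expected value of taking action a in state s, summed over next states s1."""
    return sum(
        p * (reward(s, a, s1) + GAMMA * values[s1])
        for s1, p in transitions[(s, a)]
    )


print(q_value(0, "right"))
```

Each term in the sum weights the immediate reward plus the discounted value of the next state by the transition probability, which matches the structure of the quoted line.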

Charlesyyun · 2023-05-30T13:38:45Z

How do you fix the first error?
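The thread does not include the traceback, so the actual cause of the first error is not known. As a hedged illustration only: one way an expression like `rewards_expanded[start:end, :]` can raise an IndexError in NumPy is when the array is 1-D, because the index then has more dimensions than the array. The sketch below reproduces that case and shows a shape check; `slice_rows` and the sample arrays are hypothetical and not part of the repository.

```python
import numpy as np


def slice_rows(arr, start, end):
    """Return rows start..end of a 2-D array, failing early with a clear message."""
    arr = np.asarray(arr)
    if arr.ndim != 2:
        raise ValueError(f"expected a 2-D array, got shape {arr.shape}")
    return arr[start:end, :]


# A 1-D array indexed with two indices raises IndexError.
rewards_1d = np.zeros(4)
try:
    rewards_1d[0:2, :]
except IndexError as exc:
    print("IndexError:", exc)

# Reshaping to a column (one assumed fix; the right shape depends on the caller)
# makes the same two-index slice valid.
rewards_2d = rewards_1d.reshape(-1, 1)
print(slice_rows(rewards_2d, 0, 2).shape)
```

Printing `rewards_expanded.shape` and `values_tmp.shape` just before the failing line would show whether this is what is happening in your environment.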
