Can `EpsGreedyPolicy` get correct action, if actions depend on state? #446

NeroBlackstone · 2022-11-10T14:28:46Z

NeroBlackstone
Nov 10, 2022

Hi, I have checked the POMDPTools build-in EpsGreedyPolicy source code, and I think it can only select actions from action spaces.

But if actions that can be taken are limited by certain states, (a function is used to get available actions from action space, depending on what state is now), this EpsGreedyPolicy can't select the correct action because it selects actions from full action spaces

Is there any built-in function to do this?

If not implemented, I'm willing to contribute a policy. (Maybe need some help

zsunberg · 2022-11-11T21:41:30Z

zsunberg
Nov 11, 2022
Maintainer

Hi @NeroBlackstone

Good point - I am not sure why that policy uses actions(m) instead of actions(m, s). I think it should be updated. If you just want some special case for your own use, it may be easiest to use a function policy, e.g.

function special_eps_greedy(s)
    if rand() < 0.05
        return rand(actions(m, s))
    end
       return greedy(s)
    end
end
policy = FunctionPolicy(special_eps_greedy)

but it would be nice to put fix EpsGreedyPolicy so it does this.

1 reply

NeroBlackstone Nov 15, 2022
Author

Do you think this is a good solution?
#449

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can `EpsGreedyPolicy` get correct action, if actions depend on state? #446

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

Can EpsGreedyPolicy get correct action, if actions depend on state? #446

NeroBlackstone Nov 10, 2022

Replies: 1 comment · 1 reply

zsunberg Nov 11, 2022 Maintainer

NeroBlackstone Nov 15, 2022 Author

Can `EpsGreedyPolicy` get correct action, if actions depend on state? #446

NeroBlackstone
Nov 10, 2022

Replies: 1 comment 1 reply

zsunberg
Nov 11, 2022
Maintainer

NeroBlackstone Nov 15, 2022
Author