Policy formulation for parallel run #477
Replies: 1 comment 1 reply
-
Hi @Etwari3 , You are doing it correctly if you want the policy to be a function of only the previous observation. There is just one thing that is likely confusing you: on the first step, since there is no observation available, the You can fix this by providing an initial observation as the initial_obs = 1
initial_state = 2
Sim(pomdp, xyz, PreviousObservationUpdater(), initial_obs, initial_state, max_steps=32, rng=MersenneTwister(1), metadata=Dict(:policy=>3)) |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I tried to construct a functionpolicy as follows
I got the following error message:
How do you define the actions for the different states?
Thank you
Beta Was this translation helpful? Give feedback.
All reactions