return only one dimension in multidimensional observation? #378

doweichert · 2021-11-29T16:04:12Z

doweichert
Nov 29, 2021

Hej,

I just found your julia package and it looks great.

Anyway, I've got a short question. I've got a problem in a, say, two dimensional space and two actions that lead to observations only for one of the dimensions. To express it generator-like with (x, y)-tuples for actions a_1 and a_2:

((s'_x, s'_y), (o_x, ??), r) = G((s_x, s_y), a_1)
((s'_x, s'_y), (??, o_y), r) = G((s_x, s_y), a_2)

How can I express this best using the framework?

Thanks for your help.
Dorina

Answered by zsunberg

Nov 29, 2021

Hi @doweichert , I would recommend just returning one number for the observation. Since the observation distribution is conditioned on the action, belief updaters and solvers will be able to "understand" that the observation is the x dimension when action 1 is taken and in the y dimension when action 2 is taken. It's also worth noting that for most belief updaters (e.g. particle filters) you will need the explicit observation distribution. Therefore I would recommend implementing the problem something like this (let's say that there is Gaussian noise with standard deviation 1 on the observation):

m = QuickPOMDP(
    obstype = Float64,
    gen = function (s, a, rng)
        # sample s' and r…

View full answer

zsunberg · 2021-11-29T18:42:58Z

zsunberg
Nov 29, 2021
Maintainer

Hi @doweichert , I would recommend just returning one number for the observation. Since the observation distribution is conditioned on the action, belief updaters and solvers will be able to "understand" that the observation is the x dimension when action 1 is taken and in the y dimension when action 2 is taken. It's also worth noting that for most belief updaters (e.g. particle filters) you will need the explicit observation distribution. Therefore I would recommend implementing the problem something like this (let's say that there is Gaussian noise with standard deviation 1 on the observation):

m = QuickPOMDP(
    obstype = Float64,
    gen = function (s, a, rng)
        # sample s' and r
        return (sp = (spx, spy), r=r)
    end,
    observation = function (a, sp)
        if a == a1
            return Normal(sp[1], 1.0)
        else
            @assert a == a2
            return Normal(sp[2], 1.0)
        end
    end,
    # specify other problem elements like initialstate here.
)

1 reply

doweichert Dec 2, 2021
Author

Hej @zsunberg ,

thanks a lot for the answer. I'll check it out.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

return only one dimension in multidimensional observation? #378

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment 1 reply

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

return only one dimension in multidimensional observation? #378

doweichert Nov 29, 2021

Replies: 1 comment · 1 reply

zsunberg Nov 29, 2021 Maintainer

doweichert Dec 2, 2021 Author

doweichert
Nov 29, 2021

Replies: 1 comment 1 reply

zsunberg
Nov 29, 2021
Maintainer

doweichert Dec 2, 2021
Author