Skip to content

Formulation #6

@oyamad

Description

@oyamad

The current code solves dynamic programs with

  • reward function f(s, x) and
  • state transition function g(s, x, e)

where

  1. s is an N-dimensional continuous state variable
  2. x is a 1-dimensional continuous action variable and
  3. e is a discrete random variable, whose distribution is state independent.

Discussion:

  1. Do we want to allow discrete state variables in some dimensions?
  2. We want to allow discrete actions (Allow discrete actions #5) and multidimensional actions.
  3. Do we want to allow state dependent distributions?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions