Houjun Liu

CALP

Contraindicated offline POMDP solver.

Contrained belief state MDP
Linear Programming
belief set generation
Approximate POMDP with Contrainst

CPOMDPs are Hard

Can’t do DP with pruning: optimal policies may be stochastic
Minimax quadratically contained program: computational intractable
Contained PBVI struggles with contraint satisfaction

CALP Core Idea

Recast CPOMDP as a contrained belief-state MDP.

We replace our state-space with our belief space:

\(S = B\)
\(s_0 = b_0\)

You essentially assume here that there is some finite belief space.