Houjun Liu

observe-act cycle

  1. at time \(t\), the agent receives an observation \(o_{t}\)
  2. the agent then chooses an action \(a_{t}\) based on what it knows, through some decision process, in order to effect a possibly nondeterministic change on the environment
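
The two steps above can be sketched as a loop. This is only a minimal illustration under stated assumptions, not a definitive formulation: the `Environment`, `Agent`, and `run` names, the integer state, the fully visible observation, and the 0.8 success probability are all hypothetical choices made for the example.

```python
import random


class Environment:
    """Toy environment (hypothetical): the state is an integer position."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)  # seeded so runs are repeatable
        self.state = 0

    def observe(self):
        # Observation o_t; assumed here to reveal the state directly.
        return self.state

    def step(self, action):
        # Nondeterministic effect (assumed): the action takes effect with
        # probability 0.8, otherwise the state is left unchanged.
        if self.rng.random() < 0.8:
            self.state += action


class Agent:
    """Toy agent (hypothetical): moves toward a goal position."""

    def __init__(self, goal):
        self.goal = goal
        self.history = []  # what the agent knows so far

    def act(self, observation):
        # Choose a_t from what the agent knows; here only the latest o_t is used.
        self.history.append(observation)
        if observation < self.goal:
            return 1
        if observation > self.goal:
            return -1
        return 0


def run(env, agent, steps):
    """Run the observe-act cycle and return a list of (t, o_t, a_t)."""
    trajectory = []
    for t in range(steps):
        o_t = env.observe()   # step 1: receive observation
        a_t = agent.act(o_t)  # step 2: choose action
        env.step(a_t)         # action changes the environment, possibly
        trajectory.append((t, o_t, a_t))
    return trajectory
```

Calling `run(Environment(seed=0), Agent(goal=3), 20)` yields one `(t, o_t, a_t)` tuple per time step. Because the action can fail, the same action taken from the same state does not always lead to the same next observation.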