SU-CS238 OCT122023
Last edited: August 8, 2025New Concepts
- structure learning
- Markov Equivalence Classes
- Partially Directed Graph Search
- utility and Rational Preferences
- decision network
- utility functions
- quadratic utility
- exponential utility
- power utility and its special case log utility
- maximum expected utility principle
- value of information
Important Results / Claims
- checking Markov Equivalence
- utility of Rational Preference
- von Neumann and Morgenstern Axioms: Axioms for checking rationality
- never have a utility function that’s infinite
- utility of a lottery
- process of observation selection
Questions
SU-CS238 OCT172023
Last edited: August 8, 2025Notation
“state variables” represent the contents of the state; “state” is a complete assignment of state variables.
New Concepts
Important Results / Claims
Questions
- why is it d seperated
Interesting Factoids
SU-CS238 OCT192023
Last edited: August 8, 2025Key Sequence
Notation
New Concepts
- Markov Decision Process
- Bellman Residual
- for continuous state spaces: Approximate Value Function
- use global approximation or local approximation methods
Important Results / Claims
- policy and utility
- creating a good utility function / policy from instantaneous rewards: either policy evaluation or value iteration
- creating a policy from a utility function: value-function policy (“choose the policy that takes the best valued action”)
- calculating the utility function a policy currently uses: use policy evaluation
- kernel smoothing
- value iteration, in practice
Questions
Interesting Factoids
SU-CS238 OCT242023
Last edited: August 8, 2025Key Sequence
Notation
New Concepts
Important Results / Claims
Questions
Interesting Factoids
SU-CS238 OCT262023
Last edited: August 8, 2025Big day. Policy Gradient.
New Concepts
Important Results / Claims
- monte-carlo policy evaluation
- Finite-Difference Gradient Estimation
- Linear Regression Gradient Estimate
