_index.org

SU-CS238V FEB252025

Last edited: August 8, 2025

Key Sequence

Notation

New Concepts

Important Results / Claims

Questions

Interesting Factoids

SU-CS238V JAN072025

Last edited: August 8, 2025

Alignment Problem

Autonomous systems will do exactly what we tell them to do, so we need to give them good instructions. This is the Alignment Problem. Misalignment can come from three sources:

  1. imperfect objective: the objective is underspecified
  2. imperfect model: the model's understanding of the world is underspecified
  3. imperfect optimization: the problem was simply not solved correctly

Validation Framework

High level structure:

validation_algorithm(system, spec)

system

  • environment: state of the world, with transition model \(T(s' \mid s, a)\)
  • sensor: observation model \(O(o \mid s)\)
  • agent: policy \(\pi\qty(a \mid o)\)
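
As a rough illustration of how the three components fit together, here is a minimal Python sketch. Only `validation_algorithm(system, spec)` comes from these notes; the `System` class, the `rollout` helper, the extra keyword arguments, and the choice to estimate a failure rate by sampling are all assumptions made for the example, not the course's implementation.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

State = float
Observation = float
Action = float
Step = Tuple[State, Observation, Action]


@dataclass
class System:
    """Hypothetical container for the three components of a system."""
    transition: Callable[[State, Action], State]  # samples s' ~ T(s' | s, a)
    sensor: Callable[[State], Observation]        # samples o ~ O(o | s)
    policy: Callable[[Observation], Action]       # samples a ~ pi(a | o)


def rollout(system: System, s0: State, depth: int) -> List[Step]:
    """Run the sense -> act -> step loop for `depth` steps."""
    trajectory = []
    s = s0
    for _ in range(depth):
        o = system.sensor(s)          # agent only sees the observation
        a = system.policy(o)          # action chosen from the observation
        trajectory.append((s, o, a))
        s = system.transition(s, a)   # environment moves to the next state
    return trajectory


def validation_algorithm(system, spec, s0=1.0, depth=20, n_rollouts=100):
    """Assumed behavior: fraction of sampled rollouts that violate `spec`.

    `spec` is taken to be a function from a trajectory to True (satisfied)
    or False (violated).
    """
    failures = sum(not spec(rollout(system, s0, depth)) for _ in range(n_rollouts))
    return failures / n_rollouts
```

For a toy system such as `System(transition=lambda s, a: s + a, sensor=lambda s: s, policy=lambda o: -0.5 * o)`, a specification like `lambda traj: all(abs(s) < 5 for s, _, _ in traj)` is never violated, so the estimated failure rate is zero.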

example: inverted pendulum

  • state: angle and angular velocity \(\qty(\theta, \omega)\) of the pendulum
  • observation: \(O(o \mid s) = \mathcal{N}\qty(o \mid s, \Sigma)\), the state corrupted by Gaussian noise
  • policy: consider the following proportional controller policy

\begin{equation} \pi \qty(a \mid o) = \begin{cases} 1, & \text{if } a = -15 \theta - 8 \omega \\ 0, & \text{otherwise} \end{cases} \end{equation}
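
The pendulum example can be sketched in Python as follows. The gains (15 and 8) and the Gaussian sensor come from the notes. Everything else is an assumption for illustration: the dynamics (a Gym-style pendulum with \(\theta = 0\) upright), the parameters `dt`, `g`, `m`, `l`, the isotropic covariance \(\Sigma = \sigma^2 I\), and the function names.

```python
import math
import random


def sensor(s, sigma=0.05, rng=random):
    """Sample o ~ N(o | s, Sigma), assuming Sigma = sigma^2 * I."""
    theta, omega = s
    return (rng.gauss(theta, sigma), rng.gauss(omega, sigma))


def policy(o):
    """Proportional controller: all probability mass on a = -15*theta - 8*omega."""
    theta, omega = o
    return -15.0 * theta - 8.0 * omega


def transition(s, a, dt=0.05, g=10.0, m=1.0, l=1.0):
    """One Euler step of the assumed pendulum dynamics (theta = 0 is upright)."""
    theta, omega = s
    # Angular acceleration from gravity plus the applied torque `a`.
    alpha = 3.0 * g / (2.0 * l) * math.sin(theta) + 3.0 * a / (m * l ** 2)
    omega = omega + alpha * dt
    theta = theta + omega * dt
    return (theta, omega)


def simulate(s0, steps=200, sigma=0.05, seed=0):
    """Roll the closed loop forward and return the final state."""
    rng = random.Random(seed)  # seeded so runs are repeatable
    s = s0
    for _ in range(steps):
        o = sensor(s, sigma=sigma, rng=rng)
        s = transition(s, policy(o))
    return s
```

Calling `simulate((0.2, 0.0), sigma=0.0)` removes the sensor noise, and under these assumed dynamics the controller drives the pendulum back toward upright. With noise, the state hovers near upright instead of settling exactly.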

SU-CS238V JAN142025

Last edited: August 8, 2025

Key Sequence

Notation

New Concepts

Important Results / Claims

Questions

Interesting Factoids