ICLR2025 HAIC

ICLR2025 Koyejo

Proposal: Focus AI measurements on the validity of specific terms.

Five pillars of claim making:

content validity: does your evaluation cover all valuable cases?
criterion validity: does your evaluation correlate with a known validated standard?
construct validity: does your evaluation measure the intended construct?
external validity: does your evaluation generalize across different environments or settings?
consequential validity: does your evaluation consider the real world impact of test interpretation and use

Open problem: validaty of measurement for claims of HAIC.

When AI systems aligns with user values, users rank them as more helpful.

Good For unpredictable system, the best is to build in checks and balances + diverse systems.

“finding ways honor and value big-bad failures—to build objectives”

Multi-agent environment to solve factored POMDPs while a human agent is doing somtehing.