EMNLP2025: MUSE, MCTS Driven Red Teaming

One-Liner

Notable Methods

  1. construct a series of perturbation actions
    • \(A\qty(s)\) = decomposition (skip), expansion (rollout), redirection
  2. sequence actions with MCTS
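
A minimal sketch of step 2, assuming a generic UCT-style MCTS over sequences of the three action names above. This is not the paper's implementation: `toy_reward`, `max_depth`, `iterations`, and the exploration constant `c` are all hypothetical placeholders. In a real setup the reward would presumably come from judging the target model's response, which is stubbed out here.

```python
import math
import random

# Action names taken from the notes; everything else below is assumed.
ACTIONS = ("decomposition", "expansion", "redirection")


class Node:
    def __init__(self, state, parent=None):
        self.state = state      # tuple of actions applied so far
        self.parent = parent
        self.children = {}      # action name -> Node
        self.visits = 0
        self.value = 0.0        # sum of rewards backed up through this node


def uct(child, parent_visits, c=1.4):
    # Standard UCT score: mean reward plus an exploration bonus.
    if child.visits == 0:
        return float("inf")
    mean = child.value / child.visits
    return mean + c * math.sqrt(math.log(parent_visits) / child.visits)


def toy_reward(state):
    # Hypothetical stand-in for an attack-success judge:
    # rewards sequences that use more distinct actions.
    return len(set(state)) / len(ACTIONS)


def mcts(reward_fn, max_depth=3, iterations=200, seed=0):
    rng = random.Random(seed)
    root = Node(())
    for _ in range(iterations):
        node = root
        # Selection: descend while the node is fully expanded and non-terminal.
        while len(node.state) < max_depth and len(node.children) == len(ACTIONS):
            parent = node
            node = max(parent.children.values(),
                       key=lambda ch: uct(ch, parent.visits))
        # Expansion: add one untried action, if the node is non-terminal.
        if len(node.state) < max_depth:
            untried = [a for a in ACTIONS if a not in node.children]
            action = rng.choice(untried)
            child = Node(node.state + (action,), parent=node)
            node.children[action] = child
            node = child
        # Simulation: random rollout to max_depth.
        state = node.state
        while len(state) < max_depth:
            state += (rng.choice(ACTIONS),)
        reward = reward_fn(state)
        # Backpropagation: update statistics up to the root.
        while node is not None:
            node.visits += 1
            node.value += reward
            node = node.parent
    # Read out the most-visited action sequence.
    best, node = [], root
    while node.children:
        action, node = max(node.children.items(), key=lambda kv: kv[1].visits)
        best.append(action)
    return best


if __name__ == "__main__":
    print(mcts(toy_reward))
```

The readout follows visit counts rather than mean value, a common choice because visit counts are less noisy than averages from a handful of rollouts.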

Key Figs

New Concepts

Notes