Posts

Algorithms Index

Last edited: November 11, 2025

SU-CS161 Things to Review

SU-CS161 Embedded Ethics

Lectures

Divide and Conquer

Sorting

Data Structures

Graphs

DP

Greedy Algorithms

Closing

EMNLP2025 Index

Last edited: November 11, 2025

Talks

Posters

Takes

  • although parsing maybe dead for natural language, structure helps parse scientific information (i.e. drugs, molecules, proteins, etc.)
  • two idea: 1) how to formalize approach mathematically 2) what can LMs do that humans can’t do?
  • information-rich statefulness + constraints for pruning space is the unlock for ability to build on previous results; i.e. “critical thinking”

Tasks to Do

Tasks Can Do

EMNLP2025 Keynote: Heng Ji

Last edited: November 11, 2025

Motivation: drug discovery is extremely slow and expensive; mostly modulating previous iterations of work.

Principles of Drug Discovery

  • observation: acquire/fuse knowledge from multiple data modalities (sequence, stricture, etc.)
  • think: critically generating actually new hypothesis — allowing iteratively
  • allowing LMs to code-switch between moladities (i.e. fuse different modalities together in the most uniform way)

LM as a heuristic helps prune down search space quickly.

EMNLP2025 Tuesday Morning Posters

Last edited: November 11, 2025

EMNLP2025 Xu: tree of prompting

Evaluate the quote attribution score as a way to prioritize more factual quotes.

EMNLP2025 Fan: medium is not the message

Unwanted feature such as language a medium who found in embedding, use linear concept of eraser to learn a projection that minimize information on unwanted features

EMNLP2025 Hong: variance sensitivity induces attention entropy collapse

Softmax is highly sensitive to variance which is why pre-training loss spikes without QK norm

SU-CS229 Midterm Sheet

Last edited: November 11, 2025