Stanford UG Courses Index
Last edited: June 6, 2026Stanford UG Y1, Aut
Stanford UG Y1, Win
Stanford UG Y1, Spr
Stanford UG Y2, Aut
Stanford UG Y2, Win
Stanford UG Y2, Spr
Stanford UG Y3, Aut
Stanford UG Y3, Win
Stanford UG Y3, Spr
Stanford UG Talks
| Date | Topic | Presenter | Link |
|---|---|---|---|
| UG Research Program | Brian Thomas | Stanford UG Research Program | |
| Bld an Ecosystem, Not Monolith | Colin Raffel | Build a System | |
| Training Helpful CHatbots | Nazeen Rajani | Training Helpful Chatbots | |
| AI Intepretability for Bio | Gasper Begus | AI Intepretability | |
| PT Transformers on Long Seqs | Mike Lewis | Pretraining Long Transformers | |
| Transformers! | A. Vaswani | Transformers | |
| Towards Interactive Agents | Jessy Lin | Interactive Agent | |
| Dissociating Language and Thought | Anna Ivanova | Dissociating Language and Thought | |
| Language Agents | Karthik Narasimhan | Language Agents with Karthik | |
| Pretraining Data | |||
| value alignment | Been Kim | LM Alignment | |
| model editing | Peter Hase | Knowledge Editing | |
| Knowledge Localization | |||
| Presentations | Sydney Katz | Presentations | |
| Video Generation with Learned Prior | Meenakshi Sarkar | Priors | |
| Theoretical Drone Control | Sliding Mode UAV Control | ||
| VLM to Agents | Tao Yu | VLM to Agents | |
| Social RL | Natasha Jaques | Social Reinforcement Learning | |
| Model Predictive Control + Prompting | Gabriel Maher | LLM MPC | |
| Planning for Learning | |||
| Theorem Proving | Self-Play Conjection Generalization | ||
| Safety for Trucks | Safety for Autonomous Trucking | ||
| Collaborate Multiagent DM | Collaborative Multiagent DM | ||
| AI Safety Talks | AI Safety Annual Meeting | ||
| Pretraining under infinite compute | Limited Samples and Infinite Compute | ||
| Mel Krusniak | Decisions.jl | ||
| SISL Flash Talks | SISL Talks | ||
| Predicting Scaling Performance | |||
| mixed-autonomy traffic with LLMS | mixed-autonomy traffic with LLMs | ||
| AI Incidents Policy | AI Incidents Policy | ||
| Reliable RL | Reliable RL | ||
| Words to Concepts | Words to Concepts | ||
| Zen’s Defense | |||
| multi-agent LLM | Multi-Agent LLMs | ||
| Alex’s Defense |
Contacts
SU-COLLEGE110 First Essay Planning
Last edited: June 6, 2026Several authors we have read questioned the possibility or appropriateness of democracy in countries where certain social structures and cultural ideologies are dominant—e.g., cases of Singapore and China. Do you think that particular cultures hinder the practice of democracy? Evaluate the debate on whether democracy has a universal appeal or is only appropriate to some cultures. What evidence exists to support each side of the debate and is it compelling? Take a position in this debate and make an argument for that position.
SU-COLLEGE110 Second Essay Planning
Last edited: June 6, 2026General Information
| Due Date | Topic | Important Documents |
|---|---|---|
| Saturday | Polarization |
> Please indicate which prompt you have selected (Q1, Q2, or Q3) at the beginning of your essay.
In recent years, political polarization has increased in many democratic societies. As Diamond has observed, “among the liberal democracies, partisan and ideological polarization is often worrisomely high, while political tolerance and trust have eroded.” This trend also manifests itself in the growing ideological distance between political parties, increasing partisanship among the electorate, and the erosion of civility in public discourse.
SU-CS224N Paper Review
Last edited: June 6, 2026Key Information
- Title: Fine-Grained Language Model Detoxification with Dense, Token-Level Rewards
- Team Member (in 224n): Houjun Liu <[email protected]>
- External Collaborators: Amelia Hardy <[email protected]>, Bernard Lange <[email protected]>
- Custom Project
- Mentor: we have no particular mentor within 224n
- Sharing Project: this project is shared with AA222, and is a part of a research project PI’d by Mykel Kochenderfer <[email protected]>, of which Houjun is taking a leading role
Research Paper Summary
| Title | Fine-Grained Human Feedback Gives Better Rewards for Language Model Training |
|---|---|
| Venue | NeurIPS (Spotlight) |
| Year | 2023 |
| URL | https://arxiv.org/pdf/2306.01693 |
Background
Reinforcement Learning with Human Feedback (RLHF) has demonstrated superb effect for improving performance of a language model (LM) via human preference judgments of LM output desirability–reducing incidences of toxic or false generation trajectories ((Ziegler et al. 2020)). Naive application of RLHF directly has shown success in reducing the toxicity in language model outputs, yet its effects could sometimes be inconsistent without further in-context guidance of the resulting model ((Ouyang et al. 2022)).
Transformer Speech Diarization
Last edited: June 6, 2026Background
Current deep-learning first approaches have shown promising results for the speech text diarization task. For ASR-independent diarization, specifically, two main methods appear as yielding fruitful conclusions:
Auditory feature extraction using deep learning to create a trained, fixed-size latent representation via Mel-frequency cepstral coefficients slices that came from any existing voice-activity detection (VAD) scheme ((Snyder et al. 2018)), where the features extracted with the neural network are later used with traditional clustering and Variational Bayes refinement ((Sell et al. 2018; Landini et al. 2022)) approaches to produce groups of diarized speakers
