AA228/CS238: Probability Review!
Last edited: June 6, 2026Random Variable
random variables takes on different values with different probabilities. Each value a random variable take on is an event.
For instance, here’s a random variable representing a die: \(X\). It can takes on the following values, with the following probabilities:
\begin{align} P(X=1) = \frac{1}{6}\\ P(X=2) = \frac{1}{6}\\ \dots \\ P(X=6) = \frac{1}{6} \end{align}
where each assignment \(X=k\) is what we refer to above as an event.
The set of assignments of a random variable and their associated probability is called a distribution: distributions “assigns probabilities to outcomes.” When we say a certain random variable \(X\) is “distributed” following a distribution \(D\), we say \(X \sim D\). Semantically, we say \(X\) is a \(D\) random variable.
adventuretime
Last edited: June 6, 2026- 1 Executive Summary
- 2 Core Design Principles
- 3 Non-Goals
- 4 System Overview
- 5 Hardware Architecture (Recommended)
- 6 Network & Communication Diagram (Textual)
- 7 Storage Model
- 7.1 Invariant
- 7.2 Checkpoint Flow
- 7.3 Artifacts & Logs
- 8 Unified Checkpointing (JAX Pytrees)
- 9 Repository Structure (Tech Spec)
- 10 Execution Model
- 10.1 Job Lifecycle
- 10.2 Backend Interface
- 11 DAG / “Ray-lite” Model
- 12 Example YAML Specifications
- 12.1 Backend Inventory
- 12.2 Storage
- 12.3 Single Run Spec
- 12.4 DAG Spec (Tokenize → Train → Rollouts)
- 13 Implementation Plan
- 13.1 Phase 0 (2–3 weeks)
- 13.2 Phase 1 (4–6 weeks)
- 13.3 Phase 2 (3–4 weeks)
- 13.4 Phase 3 (optional)
- 14 Cost Estimates
- 14.1 One-Time Hardware (Target ~$50k)
- 14.2 Ongoing
- 15 Risks & Mitigations
- 16 Success Criteria
- 17 Conclusion
After a conversation with an LM https://chatgpt.com/share/697143db-c3e0-8000-b56c-07cf7ca43795 the following proposal was generated.
AML: It Takes Two
Last edited: June 6, 2026Hello everyone! It’s April, which means we are ready again for a new unit. Let’s dive in.
You know what’s better than one neural network? TWO!!! Multi-modal approaches—making two neural networks interact for a certain result—dominate many of the current edge of neural network research. In this unit, we are going to introduce one such approach, Generative Adversarial Networks (GAN), but leave you with some food for thought for other possibilities for what training multiple networks together can do.
AML: REINFORCE(ment learning)
Last edited: June 6, 2026Woof. As I begin to write this I should add that this unit is going to be conceptually dense. Though we are teaching one particular algorithm (incidentally, named, REINFORCE), the world of reinforcement learning is build by one, if not many, very advanced treatments in maths.
So if anything, I would focus on getting the conceptual flavor of how these problems are formulated and discuses. If you can be along for the mathematical and algorithmic journey, then even better — but by no means required or expected… There’s still lots for all of us to learn together.
AML: Time to Convolve
Last edited: June 6, 2026Welcome back! I think, over the last few days, we have been hyping up convolutional neural networks enough such that you are probably ready to dive right in. So… Let’s, uh, motivate it first!
Why do we use a CNN?
Let’s think of a toy problem to play with. Given a pattern made using two colours (let’s name them a and b, or perhaps black and white), let’s classify whether it is the “zebra” pattern" or the “checkerboard” pattern.
