C.3 Agent Foundations: TA Guide
Session overview
Total time: ~6.5 hours
| Block | Duration | Format |
|---|---|---|
| Introductory lecture: bird’s eye view and motivation | 1 hour | Lecture |
| Readings and discussions | 2.5 hours | Structured reading + cross-topic discussion |
| Guest lecture: reflective oracles and nonrealizability | 1 hour | Lecture (Cole Wyeth) |
| Exercises | 1 hour | Individual / small group work |
| Guest lecture: decision theory, information engine, embedded agency and algorithmic thermodynamics | 1 hour | Lecture (Aram Ebtekar) |
Lecture: Bird’s eye view and motivation
Key points to cover
Common questions / sticking points
Readings and discussion block
Format
Separate into fundamental readings vs topic-specific readings. Each participant reads the fundamentals plus one topic, then cross-pollinates in discussion.
Fundamental readings (everyone):
- Embedded agency
- Why agent foundations
- General purpose search
Topic tracks (one per participant):
- Consequentialist foundations (coherence + complete class theorems)
- Lob’s theorem and tiling agents
- Logical induction
- Decision theory
- Optimization and thermodynamics
- Descriptive agent foundations