Richard Sutton page, lists more articles than the folder I found.
3 Matching Annotations
- Last 7 days
-
www.incompleteideas.net www.incompleteideas.net
-
-
www.incompleteideas.net www.incompleteideas.net
-
Some writings, incomplete ideas he calls them, by Richard Sutton. Straight-up HTML, no frills, in a folder. Nice.
Tags
Annotators
URL
-
-
www.dwarkesh.com www.dwarkesh.com
-
LLMs aren’t capable of learning on-the-job, so no matter how much we scale, we’ll need some new architecture to enable continual learning.And once we have it, we won’t need a special training phase — the agent will just learn on-the-fly, like all humans, and indeed, like all animals.This new paradigm will render our current approach with LLMs obsolete.
Richard Sutton on LLM dev: a) core problem is LLMs can't learn from use. Diff architecture necessary for continual learning b) if you've got continual learning then current big-bang training no longer useful. facit: LLM approach not sustainable and dead end.
Tags
Annotators
URL
-