4 Matching Annotations
  1. Oct 2023
    1. Quantitatively, SPRING with GPT-4 outperforms all state-of-the-art RL baselines, trained for 1M steps, without any training.

      Them's fightin' words!

      I haven't read it yet, but we're putting it on the list for this fall's reading group. Seriously, this is a strong result with a very strong implied claim. They are careful to say it comes from their empirical results, and it is very much worth a look. I suspect that the amount of implicit knowledge in the papers, text, and DAG is helping to do this.

      The Big Question: is their comparison to RL baselines fair? Are the baselines being trained from scratch? What does a fair comparison of any from-scratch model (RL or supervised) mean when it is compared to an LLM approach (or any approach using a foundation model), when that model is not really from scratch?

  2. Oct 2022
    1. To be a successful physicist requires mastering how to make all 29 decisions, but the reflection decisions (decisions 23–26) are arguably the most difficult to learn.

      Of the 29 problem-solving decisions identified as important, the four "reflection decisions" (23–26 in the list) may be the most difficult to learn, as they require metacognition and self-evaluation.

  3. Jun 2020