5 Matching Annotations
  1. Last 7 days
    1. Using these ability scores, the method predicts performance on new tasks with ~88% accuracy, including for models such as GPT-4o and Llama-3.1.

      88%的预测准确率是一个令人印象深刻的数据点,表明ADeLe不仅能够解释现有性能,还能可靠预测模型在新任务上的表现。这一准确率远超传统方法,为AI系统的可靠部署提供了强有力的预测工具,可能是AI评估领域的重要突破。

  2. Oct 2025
    1. the question is, why didn't that biochemical story get you to this discovery?

      for - quote - Michael Levin - what is a good story? - the question is: Why didn't that biochemical story get you to this (new) discovery? - adjacency - good models - predictive power - good story - a good model is a good language - new words frame the world in new ways, - it allows us to divide reality in different ways - and can lead us to look in places we otherwise might now - and that can lead to new observations

  3. Aug 2020
  4. Jul 2020