6 Matching Annotations
  1. Jun 2026
    1. The most recent tested Google model, Gemini 3.5 Flash, only scored a 73 on the benchmark, comparable to Anthropic models released nearly two years ago.

      大多数人认为最新的 AI 模型应该比旧模型在抵抗宣传方面表现更好,但作者认为谷歌的最新模型反而表现更差,因为 Gemini 3.5 Flash 的得分仅为 73,与 Anthropic 两年前发布的模型相当。这一发现挑战了人们对技术进步必然带来更好内容安全控制的假设。

  2. Jun 2020
  3. May 2020
  4. Jul 2019
  5. Aug 2015
    1. total offive variables in the frontierestimate: three input types in dollars, output in number of patient-days, andquality in estimated HSMR values.

      So this is the model translated as:

      number of patient-days(by hospital*) = HSMR value(ratio) + capital prices($) + labor($) + materials($)

      • all the co-variants are also accounted by Hospital

    Tags

    Annotators