5 Matching Annotations
  1. Jun 2026
    1. The most recent tested Google model, Gemini 3.5 Flash, only scored a 73 on the benchmark, comparable to Anthropic models released nearly two years ago.

      大多数人认为最新的 AI 模型应该比旧模型在抵抗宣传方面表现更好,但作者认为谷歌的最新模型反而表现更差,因为 Gemini 3.5 Flash 的得分仅为 73,与 Anthropic 两年前发布的模型相当。这一发现挑战了人们对技术进步必然带来更好内容安全控制的假设。

  2. Feb 2021
  3. Jan 2019
    1. There are some environmental elements of the Withdrawal Agreement which our current proposals do not cover, namely those concerning the independent body’s scope to enforce implementation of the “non-regression” clause. We will consider these provisions of the Withdrawal Agreement ahead of publishing the final Bill

      hmmmmm....

    2. The text sets out that, if the protocol is required, the UK and EU will not reduce their respective levels of environmental protection below those in place at the end of the implementation period

      note the 'if' attached to N-R

  4. Aug 2018