3 Matching Annotations
  1. Last 7 days
    1. in 89% of the 198 manually reviewed vulnerability reports, our expert contractors agreed with Claude's severity assessment exactly, and 98% of the assessments were within one severity level. If these results hold consistently for our remaining findings, we would have over a thousand more critical severity vulnerabilities and thousands more high severity vulnerabilities.

      89%的严重性评估精确一致是一个重要的校准信号:它意味着Mythos不仅能找到漏洞,还能准确理解其安全影响。这个校准水平与经验丰富的人类安全研究员相当甚至更优。基于这个比率外推的「上千个关键严重性漏洞」虽然是估计值,但有统计基础——这是迄今为止关于AI大规模漏洞发现能力最有力的量化声明。

  2. Jun 2024
    1. this is a serious problem because all they need to do is automate AI research 00:41:53 build super intelligence and any lead that the US had would vanish the power dynamics would shift immediately

      for - AI - security risk - once automated AI research is known, bad actors can easily build superintelligence

      AI - security risk - once automated AI research is known, bad actors can easily build superintelligence - Any lead that the US had would immediately vanish.

  3. Jul 2020