7 Matching Annotations
  1. Last 7 days
    1. For the computer-use work that sits at the heart of XBOW's autonomous penetration testing, the new Claude Opus 4.7 is a step change: 98.5% on our visual-acuity benchmark versus 54.5% for Opus 4.6.

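      The jump from 54.5% to 98.5% on the visual-acuity benchmark is a striking improvement and shows how quickly AI is advancing in cybersecurity. The remark 'our single biggest Opus pain point effectively disappeared' indicates that the gain removed a key bottleneck in real-world use.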

    1. Anthropic found that Mythos Preview was far more capable than previous models at exploiting vulnerabilities in Firefox's JavaScript implementation. Anthropic's previous best model, Claude Opus 4.6, created a successful exploit less than 1% of the time. Mythos Preview did so 72% of the time.

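      Surprisingly, Claude Mythos Preview's success rate at exploiting Firefox vulnerabilities jumped from less than 1% for Opus 4.6 to 72%. That is more than a seventyfold increase, and it points to potentially revolutionary changes in offensive and defensive cybersecurity.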

    1. Out of all generated PoCs, 759 triggered crashes across 60 projects, and manual inspection confirmed 17 cases of incomplete patches spanning 15 projects

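      Surprisingly, AI-generated proofs of concept (PoCs) can reveal where human-written security patches are incomplete. This suggests that AI can not only find vulnerabilities but also assess the effectiveness of existing patches, which matters for software security because human developers may overlook these subtle patch flaws.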

    1. Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser.

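      Surprisingly, the Claude Mythos Preview model has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser. This suggests that AI models have reached a level at which they can outperform most human experts at finding software vulnerabilities, a capability with revolutionary significance for cybersecurity.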

    1. Claude Mythos autonomously identified and exploited several significant vulnerabilities. Notably, it discovered a 27-year-old vulnerability in OpenBSD

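      Surprisingly, Claude Mythos was able to autonomously discover and exploit a 27-year-old OpenBSD vulnerability. This shows that AI models' cybersecurity capabilities have reached a remarkable level, finding vulnerabilities that human experts and security systems had long missed. It also raises deep questions about AI safety and control mechanisms.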

    1. In the past, exploiting an application required a highly skilled hacker with years of experience and a significant investment of time to find and exploit vulnerabilities.

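      Surprisingly, the article describes a fundamental shift in cybersecurity: exploitation work that once required highly skilled hackers with years of experience can now be completed by AI in a short time. This democratization of the technique improves efficiency, but it also sharply lowers the barrier to attack and makes the cybersecurity situation markedly worse.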