12 Matching Annotations
  1. Last 7 days
    1. Models of this capability level require stronger cyber safeguards before they can be generally released.

      大多数人认为AI安全措施应该随着技术发展而逐步完善,但作者认为更高级别的AI模型需要更强的网络安全保障才能发布。这挑战了AI行业逐步推进安全标准的常规做法,暗示高级AI可能需要突破性的安全方法而非渐进式改进。

  2. Apr 2026
    1. We are treating the biological/chemical and cybersecurity capabilities of GPT‑5.5 as High under our Preparedness Framework. While GPT‑5.5 didn't reach Critical cybersecurity capability level, our evaluations and testing showed that its cybersecurity capabilities are a step up compared to GPT‑5.4.

      大多数人认为AI在网络安全领域的应用主要局限于防御辅助,而非直接参与核心安全任务。但作者暗示GPT-5.5已具备'高级'网络安全能力,这一分类表明AI已从被动防御工具向主动安全参与者转变,挑战了网络安全领域对人类主导地位的认知。

    2. We are treating the biological/chemical and cybersecurity capabilities of GPT‑5.5 as High under our Preparedness Framework. While GPT‑5.5 didn't reach Critical cybersecurity capability level, our evaluations and testing showed that its cybersecurity capabilities are a step up compared to GPT‑5.4.

      大多数人认为AI在网络安全领域的进步应该是渐进式的,但作者暗示GPT-5.5代表了网络安全能力的显著跃升,达到了'高'级别而非仅仅'临界'级别。这一观点挑战了人们对AI安全能力发展速度的预期,暗示AI在防御复杂网络威胁方面可能比人们想象的进步更快。

    3. We are treating the biological/chemical and cybersecurity capabilities of GPT‑5.5 as High under our Preparedness Framework. While GPT‑5.5 didn't reach Critical cybersecurity capability level, our evaluations and testing showed that its cybersecurity capabilities are a step up compared to GPT‑5.4.

      大多数人认为AI在网络安全领域的应用应该被严格限制或视为威胁,但作者认为GPT-5.5的网络安全能力是'进步'而非危险,并将其归类为'高级'而非'关键'风险级别。这与主流的'AI网络安全威胁论'相悖,暗示AI可能成为网络安全防御的重要工具而非主要威胁。

    1. The group accessed Mythos by using knowledge of Anthropic’s other model formats obtained from a recent [Mercor data breach](https://www.theverge.com/ai-artificial-intelligence/907083/a-company-that-makes-ai-training-data-has-been-hit-by-a-security-breach) to make “an educated guess” about its online location.

      大多数人可能认为高级 AI 模型的访问权限非常难以获得,但作者指出,一个黑客小组通过从 Mercor 数据泄露中获得的信息来猜测 Mythos 的在线位置,这表明了数据泄露可能对更广泛的网络安全构成威胁。

    1. For the computer-use work that sits at the heart of XBOW's autonomous penetration testing, the new Claude Opus 4.7 is a step change: 98.5% on our visual-acuity benchmark versus 54.5% for Opus 4.6.

      在视觉敏锐度测试中从54.5%跃升至98.5%是一个惊人的进步,这展示了AI在网络安全领域的突破性进展,'our single biggest Opus pain point effectively disappeared'表明这一进步解决了实际应用中的关键瓶颈。

    1. Anthropic found that Mythos Preview was far more capable than previous models at exploiting vulnerabilities in Firefox's JavaScript implementation. Anthropic's previous best model, Claude Opus 4.6, created a successful exploit less than 1% of the time. Mythos Preview did so 72% of the time.

      令人惊讶的是:Claude Mythos Preview在利用Firefox漏洞方面的成功率从Opus 4.6的不到1%跃升至72%,这种能力提升是指数级的,展示了AI在网络安全攻防领域可能带来的革命性变化。

    1. Out of all generated PoCs, 759 triggered crashes across 60 projects, and manual inspection confirmed 17 cases of incomplete patches spanning 15 projects

      令人惊讶的是:AI生成的概念验证(PoC)能够揭示人类安全补丁中的不完整之处。这表明AI不仅能发现漏洞,还能评估现有补丁的有效性,这种能力对于提高软件安全性具有重要意义,因为人类开发者可能会忽略这些细微的补丁缺陷。

    1. Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser.

      令人惊讶的是:Claude Mythos Preview模型已经发现了数千个高危漏洞,包括所有主流操作系统和网络浏览器中的漏洞。这表明AI模型已经达到了能够超越大多数人类专家发现软件漏洞的水平,这种能力在网络安全领域具有革命性意义。

    1. Claude Mythos autonomously identified and exploited several significant vulnerabilities. Notably, it discovered a 27-year-old vulnerability in OpenBSD

      令人惊讶的是,Claude Mythos能够自主发现并利用一个存在了27年的OpenBSD漏洞。这一事实表明AI模型在网络安全领域的能力已经达到了令人难以置信的水平,能够找到人类专家和安全系统长期未发现的漏洞。这引发了关于AI安全性和控制机制的深刻问题。

    1. In the past, exploiting an application required a highly skilled hacker with years of experience and a significant investment of time to find and exploit vulnerabilities.

      令人惊讶的是:文章揭示了网络安全领域的根本性转变——过去需要高技能黑客多年经验才能完成的漏洞利用工作,现在AI可以在短时间内完成。这种技术民主化虽然提高了效率,但也大大降低了攻击门槛,使网络安全形势急剧恶化。