1 Matching Annotations
  1. Last 7 days
    1. in 89% of the 198 manually reviewed vulnerability reports, our expert contractors agreed with Claude's severity assessment exactly, and 98% of the assessments were within one severity level. If these results hold consistently for our remaining findings, we would have over a thousand more critical severity vulnerabilities and thousands more high severity vulnerabilities.

      89%的严重性评估精确一致是一个重要的校准信号:它意味着Mythos不仅能找到漏洞,还能准确理解其安全影响。这个校准水平与经验丰富的人类安全研究员相当甚至更优。基于这个比率外推的「上千个关键严重性漏洞」虽然是估计值,但有统计基础——这是迄今为止关于AI大规模漏洞发现能力最有力的量化声明。