Hypothesis

in 89% of the 198 manually reviewed vulnerability reports, our expert contractors agreed with Claude's severity assessment exactly, and 98% of the assessments were within one severity level. If these results hold consistently for our remaining findings, we would have over a thousand more critical severity vulnerabilities and thousands more high severity vulnerabilities.

89%的严重性评估精确一致是一个重要的校准信号：它意味着Mythos不仅能找到漏洞，还能准确理解其安全影响。这个校准水平与经验丰富的人类安全研究员相当甚至更优。基于这个比率外推的「上千个关键严重性漏洞」虽然是估计值，但有统计基础——这是迄今为止关于AI大规模漏洞发现能力最有力的量化声明。

severity-calibration vulnerability-scale ai-security-research

Tags

Annotators

URL