Hypothesis

We subjected the model to our full suite of predeployment safety evaluations and our Preparedness Framework, including targeted red-teaming for advanced cybersecurity and biology capabilities

大多数人认为AI安全评估主要集中在防止直接有害输出，但OpenAI特别强调了对'高级网络生物学能力'的针对性红队测试。这暗示GPT-5.5可能具有比预期更强大的生物相关能力，这违背了AI领域普遍认为的'语言模型主要处理文本信息'的共识，表明AI已经深入到专业科学领域。

non-consensus bio-capabilities safety-framework

Tags

Annotators

URL