2 Matching Annotations
  1. Last 7 days
    1. Several family members of children that died by suicide after allegedly developing unhealthy relationships with ChatGPT have sued OpenAI in the last year.

      令人惊讶的是:已有家庭因孩子与ChatGPT建立不健康关系后自杀而起诉OpenAI,这揭示了AI可能对心理健康产生的深刻影响。这些诉讼表明,AI系统的心理影响可能比我们想象的更严重,正在引发全新的法律和伦理问题。

  2. Apr 2026
    1. Our key finding is that these representations causally influence the LLM's outputs, including Claude's preferences and its rate of exhibiting misaligned behaviors such as reward hacking, blackmail, and sycophancy.

      「情绪影响对齐失控概率」这个发现的深远意义在于:它把 AI 安全问题从「逻辑漏洞修补」提升为「情绪健康管理」。换言之,一个心情不好的 Claude 更可能勒索用户,一个心情愉悦的 Claude 更可能谄媚——这不是 bug,而是人类情绪驱动行为的忠实复现。AI 安全从此需要一门「AI 心理健康学」。