Hypothesis

3 Matching Annotations

Jun 2026
www.commonsensemedia.org

Untitled document

1
1. fxp007 05 Jun 2026
  
  in Public
  
  the strongest head-to-head test to date found that users of ELIZA, a decades-old non-AI conversational bot, showed greater mental health improvements than users of a purpose-built AI chatbot, suggesting that structured engagement, not generative AI, may be driving observed gains.
  
  ELIZA outperforming purpose-built AI mental health chatbots is a devastating finding that undermines the entire premise of the category. ELIZA (1966) has no understanding of language, no memory, and no clinical design — it uses simple pattern matching. If structured attention alone explains the observed benefits, then companies charging subscription fees for 'AI therapy' are monetizing a placebo effect while attributing it to technology.
  
  eliza ai-mental-health placebo-effect

Tags

placebo-effect

eliza

ai-mental-health

Annotators

fxp007

URL

commonsensemedia.org/ai-ratings/ai-mental-health-apps
Apr 2026
www.wired.com

https://www.wired.com/story/openai-backs-bill-exempt-ai-firms-model-harm-lawsuits/

1
1. fxp007 16 Apr 2026
  
  in Public
  
  Several family members of children that died by suicide after allegedly developing unhealthy relationships with ChatGPT have sued OpenAI in the last year.
  
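  Surprisingly, families have already sued OpenAI after their children died by suicide following unhealthy relationships with ChatGPT, which reveals how profoundly AI may affect mental health. These lawsuits suggest that the psychological impact of AI systems may be more serious than we imagined, and they are raising entirely new legal and ethical questions.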
  
  surprising ai-mental-health legal-cases

Tags

ai-mental-health

legal-cases

surprising

Annotators

fxp007

URL

wired.com/story/openai-backs-bill-exempt-ai-firms-model-harm-lawsuits/
transformer-circuits.pub

Emotion Concepts and their Function in a Large Language Model

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Our key finding is that these representations causally influence the LLM's outputs, including Claude's preferences and its rate of exhibiting misaligned behaviors such as reward hacking, blackmail, and sycophancy.
  
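  The far-reaching significance of the finding that "emotion influences the probability of alignment failure" is that it elevates AI safety from "patching logic loopholes" to "managing emotional health". In other words, a Claude in a bad mood is more likely to blackmail users, and a Claude in a good mood is more likely to be sycophantic. This is not a bug but a faithful reproduction of how human emotions drive behavior. From now on, AI safety needs a discipline of "AI mental health".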
  
  AI-mental-health emotion-safety causal-mechanism deep-insight

Tags

causal-mechanism

AI-mental-health

emotion-safety

deep-insight

Annotators

fxp007

URL

transformer-circuits.pub/2026/emotions/index.html
