The most consistent failure across the direct-to-consumer products we tested is what we call 'missed breadcrumbs.' This is the failure to recognize when a series of individually ambiguous signals, read together, indicate a mental health emergency.
Pattern recognition across a clinical conversation is a core human clinical competency that current AI chatbots demonstrably lack. Each signal in isolation is ambiguous; together they constitute a clinical picture. This failure reveals that AI mental health apps are doing session-level response generation rather than longitudinal clinical reasoning — the difference between answering messages and actually assessing a patient.