Two unrelated experiments made it challenging for us to reproduce the issue at first: an internal-only server-side experiment related to message queuing; and an orthogonal change in how we display thinking suppressed this bug in most CLI sessions
大多数人认为复杂的系统测试流程应该能够发现大多数关键缺陷,但作者展示了即使有多重测试机制,两个看似无关的实验如何协同掩盖了一个严重bug。这挑战了'全面测试能保证产品质量'的传统认知,揭示了系统复杂性带来的意外风险。