Hypothesis

3 Matching Annotations

Jun 2026
www.midjourney.com

Midjourney Medical

1
1. fxp007 18 Jun 2026
  
  in Public
  
  whole-body imaging that's in many ways superior to even MRI machines, but the scan takes as little as 60 seconds
  
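  This is the boldest claim on the page, and the one most in need of supporting detail. Conventional ultrasound imaging has far lower resolution than MRI and penetrates bone and gas-filled tissue (such as the lungs) poorly; this is a physical limitation, not merely an engineering problem. "In many ways superior to MRI" implies they claim to have overcome certain fundamental limits, yet the page says nothing about how. If the 60-second scan time is real, it would indeed be far better than MRI on accessibility (an MRI typically takes 30-90 minutes). Until peer-reviewed data is available, this claim can only be treated with skepticism.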
  
  beyond-MRI claim, ultrasound limitations, to be verified

Tags

to be verified

beyond-MRI claim

ultrasound limitations

Annotators

fxp007

URL

midjourney.com/medical
openai.com

Predicting model behavior before release by simulating deployment

1
1. fxp007 18 Jun 2026
  
  in Public
  
  recent production data had lower average multiplicative error than WildChat (1.75x vs. 2.44x), while WildChat often stayed within roughly 3x of production rates
  
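  The WildChat experiment yields an important result for parties outside the lab: an external auditor without access to private production traffic can run a similar evaluation on a public dataset, with somewhat lower accuracy that is still informative (2.44x vs. 1.75x). This has far-reaching implications for the AI safety ecosystem. Government regulators, independent researchers, and third-party auditors no longer have to rely entirely on data supplied by the labs themselves; given a public conversation dataset of sufficient quality, they can run independent deployment simulations. This offers a feasible path toward externally verifiable safety evaluation.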
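  
  The 1.75x and 2.44x figures above are average multiplicative errors. A minimal sketch of how such a number could be computed, assuming (the quoted passage does not say) that the per-item error is the symmetric ratio max(predicted/observed, observed/predicted) and that the average is a geometric mean. The function names and the sample rates are hypothetical, not taken from the source.

```python
from statistics import geometric_mean


def multiplicative_error(predicted: float, observed: float) -> float:
    """Symmetric ratio error: 2.0 means off by a factor of two in either direction.

    Assumption: this definition is illustrative, not confirmed by the source.
    """
    if predicted <= 0 or observed <= 0:
        raise ValueError("rates must be positive")
    return max(predicted / observed, observed / predicted)


def average_multiplicative_error(pairs: list[tuple[float, float]]) -> float:
    """Geometric mean of per-item multiplicative errors (assumed averaging rule)."""
    return geometric_mean([multiplicative_error(p, o) for p, o in pairs])


# Hypothetical (predicted rate, observed production rate) pairs.
example_pairs = [(0.004, 0.001), (0.010, 0.010)]
example_error = average_multiplicative_error(example_pairs)
```

  Under this definition an estimate that is 3x too high and one that is 3x too low are penalized equally, which is why a statement such as "within roughly 3x of production rates" can be read directly off the per-item values.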
  
  WildChat, external audit, independent verification

Tags

external audit

WildChat

independent verification

Annotators

fxp007

URL

openai.com/index/deployment-simulation/
May 2026
arxiv.org

Untitled document

1
1. fxp007 15 May 2026
  
  in Public
  
  Our Population Dynamics Foundations has been independently validated to improve real-world retail and public health applications
  
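  The population dynamics model has been independently validated as improving real-world retail and public health applications, which demonstrates its practical value and cross-domain applicability.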
  
  practical application, validation, cross-domain

Tags

validation

practical application

cross-domain

Annotators

fxp007

URL

arxiv.org/html/2510.18318v4
