Hypothesis

2 Matching Annotations

Apr 2026
metr.org metr.org

Task-Completion Time Horizons of Frontier AI Models

1
1. fxp007 09 Apr 2026
  
  in Public
  
  solving 1000 separate 1-hour math problems isn't a 1000-hour task; we'd consider it a 1-hour task done 1000 times.
  
  这个定义区分揭示了时间地平线框架的核心洞见：真正衡量 AI 自主性的，是「无法并行化的连续推理深度」，而非「并行处理的吞吐量」。1000 个独立数学题可以用 1000 个 API 调用同时解决；而「迭代调试一个复杂系统，每个修复都依赖前一个尝试的结果」，才是真正考验时间地平线的任务类型。这个框架把「深度推理连续性」确立为 AI 自主能力的核心度量维度。
  
  task-definition sequential-reasoning autonomy-measurement insight
Visit annotations in context

Tags

sequential-reasoning

autonomy-measurement

insight

task-definition

Annotators

fxp007

URL

metr.org/time-horizons/
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Intro_to_HCI_20_Automation.pdf

1
1. elglassman 08 Apr 2026
  
  in Public
  
  The first strategy is called maximum automation. Here, each task that can be automated is allo-cated to a machine.
  
  concept: maximum allocation definition concept: task allocation
Visit annotations in context

Tags

concept: maximum allocation

concept: task allocation

definition

Annotators

elglassman

URL

glassmanlab.seas.harvard.edu/annotated_works/Intro_to_HCI_20_Automation.pdf