  1. Last 7 days
    1. We also ran evaluations of model latency and classification performance under varying false positive rates for the following LLMs by OpenAI: GPT-4o, GPT-4o-mini, and o3-mini.

      sentences describing methods the authors used; one sentence at a time

    2. We ensured each list was 30 items long as our pilot studies suggested this was long enough that manual detection starts to become unwieldy (users need to scroll up and down the document), but short enough that participants could become familiar in a short period.

      sentences describing methods the authors used; one sentence at a time

    3. We adapted two intent specifications from our evals: Mars Game Design Document and Financial Advice AI Agent Memory, as these tasks mapped to the two paradigmatic types covered in Sections 2 and 2.1 (design documents, and AI memory of the user).

      sentences describing methods the authors used; one sentence at a time

    4. We chose OpenAI's ChatGPT Canvas as a baseline for five reasons: (i) it is a popular, commercially available tool, hence it is likely familiar to users; (ii) it provides a document editing view, where users can select text and ask GPT to rewrite it, or chat with an AI to make global edits; (iii) it employs a similar class of model (GPT-4o); (iv) it supports similar editing features as SemanticCommit like inline text selection, conflict highlighting, and a diff view, while adding free-form editing; and (v) similar interfaces like Anthropic Artifacts tended to rewrite the specification entirely, and did not offer Canvas's "diff" view to allow for a fair comparison.

      sentences describing methods the authors used; one sentence at a time

    5. Our explorations went through substantial iterations and prompt prototyping over a period of eight months, evolving in response to two pilot studies and progressing from a card-based interface to a list of texts.

      sentences describing methods the authors used; one sentence at a time

    6. We iterated on prompts using ChainForge [5] by setting up an evaluation pipeline against our datasets, which allowed us to observe the effects of prompt changes and model choices.

      sentences describing methods the authors used; one sentence at a time

    7. For qualitative analysis, the first author performed open coding on participant responses and audio transcripts to identify themes, which were used to interpret the qualitative results.

      sentences describing methods the authors used; one sentence at a time

    8. In the post-task surveys, we collected self-reported NASA Task Load Index (TLX) scores, Likert-scale ratings for ease of use, and responses on how well the AI helped participants identify, understand, and resolve semantic conflicts.

      sentences describing methods the authors used; one sentence at a time

    9. We run end-to-end evaluations on our four eval datasets using GPT-4o and GPT-4o-mini and report the mean ± stddev of accuracy, precision, recall, and F1 scores for the three approaches in Figure 5.

      sentences describing methods the authors used; one sentence at a time
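The metrics this annotation names can be reproduced with a few lines of Python; a minimal sketch assuming binary conflict labels (the paper's own evaluation code is not shown here, and `binary_metrics`/`mean_pm_std` are illustrative names):

```python
from statistics import mean, stdev

def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F1 for binary (0/1) labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    acc = (tp + tn) / len(y_true)
    prec = tp / (tp + fp) if tp + fp else 0.0
    rec = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
    return acc, prec, rec, f1

def mean_pm_std(per_dataset_scores):
    """Aggregate one metric across datasets as (mean, stddev)."""
    return mean(per_dataset_scores), stdev(per_dataset_scores)
```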

    10. We compare our end-to-end system against two simpler methods: (i) DropAllDocs, which adds all documents to the context for conflict classification; and (ii) InkSync [56] which generates a JSON list of string-replace operations.

      sentences describing methods the authors used; one sentence at a time
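The InkSync-style baseline above emits a JSON list of string-replace operations; applying such a list might look like the following sketch. The `{"find", "replace"}` schema is an assumption for illustration, not InkSync's actual format:

```python
import json

def apply_string_replacements(document: str, ops_json: str) -> str:
    """Apply a JSON list of string-replace operations to a document.
    Each op is assumed (hypothetically) to be {"find": ..., "replace": ...};
    ops whose "find" text is absent are skipped rather than raising."""
    for op in json.loads(ops_json):
        if op["find"] in document:
            document = document.replace(op["find"], op["replace"], 1)
    return document
```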

    11. Through a within-subjects study with 12 participants comparing SemanticCommit to a chat-with-document baseline (OpenAI Canvas), we find differences in workflow: half of our participants adopted a workflow of impact analysis when using SemanticCommit, where they would first flag conflicts without AI revisions then resolve conflicts locally, despite having access to a global revision feature.

      sentences describing methods the authors used; one sentence at a time

    12. These semantic conflicts require dedicated support to detect, visualize, and resolve. Semantic conflict resolution interfaces must go beyond visualizing what changes were made, to what changes could be made, where they should be made, and what the effects might be. This resembles feedforward: affordances that help the user foresee the impact of an action [67, 93].

      sentences describing connections to theory; one sentence at a time

    13. This reflects the principle of feedforward [67, 93] in communication theory—"a needed prescription or plan for a feedback, to which the actual feedback may or may not confirm" [79]—where a communicator provides "the context of what one was planning to talk about" [64, p. 179-80] in order to "pre-test the impact of [its output]" on the listener [34, p. 65].

      sentences describing connections to theory; one sentence at a time

    1. We consider common sequences of chunk roles to be alignable structures that could be used to support users in identifying structural similarities and differences across sentences in different abstracts, in line with Structure-Mapping Theory [17].

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    2. Like prior Structural Mapping Theory (SMT)-informed work in text corpora representation, AbstractExplorer's features have enabled some users to see more of both the overview and the details at the same time, facilitating abstraction without losing context.

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    3. This ordering prioritizes dominant structural patterns (largest groups first) while exposing fine-grained variations (via length-sorted triplets), mirroring how humans compare sentences, if SMT is an accurate description in this domain of comparative close reading.

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    4. Structural mappings between objects are part of the cognitive process of comparison according to the Structure-Mapping Theory [17], and juxtaposition can facilitate humans in recognizing particular possible structural mappings between objects [75].

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    5. In SMT terminology, rendering and arranging according to corresponding chunks reify "commonalities in structure," while variations within corresponding chunks are "alignable differences" that users are predicted to notice.

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    6. The prior SMT-informed tools in Section 2.3 for both code and natural language corpora suggest that the cognitive process of comparing texts may be no exception to the cognitive processes SMT predicts.

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    7. SMT posits that visual alignment helps people perceive relational similarities and differences more clearly, thereby improving their ability to make meaningful comparisons and understand underlying patterns [28, 38, 47].

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    8. Structural Mapping Theory (SMT) is a long-standing, well-vetted theory from Cognitive Science that describes how humans attend to and try to compare objects by finding mental representations of them that can be structurally mapped to each other (analogies).

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    9. This SMT-informed approach, which AbstractExplorer shares, tries to give this mental machinery "a leg up," letting users perhaps skip some steps by accepting reified cross-document relationships identified by the computer.

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    10. The human perceptual, comparative mental machinery that SMT describes is part of what enables humans to form more abstract structured mental models from concrete examples, among other critical knowledge tasks.

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    11. These examples of text-centric lossless techniques do not abstract away or summarize; they strategically re-organize and re-render the existing text to help enhance readers' own perceptual cognition, informed by Structural Mapping Theory (SMT) [17].

      sentences that mention theory, explicitly or implicitly; one sentence at a time

    12. We process this data in a three-stage pipeline (Figure 6). In the first stage, Sentence Segmentation and Categorization, abstracts are split into individual sentences using the NLTK package, and each sentence is classified into one of the five pre-defined aspects as listed in Section 4.1.1. Classification is performed by prompting an LLM (see prompt used in Appendix D.1) with the sentence and its full abstract.

      sentence relating to methodology
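The first pipeline stage splits abstracts into sentences with NLTK before LLM classification. As a self-contained stand-in for NLTK's `sent_tokenize` (which requires downloading the Punkt model), a naive stdlib splitter illustrates the step; the paper's pipeline uses NLTK itself:

```python
import re

def split_sentences(abstract: str) -> list[str]:
    """Naive stand-in for NLTK's sent_tokenize: split on
    sentence-ending punctuation followed by whitespace and an
    uppercase letter. Real pipelines should prefer NLTK/Punkt,
    which handles abbreviations and edge cases this regex misses."""
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", abstract.strip())
    return [p for p in parts if p]
```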

    13. Then, we segment sentences within each aspect into grammar-preserving chunks (see prompt used in Appendix D.2). This results in grammatically coherent chunks that are the basis of structure patterns. After identifying chunk boundaries, we again prompt an LLM to generate labels for chunks in a human-in-the-loop approach: starting from an initial set of labels for chunk roles, when a new label is generated, a researcher from the research team examines the new label and merges it with existing labels if appropriate, controlling for the total number of labels.

      sentence relating to methodology

    14. In this study, we allowed participants to experience views of same-aspect sentences (Section 4.1.1) with different combinations of highlighting, ordering, and alignment (as described in Section 4.1.2 and Section 4.1.4) enabled or not, in order to understand which and/or what combinations most effectively supported users' ability to skim and read laterally across documents.

      sentence relating to methodology

    15. Inspired by GP-TSM [24], AbstractExplorer first segments sentences into grammar-preserving chunks—segments that respect grammatical boundaries, i.e., an LLM judges that the sentence can be truncated at that chunk boundary without breaking the grammatical integrity of the preceding text. Each chunk is then classified by an LLM as having one of nine pre-defined roles, each of which has its own assigned color.

      sentence relating to methodology

    16. We conducted a qualitative analysis of user study transcripts and survey responses using a Grounded Theory approach [8]. First, the lead researcher collected a list of participants' behaviors, approaches, reflections on their experience, and feedback about the interface. The researcher then systematically coded this data, revisiting the data multiple times and refining the codes to ensure consistency and coherence. Through this process, high-level themes were identified and organized using affinity diagramming. Once the thematic structure was finalized, the researcher gathered supporting evidence for each theme and synthesized the findings, which were reviewed by the research team to ensure agreement on the results.

      sentence describing how analysis was performed on data collected by the authors of this paper

    17. Interviews were video and audio recorded. We transcribed the audio using OpenAI's Whisper automatic speech recognition system and anonymized the transcript before analysis. We analyzed the interview data using thematic analysis [1]. First, two members of the research team independently coded four (25% of collected data) randomly chosen participant data to generate low-level codes. The inter-coder reliability between the coders was 0.88 using Krippendorff's alpha [37]. The two coders then met together to cross-check, resolve coding conflicts, and consolidate the codes into a codebook across two sessions. Using the codebook, the two coders analyzed six randomly selected participant data each. The research team then met, discussed the analysis outcomes, and finalized themes over three sessions.

      sentence describing how analysis was performed on data collected by the authors of this paper
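The inter-coder reliability of 0.88 reported above is Krippendorff's alpha. For the simple case that applies here (two coders, nominal codes, no missing values), the standard coincidence-matrix formula can be sketched in stdlib Python; the function name is illustrative and the paper presumably used an existing implementation:

```python
from collections import Counter
from itertools import permutations

def krippendorff_alpha_nominal(coder_a, coder_b):
    """Krippendorff's alpha for nominal data, two coders, no missing
    values: alpha = 1 - D_observed / D_expected over the coincidence
    matrix of paired codes."""
    assert len(coder_a) == len(coder_b)
    coincidences = Counter()
    for a, b in zip(coder_a, coder_b):   # each unit contributes both orderings
        coincidences[(a, b)] += 1
        coincidences[(b, a)] += 1
    n = 2 * len(coder_a)                 # total pairable values
    marginals = Counter()
    for (c, _), count in coincidences.items():
        marginals[c] += count
    d_observed = sum(cnt for (c, k), cnt in coincidences.items() if c != k) / n
    d_expected = sum(marginals[c] * marginals[k]
                     for c, k in permutations(marginals, 2)) / (n * (n - 1))
    return 1.0 - d_observed / d_expected
```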

    18. Activity log data, which revealed how participants actually used the interface, echoed the above findings. According to the log data, participants spent most of their reading time (66.31%) with vertical alignment on the second element in structure pairs, followed by alignment on the first element (29.19%), and left-justified alignment (5.13%). Highlighting usage showed a similar preference: 91.13% of time with all chunks highlighted, 8.25% with partial highlighting, and minimal time (0.63%) without highlights.

      sentence describing how analysis was performed on data collected by the authors of this paper

    19. In this section, we present findings on how AbstractExplorer supports comparative close reading at scale by integrating quantitative survey responses and log data with qualitative analysis of transcripts and open-ended responses. The qualitative analysis process is described in detail in Appendix H.

      sentence describing how analysis was performed on data collected by the authors of this paper

    20. Throughout the two tasks, we also collected detailed interaction logs including counts of user-defined aspects created, duration of highlighting usage, and time allocation across the three possible alignment options.

      sentence describing how analysis was performed on data collected by the authors of this paper

    21. Both gaze data and the semi-structured interviews revealed that lower NFC participants were more willing to be guided by the three features and took advantage of them consciously.

      sentence describing how analysis was performed on data collected by the authors of this paper

    22. Using a two-tailed Mann-Whitney U Test, we found that participants who reported their lowest perceived cognitive load when all three features were enabled had significantly lower NFC than participants who reported their lowest cognitive load level when skimming with no features enabled—in the baseline interface (p=0.03).

      sentence describing how analysis was performed on data collected by the authors of this paper
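The two-tailed Mann-Whitney U test used above is typically run via `scipy.stats.mannwhitneyu`; a stdlib sketch of the normal-approximation version (average ranks for ties, no tie or continuity correction, so small-sample p-values are approximate) shows what is being computed:

```python
import math

def mann_whitney_u_two_tailed(xs, ys):
    """Mann-Whitney U with a two-tailed p-value from the normal
    approximation. Simplified sketch: no tie variance correction,
    no continuity correction, unlike production implementations."""
    n1, n2 = len(xs), len(ys)
    pooled = sorted([(v, "x") for v in xs] + [(v, "y") for v in ys])
    rank_sum_x = 0.0
    i = 0
    while i < len(pooled):               # assign average ranks to tied runs
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2       # mean of 1-based ranks i+1 .. j
        rank_sum_x += sum(avg_rank for k in range(i, j) if pooled[k][1] == "x")
        i = j
    u1 = rank_sum_x - n1 * (n1 + 1) / 2
    u = min(u1, n1 * n2 - u1)
    mu = n1 * n2 / 2
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mu) / sigma
    p = math.erfc(abs(z) / math.sqrt(2)) # two-tailed tail probability
    return u, p
```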

    23. For simplicity of analysis, we denote participants with NFC scores above the overall participants' median NFC of 5.42 (IQR = 0.583) as higher NFC, and lower NFC otherwise.

      sentence describing how analysis was performed on data collected by the authors of this paper

    24. To contrast participants' gaze patterns in each condition, we used a Tobii Pro Spark eye-tracker placed below the desktop monitor used by all subjects; Tobii Pro Lab software recorded each participant's gaze over time in each condition.

      sentence describing how analysis was performed on data collected by the authors of this paper

    25. We collected 80 sentences from our abstracts dataset labeled by our system as "Methodology/Contribution." Participants viewed the same 80 sentences in each condition—often with a different subset of sentences initially visible due to ordering changes—but only had two minutes to look at them in each condition.

      sentence describing how analysis was performed on data collected by the authors of this paper

    26. After obtaining an expanded set of high-level chunk labels, we assign them to each of the sentence chunks by using LLMs in a multiclass classification few-shot learning task, with the initial labels and assignment as examples (see prompt used in Appendix D.3).

      sentence describing how analysis was performed on data collected by the authors of this paper
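The few-shot multiclass labeling step above amounts to assembling a prompt from the allowed labels plus labeled examples, then asking an LLM for one label. A hypothetical prompt builder sketches the shape; the label names, wording, and layout are illustrative, not the paper's actual prompt (which is in its Appendix D.3):

```python
def build_fewshot_prompt(labels, examples, chunk):
    """Assemble a hypothetical few-shot multiclass classification
    prompt for chunk-role labeling. `examples` is a list of
    (chunk_text, label) pairs used as in-context demonstrations."""
    lines = ["Assign exactly one role label to the text chunk.",
             "Allowed labels: " + ", ".join(labels), ""]
    for text, label in examples:
        lines.append(f"Chunk: {text}\nLabel: {label}")
    lines.append(f"Chunk: {chunk}\nLabel:")
    return "\n".join(lines)
```

The trailing `Label:` leaves the completion slot for the model; constraining its output to the listed label set would be handled by the caller.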

    27. Then, we segment sentences within each aspect into grammar-preserving chunks (see prompt used in Appendix D.2). This results in grammatically coherent chunks that are the basis of structure patterns. After identifying chunk boundaries, we again prompt an LLM to generate labels for chunks in a human-in-the-loop approach: starting from an initial set of labels for chunk roles, when a new label is generated, a researcher from the research team examines the new label and merges it with existing labels if appropriate, controlling for the total number of labels.

      sentence describing how analysis was performed on data collected by the authors of this paper

    28. We process this data in a three-stage pipeline (Figure 6). In the first stage, Sentence Segmentation and Categorization, abstracts are split into individual sentences using the NLTK package, and each sentence is classified into one of the five pre-defined aspects as listed in Section 4.1.1. Classification is performed by prompting an LLM (see prompt used in Appendix D.1) with the sentence and its full abstract.

      sentence describing how analysis was performed on data collected by the authors of this paper

  2. ontheline.trincoll.edu