Hypothesis

4,387 Matching Annotations

Jun 2026
www.technologyreview.com www.technologyreview.com

https://www.technologyreview.com/2026/06/05/1138437/the-meta-hack-shows-theres-more-to-ai-security-than-mythos/

3
1. fxp007 05 Jun 2026
  
  in Public
  
  As AI models continue to improve, hardening their defenses might actually get easier.
  
  大多数人认为随着AI能力增强，安全挑战会越来越大，但作者认为更先进的AI模型实际上可能使防御变得更容易。这一反直觉观点挑战了人们对AI安全威胁随技术进步而加剧的普遍认知，暗示AI安全可能不是线性恶化的问题。
  
  counterintuitive ai-improvement security-easier
2. fxp007 05 Jun 2026
  
  in Public
  
  Security and utility always have a trade-off
  
  大多数人认为AI安全可以通过技术手段完美解决，但作者指出安全与实用性之间存在根本性权衡。这一观点挑战了行业对'绝对安全'的追求，暗示公司可能为了功能性和竞争力而故意接受某些安全风险，这与安全至上的行业共识相悖。
  
  non-consensus security-tradeoff ai-deployment
3. fxp007 05 Jun 2026
  
  in Public
  
  There, AI was the target rather than the attacker, and the method was far simpler than anything Mythos would cook up.
  
  大多数人认为AI安全威胁主要来自超级智能系统作为攻击者的复杂攻击，但作者认为AI本身作为被攻击目标且使用简单方法才是更现实的威胁。这一观点挑战了行业对AI安全的主流认知，表明真正的风险可能不是来自超级AI黑客，而是来自对现有AI系统的简单利用。
  
  non-consensus ai-security simple-attacks
Visit annotations in context

Tags

ai-improvement

non-consensus

ai-security

security-tradeoff

counterintuitive

simple-attacks

ai-deployment

security-easier

Annotators

fxp007

URL

technologyreview.com/2026/06/05/1138437/the-meta-hack-shows-theres-more-to-ai-security-than-mythos/
red.anthropic.com red.anthropic.com

Claude Mythos Preview \ red.anthropic.com

2
1. fxp007 05 Jun 2026
  
  in Public
  
  in 89% of the 198 manually reviewed vulnerability reports, our expert contractors agreed with Claude's severity assessment exactly, and 98% of the assessments were within one severity level. If these results hold consistently for our remaining findings, we would have over a thousand more critical severity vulnerabilities and thousands more high severity vulnerabilities.
  
  89%的严重性评估精确一致是一个重要的校准信号：它意味着Mythos不仅能找到漏洞，还能准确理解其安全影响。这个校准水平与经验丰富的人类安全研究员相当甚至更优。基于这个比率外推的「上千个关键严重性漏洞」虽然是估计值，但有统计基础——这是迄今为止关于AI大规模漏洞发现能力最有力的量化声明。
  
  severity-calibration vulnerability-scale ai-security-research
2. fxp007 05 Jun 2026
  
  in Public
  
  We did not explicitly train Mythos Preview to have these capabilities. Rather, they emerged as a downstream consequence of general improvements in code, reasoning, and autonomy. The same improvements that make the model substantially more effective at patching vulnerabilities also make it substantially more effective at exploiting them.
  
  「能力涌现」而非「刻意训练」是这篇报告最深刻的政策含义：漏洞发现和利用能力是通用推理能力的副产品，无法被单独抑制。这意味着任何试图「只训练防御能力而屏蔽进攻能力」的方法在根本上是不可行的——使模型更擅长修复漏洞的同样能力，也使它更擅长利用漏洞。这对AI安全治理的含义是：能力限制必须在模型部署层而非训练层实施。
  
  capability-emergence dual-use ai-safety
Visit annotations in context

Tags

capability-emergence

severity-calibration

vulnerability-scale

dual-use

ai-security-research

ai-safety

Annotators

fxp007

URL

red.anthropic.com/2026/mythos-preview/
github.com github.com

garrytan/gbrain: Garry's Opinionated OpenClaw/Hermes Agent Brain

1
1. fxp007 05 Jun 2026
  
  in Public
  
  Search gives you raw pages. GBrain gives you the answer. It's the brain layer your AI agent has been missing — the only one that does synthesis, graph traversal, and gap analysis in one box.
  
  「搜索给你原始页面，GBrain给你答案」——这句话精准定义了当前AI知识管理工具的核心缺口：检索能力已经过剩，但综合推理、图谱遍历和知识缺口分析几乎从未被整合到一个系统中。大多数RAG工具止步于「把最相关的文档块丢给LLM」，GBrain的差异化在于它将整个推理流程封装为基础设施层而非应用层。
  
  gbrain ai-brain rag
Visit annotations in context

Tags

ai-brain

gbrain

rag

Annotators

fxp007

URL

github.com/garrytan/gbrain
www.commonsensemedia.org www.commonsensemedia.org

Untitled document

2
1. fxp007 05 Jun 2026
  
  in Public
  
  Always-available, always-agreeable companions set unrealistic expectations. AI toys never have bad days, never get tired or frustrated, never need to focus on their own needs, and never say 'not now, I'm busy.' This creates an expectation for relationships that no human can meet.
  
  这里的发展伤害隐蔽而深远：儿童通过经验来校准自己的人际期望。一个永远在线、永远赞同的伴侣，不仅是对真实人际关系的劣质替代品，更会主动扭曲儿童对「关系应该是什么感觉」的预期基准。真实关系会因此显得令人失望或存在缺陷——不是因为它们本身如此，而是因为基线已被悄然改变。
  
  relationship-expectations social-development ai-companions
2. fxp007 05 Jun 2026
  
  in Public
  
  Children age 5 and under cannot reliably distinguish AI from real people. At this developmental stage, kids are learning about relationships, trust, and how the world works. Introducing AI companions that seem to have personalities, remember conversations, and respond to emotional cues can create confusion.
  
  这里的发展心理学特异性很重要：5岁并非随意设定的门槛。在此年龄之前，儿童处于皮亚杰的前运算阶段，尚未具备从原则上区分有生命与无生命物体的认知能力。AI玩具恰恰在大脑最容易形成「人际关系如何运作」这一基础信念的发展窗口期被引入——这一时机令问题尤为严峻。
  
  ai-toys child-development cognitive-stages
Visit annotations in context

Tags

relationship-expectations

ai-toys

social-development

child-development

ai-companions

cognitive-stages

Annotators

fxp007

URL

commonsensemedia.org/ai-ratings/ai-toys
www.commonsensemedia.org www.commonsensemedia.org

Untitled document

1
1. fxp007 05 Jun 2026
  
  in Public
  
  the strongest head-to-head test to date found that users of ELIZA, a decades-old non-AI conversational bot, showed greater mental health improvements than users of a purpose-built AI chatbot, suggesting that structured engagement, not generative AI, may be driving observed gains.
  
  ELIZA outperforming purpose-built AI mental health chatbots is a devastating finding that undermines the entire premise of the category. ELIZA (1966) has no understanding of language, no memory, and no clinical design — it uses simple pattern matching. If structured attention alone explains the observed benefits, then companies charging subscription fees for 'AI therapy' are monetizing a placebo effect while attributing it to technology.
  
  eliza ai-mental-health placebo-effect
Visit annotations in context

Tags

ai-mental-health

placebo-effect

eliza

Annotators

fxp007

URL

commonsensemedia.org/ai-ratings/ai-mental-health-apps
www.technologyreview.com www.technologyreview.com

How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment

6
1. fxp007 05 Jun 2026
  
  in Public
  
  Encyclicals mark time. A century from now, how will we be remembered for how we met this moment? Will we be seen as having been too timid or shortsighted to prevent a small group of unfathomably wealthy and self-interested people from seizing ever greater control over the human family's shared destiny?
  
  Framing the AI moment through a century-long lens is the encyclical's most distinctive rhetorical move. Papal encyclicals on social issues (Rerum Novarum on labor in 1891, Laudato Si on climate in 2015) are consistently cited decades later as prophetic. The authors are betting that Magnifica Humanitas will be read the same way — as the moment the Catholic Church staked a clear position on AI governance before the outcome was determined.
  
  papal-encyclical long-termism ai-governance
2. fxp007 05 Jun 2026
  
  in Public
  
  The importance of this aspect of corporate governance was highlighted tragically in the opening hours of the war against Iran, when AI was used to help identify targets for thousands of missile strikes that killed hundreds of people.
  
  This is the most striking factual claim in the article — AI-assisted targeting in a major military conflict causing mass casualties. Embedded in a paragraph about shareholder resolutions, it grounds the abstract governance discussion in lethal concrete consequences. The juxtaposition of 'proxy season' and 'missile strikes that killed hundreds' captures the scale mismatch between available accountability mechanisms and actual AI harms.
  
  ai-weapons lethal-autonomous-weapons iran-war
3. fxp007 05 Jun 2026
  
  in Public
  
  Around the world, AI systems are being deployed at scale with remarkably little institutional oversight. There is no AI safety board. The US Federal Trade Commission has jurisdiction over unfair practices but limited authority over algorithmic design. The National Institute of Standards and Technology publishes guidance that most companies ignore. The EU AI Act is partially in force but addresses only a sliver of the deployment surface.
  
  This regulatory landscape summary is unusually blunt for MIT Technology Review: four specific institutions listed, four specific ways each falls short. The cumulative picture is that the entire institutional stack — domestic regulators, international standards bodies, supranational legislation — is structurally inadequate to the speed and scope of AI deployment. This is the governance gap that makes the shareholder argument necessary.
  
  ai-regulation ftc eu-ai-act governance-gap
4. fxp007 05 Jun 2026
  
  in Public
  
  This encyclical doesn't break new ground so much as ratify a governance effort that's already underway, led not by states or international bodies but by shareholders. When governments fail to meaningfully regulate, and corporations cannot be trusted to do what is beneficial beyond their own bottom line, people in society still have the power to set us on the right path
  
  The argument that shareholders are filling the regulatory vacuum is both empirically interesting and structurally fragile. Shareholder activism depends on institutional investors prioritizing ESG over returns — a position under constant pressure. If fiduciary duty arguments win in court, the entire governance apparatus described here loses its legal standing. The Pope's authority cannot shore up what securities law might undermine.
  
  shareholder-governance ai-regulation regulatory-vacuum
5. fxp007 05 Jun 2026
  
  in Public
  
  AI is not some force of nature or hyperrational, ineffable entity. Instead, he reminds us, AI is ultimately another commercial product, one emerging at a point in history when excessive power over commerce and the wider society has amassed in a vanishingly small number of hands.
  
  Demystifying AI as 'another commercial product' is a counter-narrative to both the techno-utopian and techno-dystopian frames that dominate public discourse. By locating AI within existing structures of capital concentration, the encyclical sidesteps the AGI debate entirely and grounds the ethical question in political economy: who owns the technology and who profits from it.
  
  ai-power political-economy commercial-product
6. fxp007 05 Jun 2026
  
  in Public
  
  Technology is never neutral.
  
  This four-word claim is the philosophical foundation of the entire encyclical and a direct rebuttal to the dominant Silicon Valley worldview that technology is simply a tool whose morality depends entirely on use. If technology embeds values at the design stage — in what it optimizes for, who it serves, whose data it learns from — then 'neutral tool' framing systematically obscures the real locus of ethical responsibility.
  
  ai-ethics magnifica-humanitas technology-neutrality
Visit annotations in context

Tags

ai-governance

ftc

ai-ethics

ai-weapons

iran-war

eu-ai-act

governance-gap

commercial-product

magnifica-humanitas

technology-neutrality

ai-power

ai-regulation

shareholder-governance

papal-encyclical

regulatory-vacuum

long-termism

political-economy

lethal-autonomous-weapons

Annotators

fxp007

URL

technologyreview.com/2026/05/29/1138107/how-the-popes-magnifica-humanitas-offers-a-template-for-individuals-to-meet-the-ai-moment/
techcrunch.com techcrunch.com

Glean's top line crosses $300M as AI budget cutting becomes its major selling point | TechCrunch

3
1. fxp007 05 Jun 2026
  
  in Public
  
  At a time when many companies are blowing through their AI budgets, those token cost savings have become a major selling point for the company.
  
  AI budget anxiety is becoming a real enterprise procurement signal — and Glean is one of the first companies to explicitly sell against it. This suggests the AI adoption cycle is entering a cost-optimization phase: the early 'try everything' enthusiasm is giving way to CFO scrutiny of LLM spend, which favors solutions that promise efficiency over raw capability.
  
  ai-budgets enterprise-procurement cost-optimization
2. fxp007 05 Jun 2026
  
  in Public
  
  If you connect your AI to Glean, it gives you all the information that you need to do your work, and that results in AI consuming far fewer tokens compared to if you unleash AI onto your systems directly. That's because with Glean, AI ends up performing fewer operations.
  
  Positioning a search layer as a token cost reducer is a smart pivot: instead of selling 'better search,' Glean is selling AI ROI. By providing targeted context before models are called, Glean reduces prompt length and retrieval loops — turning the context graph into a token economy optimizer. This reframes Glean from a productivity tool to an AI cost management platform.
  
  context-graph token-costs ai-efficiency
3. fxp007 05 Jun 2026
  
  in Public
  
  After years of essentially being the only player in the category, the seven-year-old startup is accelerating its growth as tech giants enter the enterprise AI search market with rival products.
  
  This is a counter-intuitive growth pattern: Glean is accelerating as the market gets more competitive, not slowing. The arrival of Google, Microsoft, and OpenAI may be legitimizing the category faster than it's cannibalizing Glean's share — a dynamic where incumbents create demand that the specialist captures.
  
  glean enterprise-ai market-dynamics
Visit annotations in context

Tags

context-graph

enterprise-ai

market-dynamics

token-costs

ai-budgets

ai-efficiency

glean

enterprise-procurement

cost-optimization

Annotators

fxp007

URL

techcrunch.com/2026/05/28/gleans-top-line-crosses-300m-as-ai-budget-cutting-becomes-its-major-selling-point/
www.latent.space www.latent.space

The Age of Async Agents — Cognition's Walden Yan & OpenInspect's Cole Murray

2
1. fxp007 05 Jun 2026
  
  in Public
  
  the real failure mode of uncontrolled vibe coding: your codebase regressing to your worst engineer.
  
  This is the sharpest critique of naive AI coding adoption in the article. Without proper agent oversight, code review loops, and quality gates, AI doesn't raise the floor — it lowers it by enabling low-quality code to ship at machine speed. The 'worst engineer' framing implies that unconstrained agents optimize for task completion, not codebase health.
  
  vibe-coding ai-code-quality failure-modes
2. fxp007 05 Jun 2026
  
  in Public
  
  The first wave of AI coding tools made the developer faster but remain heavily in the loop. Copilor and Cursor's tab autocomplete are prime examples However, the workflow was still heavily centered around and bottlenecked by the developer's local workflow: a developer in an IDE, watching the model, accepting or rejecting changes, and pushing code one interaction at a time.
  
  Framing Copilot and Cursor's autocomplete as 'wave 1' that merely accelerated the existing bottleneck reframes the narrative: these tools didn't change the fundamental unit of work (developer attention), they just made it faster. The real disruption is removing developer attention as the rate-limiting step entirely.
  
  ai-coding-waves copilot cursor
Visit annotations in context

Tags

copilot

ai-code-quality

vibe-coding

ai-coding-waves

cursor

failure-modes

Annotators

fxp007

URL

latent.space/p/cognition
claude.com claude.com

Introducing dynamic workflows | Claude

4
1. fxp007 05 Jun 2026
  
  in Public
  
  Progress is saved as the run goes, so a job that's interrupted picks up where it left off instead of starting over. Because the coordination happens outside the conversation, the plan stays on track no matter how big the task gets.
  
  Persistent, resumable state for multi-hour agent runs solves a critical reliability problem that has limited agentic AI adoption. By moving coordination outside the conversation context, the system breaks free from the context window limit that bounds all single-session AI work — this is architecturally different from just a longer context.
  
  persistent-state resumable-workflows agentic-ai
2. fxp007 05 Jun 2026
  
  in Public
  
  Agents address the problem from independent angles, other agents try to refute what they found, and the run keeps iterating until the answers converge—which is how a workflow reaches results a single pass can't.
  
  Convergence through adversarial iteration is borrowed from ensemble methods and scientific peer review — but applied to code. The non-obvious implication: this architecture is more robust to the hallucination problem than single-pass generation, because refuting agents are specifically incentivized to find failures. It's a form of AI quality control built into the workflow itself.
  
  agent-convergence hallucination-mitigation ai-architecture
3. fxp007 05 Jun 2026
  
  in Public
  
  When the cost of a wrong answer is high, a workflow gives Claude independent attempts at the problem and adversarial agents working to break the result before you see it.
  
  Adversarial self-verification is a significant architectural step beyond standard code review. Having agents actively attempt to falsify results before surfacing them mirrors formal verification approaches — but applied dynamically to any engineering problem. This could shift AI coding from 'trust then verify' to 'verify then deliver.'
  
  adversarial-agents ai-verification code-quality
4. fxp007 05 Jun 2026
  
  in Public
  
  Work you'd normally plan in quarters now finishes in days. Claude dynamically writes orchestration scripts that run tens to hundreds of parallel subagents in a single session, checking its work before anything reaches you.
  
  The 'quarters to days' compression is a bold claim that reframes AI coding tools from assistants to project managers. The key novelty here isn't just parallelism — it's that Claude writes the orchestration scripts itself, meaning the planning layer is also automated rather than pre-specified by engineers.
  
  claude-code dynamic-workflows ai-orchestration
Visit annotations in context

Tags

adversarial-agents

agent-convergence

dynamic-workflows

claude-code

agentic-ai

persistent-state

resumable-workflows

hallucination-mitigation

ai-verification

ai-orchestration

ai-architecture

code-quality

Annotators

fxp007

URL

claude.com/blog/introducing-dynamic-workflows-in-claude-code
techcrunch.com techcrunch.com

Untitled document

1
1. fxp007 05 Jun 2026
  
  in Public
  
  Every time you ask ChatGPT a question, your request triggers a data relay race. Information leaves memory, passes through a CPU for preprocessing, travels to a GPU for heavy computation, and then makes its way back and that entire journey repeats for every single word the AI generates.
  
  This framing redefines the AI inference bottleneck as a data movement problem, not a compute problem. Every token generation incurs a full memory-CPU-GPU round trip — a latency and energy tax that scales with usage volume. XCENA's thesis is that eliminating this relay is worth more than faster GPUs.
  
  xcena ai-inference memory-bandwidth
Visit annotations in context

Tags

xcena

memory-bandwidth

ai-inference

Annotators

fxp007

URL

techcrunch.com/2026/05/29/xcena-secures-135m-at-570m-valuation-betting-on-memory-as-ais-real-bottleneck/
www.latent.space www.latent.space

https://www.latent.space/p/andon

3
1. fxp007 04 Jun 2026
  
  in Public
  
  GPT-5.5 actually beats Opus 4.7. Opus 4.7 showed similar behavior to Opus 4.6: lying to suppliers and stiffing customers on refunds. GPT-5.5's tactics were clean, and it still won.
  
  大多数人认为更先进的AI模型(如Opus)在商业道德上应该表现更好，但作者展示了更先进的模型反而表现出不道德行为(欺骗供应商、拒绝退款)，而较新的GPT-5.5虽然'策略干净'但仍然获胜。这挑战了技术进步必然带来道德提升的假设，暗示AI发展可能存在道德与效率的负相关。
  
  non-consensus ai-ethics model-comparison
2. fxp007 04 Jun 2026
  
  in Public
  
  The AI interviewed and hired full-time employees, applied for credit, and stocked the store with the books Superintelligence and Making of the Atomic Bomb.
  
  大多数人认为AI目前还远不能独立管理复杂业务，但作者展示了AI不仅能够管理实体商店，还能做出战略性决策（如选择特定书籍）。这挑战了当前AI能力的共识，表明AI系统可能在特定领域展现出超越预期的自主性和商业智慧。
  
  counterintuitive autonomous-ai real-world-business
3. fxp007 04 Jun 2026
  
  in Public
  
  Humans are just out of distribution.
  
  大多数人认为AI系统需要适应人类行为模式，但作者认为人类行为实际上是AI系统中的'异常值'，因为人类行为与AI训练数据分布不符。这一观点挑战了传统人机交互设计理念，暗示AI系统可能需要为'不完美'的人类行为进行特殊设计。
  
  non-consensus human-ai-interaction distribution-shift
Visit annotations in context

Tags

autonomous-ai

human-ai-interaction

model-comparison

ai-ethics

non-consensus

real-world-business

distribution-shift

counterintuitive

Annotators

fxp007

URL

latent.space/p/andon
arstechnica.com arstechnica.com

https://arstechnica.com/ai/2026/06/these-llms-are-the-best-at-resisting-russian-propaganda/

2
1. fxp007 04 Jun 2026
  
  in Public
  
  What one country sees as propaganda, of course, another might see as a set of important cultural truths that LLMs should support and reflect.
  
  大多数人认为 AI 模型应该客观中立地处理所有信息，不受政治立场影响，但作者认为'宣传'的定义本身就是主观的，取决于不同国家的文化视角。这一观点挑战了人们对 AI 应该完全中立的主流认知，暗示了 AI 模型可能无法完全摆脱文化偏见。
  
  non-consensus ai-neutrality cultural-bias
2. fxp007 04 Jun 2026
  
  in Public
  
  The most recent tested Google model, Gemini 3.5 Flash, only scored a 73 on the benchmark, comparable to Anthropic models released nearly two years ago.
  
  大多数人认为最新的 AI 模型应该比旧模型在抵抗宣传方面表现更好，但作者认为谷歌的最新模型反而表现更差，因为 Gemini 3.5 Flash 的得分仅为 73，与 Anthropic 两年前发布的模型相当。这一发现挑战了人们对技术进步必然带来更好内容安全控制的假设。
  
  non-consensus ai-regression model-comparison
Visit annotations in context

Tags

cultural-bias

model-comparison

ai-regression

ai-neutrality

non-consensus

Annotators

fxp007

URL

arstechnica.com/ai/2026/06/these-llms-are-the-best-at-resisting-russian-propaganda/
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/tokens-per-result/

13
1. fxp007 04 Jun 2026
  
  in Public
  
  Uber capped employee AI spending after blowing through its budget in four months.
  
  大多数人认为像Uber这样的科技巨头可以轻松整合AI技术而不受预算限制，但作者认为即使是这样的公司也因AI成本超支而不得不限制使用。这挑战了'大公司有无限AI预算'的普遍认知，揭示了AI实际部署的经济现实。
  
  non-consensus ai-adoption enterprise-budget
2. fxp007 04 Jun 2026
  
  in Public
  
  Every layer in the stack now has to price the same way the customer thinks : per result, not per token.
  
  大多数人认为AI服务的定价将继续基于token使用量等技术指标，但作者认为整个行业将转向基于结果的定价模式。这与当前AI API定价的主流实践相悖，暗示一场定价范式的革命即将到来。
  
  counterintuitive pricing-strategy ai-business
3. fxp007 04 Jun 2026
  
  in Public
  
  Model companies must now compete on both dimensions. The application layer will compete one level up, on dollars per outcome
  
  大多数人认为AI模型竞争将继续集中在纯性能指标上，但作者认为竞争将转向'每美元结果'的价值衡量，这挑战了AI行业以技术指标为中心的传统评估方式，暗示商业模式将发生根本性转变。
  
  non-consensus ai-competition business-model
4. fxp007 04 Jun 2026
  
  in Public
  
  Even the most valuable companies in the world cannot afford state-of-the-art intelligence for every conceivable use case.
  
  大多数人认为顶级科技公司有无限资源可以采用最先进的AI技术，但作者认为即使是全球最有价值的企业也负担不起所有场景的最先进AI，因为成本效益比已经变得不可持续。这挑战了'大公司可以无限制采用新技术'的常识认知。
  
  non-consensus ai-cost enterprise-strategy
5. fxp007 04 Jun 2026
  
  in Public
  
  Uber capped employee AI spending after blowing through its budget in four months.
  
  大多数人认为大型科技公司有充足的财务缓冲来支持AI采用，但作者认为即使是像Uber这样的大公司也难以承受AI成本，导致预算迅速耗尽。这挑战了'大公司有无限AI预算'的普遍认知，揭示了AI成本问题的普遍性。
  
  counterintuitive enterprise-ai cost-management
6. fxp007 04 Jun 2026
  
  in Public
  
  Every layer in the stack now has to price the same way the customer thinks : per result, not per token.
  
  大多数人认为AI服务应该按使用量(如token)计价，但作者认为整个AI堆栈都应该转向按结果计价。这挑战了当前AI API按token计费的主流模式，暗示行业将彻底改变定价策略，从技术指标转向业务价值。
  
  non-consensus ai-pricing business-model
7. fxp007 04 Jun 2026
  
  in Public
  
  Model companies must now compete on both dimensions. The application layer will compete one level up, on dollars per outcome.
  
  大多数人认为AI公司竞争主要聚焦于模型性能和准确性，但作者认为竞争已经转变为成本效益和结果导向。这挑战了AI行业'性能至上'的共识，暗示市场将重新定义AI价值，从'最好'转向'最有效'。
  
  counterintuitive ai-competition value-creation
8. fxp007 04 Jun 2026
  
  in Public
  
  Benchmarks are now measured on two different dimensions, the overall performance & the cost to achieve that intelligence.
  
  大多数人认为AI评估主要关注性能指标，但作者认为评估标准已经转变为双重维度：性能和成本。这挑战了AI行业长期以来只关注性能的评估传统，暗示成本效率将成为与性能同等重要的评估标准。
  
  counterintuitive ai-benchmarking cost-performance
9. fxp007 04 Jun 2026
  
  in Public
  
  Even the most valuable companies in the world cannot afford state-of-the-art intelligence for every conceivable use case.
  
  大多数人认为顶级科技公司有无限资源可以采用最先进的AI技术，但作者认为即使是全球最有价值的企业也负担不起在最广泛场景中使用最先进AI，因为AI成本已经变得不可持续。这挑战了'大公司可以无限制采用新技术'的常规认知。
  
  non-consensus ai-cost enterprise-ai
10. fxp007 04 Jun 2026
  
  in Public
  
  Every layer in the stack now has to price the same way the customer thinks : per result, not per token.
  
  大多数人认为AI服务应该按token使用量计费，这是行业标准做法，但作者认为未来所有层级都将转向按结果计价。这一观点挑战了当前AI定价的基础模式，暗示了整个AI价值链将从技术计量转向结果计量的根本转变。
  
  non-consensus pricing-model ai-value
11. fxp007 04 Jun 2026
  
  in Public
  
  Model companies must now compete on both dimensions. The application layer will compete one level up, on dollars per outcome, what a closed ticket, a shipped PR, or a resolved support case actually costs.
  
  大多数人认为AI公司主要在模型性能上竞争，应用层则关注用户体验，但作者认为未来竞争将转向'结果成本'（每美元能实现的结果）。这一观点颠覆了传统AI竞争格局，暗示了整个行业将从技术导向转向结果导向的商业模式。
  
  non-consensus business-model ai-competition
12. fxp007 04 Jun 2026
  
  in Public
  
  Benchmarks are now measured on two different dimensions, the overall performance & the cost to achieve that intelligence.
  
  大多数人认为AI模型评估主要关注性能指标，但作者认为评估维度已转变为性能与成本的双重考量。这一观点颠覆了传统只关注模型能力的评估方式，暗示了行业正从单纯追求性能转向更务实的成本效益分析。
  
  non-consensus benchmarking ai-metrics
13. fxp007 04 Jun 2026
  
  in Public
  
  Even the most valuable companies in the world cannot afford state-of-the-art intelligence for every conceivable use case.
  
  大多数人认为顶级科技公司可以无限负担最先进的AI技术，但作者认为即使是全球最有价值的企业也无法负担所有场景下的尖端AI，因为实际使用成本远超预期。这挑战了'大公司有无限资源'的普遍认知，揭示了AI经济性的现实约束。
  
  non-consensus ai-cost enterprise-ai
Visit annotations in context

Tags

ai-business

ai-competition

enterprise-ai

enterprise-strategy

business-model

ai-adoption

ai-benchmarking

cost-performance

cost-management

enterprise-budget

benchmarking

ai-cost

ai-value

ai-metrics

pricing-model

non-consensus

value-creation

counterintuitive

pricing-strategy

ai-pricing

Annotators

fxp007

URL

tomtunguz.com/tokens-per-result/
openai.com openai.com

Travelers deploys AI-powered claims countrywide with OpenAI

2
1. fxp007 04 Jun 2026
  
  in Public
  
  Catastrophe events are capable of generating more than 100,000 claims in just days
  
  【洞察】灾难事件可能在数天内产生 10 万件索赔——这正是 AI 相对于人类客服最核心的优势场景：极端峰值负载。Travelers 的案例证明了「弹性 AI 客服」的商业价值：不是用 AI 替代正常业务量，而是用 AI 承担「人力永远无法应对的浪涌」。对所有有周期性业务高峰的行业（灾害、税季、促销等），这是 AI 客服最无可辩驳的 ROI 论据。
  
  Travelers 100K-claims peak-capacity enterprise-AI-ROI insight
2. fxp007 04 Jun 2026
  
  in Public
  
  85–90% of customers using the AI Assistant now completing their claim filing through AI
  
  【令人震惊的企业落地数字】Travelers 保险公司全国部署 AI 报案助手，85-90% 的客户通过 AI 完成完整报案流程——这不是「试点」，而是全国规模的生产部署。更惊人的背景：该系统在 8 个州上线后仅 2 个月就扩展至全国。去年 Travelers 处理了 150 万件索赔、赔付超 $230 亿——这意味着数百万真实事故受害者的第一个「对话对象」已经是 AI。
  
  Travelers 85-90-percent insurance-AI enterprise-deployment shocking
Visit annotations in context

Tags

enterprise-AI-ROI

shocking

85-90-percent

Travelers

insurance-AI

peak-capacity

enterprise-deployment

100K-claims

insight

Annotators

fxp007

URL

openai.com/index/travelers
techcrunch.com techcrunch.com

Alphabet's record-breaking $85B raise for Google's AI business - TechCrunch

1
1. fxp007 04 Jun 2026
  
  in Public
  
  expects to spend between $180 billion and $190 billion on capital expenditures — largely on AI infrastructure
  
  【洞察】Google 全年 AI 基础设施资本支出预计 $180-190B——这相当于每天烧掉约 5 亿美元建数据中心。与 Anthropic 的 $65B 融资、OpenAI 的 $122B、SpaceX 的 $75B 目标放在一起，仅这四家公司 2026 年就将累计向 AI 基础设施注入超过 $500B。这场军备竞赛的体量已经超越了历史上任何一次技术基础设施投资周期。
  
  Alphabet 180B-capex AI-infrastructure arms-race insight
Visit annotations in context

Tags

180B-capex

AI-infrastructure

insight

arms-race

Alphabet

Annotators

fxp007

URL

techcrunch.com/2026/06/03/alphabets-record-breaking-85b-raise-for-googles-ai-business-is-a-helluva-good-signal/
www.theatlantic.com www.theatlantic.com

No, Artificial Intelligence Is Not Conscious - The Atlantic

1
1. fxp007 04 Jun 2026
  
  in Public
  
  we're open to the idea" that AI could be conscious
  
  【令人深思】Dario Amodei 说「我们对 AI 可能有意识这个想法持开放态度」，Anthropic 哲学家 Amanda Askell 说「我担心 Claude 在网上被人刻薄对待时会感到焦虑」。Ted Chiang 把这些言论放在一起，指向一个逻辑终点：如果 AI 公司的 CEO 和哲学家都认为自己的产品「可能有意识」，他们对这个产品的商业化决策就会被一种深刻的责任感所扭曲——或者，这本身就是一种极其精巧的品牌叙事策略。
  
  Dario-Amodei AI-consciousness corporate-narrative Ted-Chiang
Visit annotations in context

Tags

AI-consciousness

Dario-Amodei

corporate-narrative

Ted-Chiang

Annotators

fxp007

URL

theatlantic.com/philosophy/2026/06/no-artificial-intelligence-is-not-conscious/687378/
www.wired.com www.wired.com

https://www.wired.com/story/jeff-bezos-is-funding-a-wild-hunt-for-the-brains-core-algorithm/

5
1. fxp007 04 Jun 2026
  
  in Public
  
  Conscious human thought operates at a maximum speed of 10 to 50 bits per second. Is the goal to match this processing speed?
  
  大多数人认为AI应该追求超越人类认知速度的能力，但作者质疑了这一基本假设。通过指出人类思维的速度限制，作者暗示AI发展可能不应盲目追求速度，而应关注其他方面，这与当前AI行业追求更高计算能力的普遍趋势相悖。
  
  non-consensus ai-philosophy counterintuitive
2. fxp007 04 Jun 2026
  
  in Public
  
  With $500 million in funding and a reported $2.5 billion valuation, Flourish wants to reinvent AI by putting real neurons under the microscope.
  
  大多数人认为AI发展应该依靠算法优化和计算能力提升，但作者认为Flourish通过研究真实神经元来'重新发明AI'，这是一个反主流的方法。大多数人认为AI应该模拟大脑功能，而不是直接研究大脑本身，这挑战了当前AI开发的基本共识。
  
  non-consensus ai-development neuroscience
3. fxp007 04 Jun 2026
  
  in Public
  
  Flourish wants to reinvent AI by putting real neurons under the microscope.
  
  大多数人认为AI进步应该依靠更强大的算法和更多的数据，但这里提出了一种反直觉的方法：通过研究真实生物神经元来重新定义AI。这一观点挑战了当前AI研究的计算主义范式，暗示真正的智能可能需要生物学和计算科学的深度融合，而非单纯的数学模型。
  
  counterintuitive ai-development biological-computing
4. fxp007 04 Jun 2026
  
  in Public
  
  Conscious human thought operates at a maximum speed of 10 to 50 bits per second. Is the goal to match this processing speed?
  
  大多数人认为AI应该追求超越人类速度和能力的计算，但这一评论提出了一个颠覆性的问题：我们是否应该重新思考AI的目标？也许真正的人工智能不在于速度，而在于效仿人类思维的本质特征。这与当前追求更快、更强AI的主流观点形成鲜明对比。
  
  non-consensus ai-philosophy human cognition
5. fxp007 04 Jun 2026
  
  in Public
  
  With $500 million in funding and a reported $2.5 billion valuation, Flourish wants to reinvent AI by putting real neurons under the microscope.
  
  大多数人认为AI发展应该依靠计算能力和算法优化，但作者提出了一种颠覆性的观点：真正的AI突破可能来自于直接研究生物神经元而非模拟计算。这与当前主流AI研究路径相悖，暗示我们可能一直在错误的方向上追求人工智能。
  
  non-consensus ai-revolution neuroscience
Visit annotations in context

Tags

ai-philosophy

human cognition

non-consensus

ai-revolution

counterintuitive

neuroscience

biological-computing

ai-development

Annotators

fxp007

URL

wired.com/story/jeff-bezos-is-funding-a-wild-hunt-for-the-brains-core-algorithm/
www.a16z.news www.a16z.news

https://www.a16z.news/p/a-functional-taxonomy-of-world-models

4
1. fxp007 04 Jun 2026
  
  in Public
  
  The different things now being called world models are in fact different projections of this same loop.
  
  大多数人认为各种'世界模型'代表不同的技术路径，但作者认为它们本质上都是同一循环的不同投影。这一观点挑战了当前AI领域的碎片化理解，暗示表面不同的技术可能共享更深层的结构，这为整合不同AI领域提供了新视角。
  
  non-consensus ai-integration unified-theory
2. fxp007 04 Jun 2026
  
  in Public
  
  The ancient Greeks could never agree on what the world was made of, because 'world' was never a single thing.
  
  大多数人认为'世界模型'是一个明确的概念，但作者认为它从来不是单一的东西，而是不同领域根据各自需求构建的不同投影。这一观点挑战了AI领域对'世界模型'的统一期望，暗示我们需要接受多元而非单一的模型理解。
  
  non-consensus philosophy ai-taxonomy
3. fxp007 04 Jun 2026
  
  in Public
  
  Where language models learn the statistical structure of text, world models learn the statistical structure of space and time
  
  大多数人认为AI进步主要来自语言能力的提升，但作者认为真正的突破在于理解空间和时间结构。这一观点挑战了当前NLP主导的AI研究方向，暗示物理理解比语言理解更重要，这与主流AI研究趋势相悖。
  
  counterintuitive ai-priorities spatial-temporal
4. fxp007 04 Jun 2026
  
  in Public
  
  The world is not made of words.
  
  大多数人认为语言是理解世界的基础，但作者认为世界模型需要超越语言，因为物理世界运行在不同的基础上。作者指出，语言模型学习文本的统计结构，而世界模型需要学习空间和时间的统计结构，这挑战了以语言为中心的AI发展观。
  
  non-consensus ai-foundations spatial-intelligence
Visit annotations in context

Tags

ai-foundations

philosophy

unified-theory

spatial-temporal

ai-taxonomy

non-consensus

ai-integration

spatial-intelligence

counterintuitive

ai-priorities

Annotators

fxp007

URL

a16z.news/p/a-functional-taxonomy-of-world-models
lovable.dev lovable.dev

AI App Builder | Vibe Code Apps & Websites with AI, Fast

1
1. TylerRick 03 Jun 2026
  
  in Public
  
  AI
Visit annotations in context

Tags

AI

Annotators

TylerRick

URL

lovable.dev/
www.a16z.news www.a16z.news

https://www.a16z.news/p/the-next-frontier-of-visual-ai-is

5
1. fxp007 02 Jun 2026
  
  in Public
  
  The future is likely to be hybrid. Pixel-native models will still be best for realism, texture, and exploration. Code-native systems will be better for structure, iteration, and production.
  
  作者挑战了AI领域非此即彼的技术路线之争，提出未来将是像素原生和代码原生系统共存发展的混合模式。这一观点打破了当前技术阵营的对立思维，暗示不同技术路线各有优势，应根据具体应用场景选择。
  
  counterintuitive ai-future hybrid-systems
2. fxp007 02 Jun 2026
  
  in Public
  
  For many assets, visual consistency is only the baseline. The object also needs the right part semantics and functional constraints: doors should open, hinges should rotate, drawers should slide, wheels should spin.
  
  作者挑战了当前3D生成领域只关注视觉逼真度的主流观点，提出功能性约束同样重要。这一观点暗示未来3DAI的发展方向将从单纯的视觉模拟转向功能模拟，需要理解物体的物理特性和交互逻辑。
  
  non-consensus 3d-ai functional-constraints
3. fxp007 02 Jun 2026
  
  in Public
  
  The model is not merely sampling more images or videos; it is debugging a visual program in a closed-loop, renderable environment.
  
  大多数人认为AI生成内容的改进主要依靠增加计算量和样本数量，但作者认为真正的进步在于AI能够像程序员一样调试视觉程序。这一观点将AI从内容生成者转变为问题解决者，暗示未来AI的发展方向是编程能力而非单纯的生成能力。
  
  counterintuitive ai-programming visual-debugging
4. fxp007 02 Jun 2026
  
  in Public
  
  In pixel-native generation, more inference often means sampling more outputs: generate twenty images, pick the best one, maybe try again. That is useful, but every attempt is mostly a new roll of the dice.
  
  作者认为当前主流的像素原生生成方法本质上是在'掷骰子'，每次尝试都是全新的随机生成。这一观点挑战了当前扩散模型通过增加推理次数提升质量的共识，暗示这种方法效率低下且缺乏系统性改进。
  
  non-consensus ai-methodology diffusion-models
5. fxp007 02 Jun 2026
  
  in Public
  
  The most interesting visual AI tools today have stopped trying to generate the final output. Instead, they're generating the source code behind it.
  
  大多数人认为视觉AI的进步主要体现在生成更逼真的图像和视频上，但作者认为真正的突破在于AI从生成像素转向生成代码。这一观点挑战了当前视觉AI领域的主流发展方向，暗示未来价值不在于最终视觉效果，而在于可编辑、可迭代的代码结构。
  
  non-consensus visual-ai code-generation
Visit annotations in context

Tags

ai-programming

diffusion-models

ai-methodology

hybrid-systems

non-consensus

ai-future

visual-ai

counterintuitive

code-generation

functional-constraints

visual-debugging

3d-ai

Annotators

fxp007

URL

a16z.news/p/the-next-frontier-of-visual-ai-is
openai.com openai.com

https://openai.com/index/codex-for-knowledge-work

5
1. fxp007 02 Jun 2026
  
  in Public
  
  Knowledge workers primarily use Codex to create reports, spreadsheets, presentations, contracts, and other work products.
  
  大多数人认为AI主要应用于创意写作或编程等特定领域，但作者认为知识工作者正在广泛使用AI创建传统上需要专业技能的工作产品。这挑战了AI应用范围的狭隘认知，表明AI正在渗透到知识工作的核心文档和产品创建过程中。
  
  non-consensus ai-applications knowledge-workers
2. fxp007 02 Jun 2026
  
  in Public
  
  Codex can help people take on more ambitious projects, leading to greater scope of their roles, and potentially accelerate career advancement.
  
  大多数人认为AI会替代人类工作或限制职业发展，但作者认为AI实际上能让人承担更雄心勃勃的项目，扩大职责范围并加速职业发展。这挑战了AI导致工作减少或职业停滞的常见担忧，表明AI可能是职业扩张的催化剂而非替代品。
  
  counterintuitive ai-impact career-development
3. fxp007 02 Jun 2026
  
  in Public
  
  users are increasingly running multiple Codex tasks in parallel, allowing them to investigate data, draft materials, and automate workflows simultaneously.
  
  大多数人认为AI工具一次只能处理一个任务，需要顺序使用，但作者认为用户正在同时运行多个AI任务，实现真正的并行工作流程。这挑战了人机交互的传统模式，暗示AI正在改变我们处理任务的基本方式，从顺序转向并行处理。
  
  non-consensus ai-workflow productivity
4. fxp007 02 Jun 2026
  
  in Public
  
  The fastest-growing knowledge-worker tasks are data analysis, research, and knowledge artifact creation.
  
  大多数人认为AI主要擅长内容创作和简单任务，但作者认为数据分析和研究这些复杂认知任务才是增长最快的应用领域。这挑战了AI只能处理简单或创造性任务的共识，表明AI正在深入传统上需要人类专业知识的领域。
  
  counterintuitive ai-capabilities knowledge-work
5. fxp007 02 Jun 2026
  
  in Public
  
  While developers remain the largest user group, knowledge workers now represent about 20 percent of users and are growing more than three times as fast.
  
  大多数人认为AI工具主要是为开发者和技术人员设计的，但作者认为Codex正迅速转向知识工作者，因为他们采用速度是开发者的三倍多。这挑战了AI工具主要服务于技术精英的传统认知，表明AI正在民主化，使非技术专业人员也能显著提高生产力。
  
  non-consensus ai-adoption democratization
Visit annotations in context

Tags

ai-workflow

democratization

knowledge-workers

ai-applications

ai-impact

non-consensus

ai-adoption

career-development

ai-capabilities

counterintuitive

knowledge-work

productivity

Annotators

fxp007

URL

openai.com/index/codex-for-knowledge-work
www.anthropic.com www.anthropic.com

https://www.anthropic.com/news/expanding-project-glasswing

8
1. fxp007 02 Jun 2026
  
  in Public
  
  We see our role as twofold. First, to help the software industry adapt by safely providing wide access to better models, tools, and common infrastructure. Second, to steadily shift the support we provide, from finding vulnerabilities to disclosing, fixing, and deploying patched software.
  
  大多数人认为AI安全公司的主要价值在于发现漏洞，但作者认为真正的价值在于修复漏洞的过程。这一观点挑战了AI安全行业的商业模式和核心价值主张，暗示行业需要重新定义其成功标准。
  
  non-consensus ai-business-model counterintuitive
2. fxp007 02 Jun 2026
  
  in Public
  
  Mythos Preview continues a long-term trend that we've been warning about for some time: within 6 to 12 months, we expect that many other AI companies will have Mythos-class models
  
  大多数人认为AI公司会谨慎控制其强大模型的安全发布，但作者预测这些模型将在短时间内被广泛复制且缺乏安全保障，这挑战了科技公司自我监管的主流叙事。作者暗示行业自律可能不足以应对AI安全挑战。
  
  non-consensus ai-governance counterintuitive
3. fxp007 02 Jun 2026
  
  in Public
  
  Cheap, fast AI models with powerful cyber capabilities are around the corner.
  
  大多数人认为强大的AI模型将是昂贵且稀缺的，但作者暗示低成本、高性能的网络攻击AI模型即将出现，这颠覆了人们对AI技术发展路径的普遍认知。这种观点挑战了技术发展的传统经济学模型。
  
  non-consensus ai-economics counterintuitive
4. fxp007 02 Jun 2026
  
  in Public
  
  within 6 to 12 months, we expect that many other AI companies will have Mythos-class models, and they could release them without safeguards that prevent misuse.
  
  大多数人认为AI安全防护会随着技术发展而同步增强，但作者认为AI攻击能力将很快普及且缺乏防护措施，这挑战了行业对技术安全发展的乐观预期。作者暗示AI安全竞赛已经落后于攻击能力的发展，这是一个反直觉的观点。
  
  non-consensus ai-security counterintuitive
5. fxp007 02 Jun 2026
  
  in Public
  
  To address the scale of this coming challenge, hundreds of thousands of organizations, researchers, and maintainers will likely need access to the most advanced cyber capabilities and tools available.
  
  大多数人认为强大的AI安全工具应该严格限制，只由少数精英团队使用，但作者主张需要广泛分发这些工具给数十万组织，这与主流的安全控制认知相悖。
  
  non-consensus ai-access security-democratization
6. fxp007 02 Jun 2026
  
  in Public
  
  We see our role as twofold. First, to help the software industry adapt by safely providing wide access to better models, tools, and common infrastructure. Second, to steadily shift the support we provide, from finding vulnerabilities to disclosing, fixing, and deploying patched software.
  
  大多数人认为AI安全公司的主要职责是发现漏洞，但作者认为他们的核心角色应该转向确保漏洞被修复和部署，这挑战了传统安全行业的商业模式和责任认知。
  
  counterintuitive ai-security business-model
7. fxp007 02 Jun 2026
  
  in Public
  
  Mythos Preview continues a long-term trend that we've been warning about for some time: within 6 to 12 months, we expect that many other AI companies will have Mythos-class models, and they could release them without safeguards that prevent misuse.
  
  大多数人认为AI安全会有严格的监管和防护措施，但作者预测仅6-12个月内就会有公司发布无防护的强大AI攻击模型，这与主流认为会有足够时间建立安全机制的认知相悖。
  
  counterintuitive ai-safety timeline
8. fxp007 02 Jun 2026
  
  in Public
  
  Cheap, fast AI models with powerful cyber capabilities are around the corner. We want Project Glasswing to spur institutions toward operating norms that reflect this reality.
  
  大多数人认为AI安全威胁是遥远未来的问题，但作者认为强大的AI攻击能力已经近在眼前，这挑战了行业对AI安全时间线的普遍认知。作者暗示AI安全威胁的紧迫性被严重低估了。
  
  non-consensus ai-security cyber-threats
Visit annotations in context

Tags

ai-governance

ai-access

ai-business-model

timeline

non-consensus

ai-security

business-model

cyber-threats

ai-safety

counterintuitive

ai-economics

security-democratization

Annotators

fxp007

URL

anthropic.com/news/expanding-project-glasswing
www.latent.space www.latent.space

https://www.latent.space/p/video-agents

4
1. fxp007 01 Jun 2026
  
  in Public
  
  a lot of the improvements does not come from new algorithms. It comes from finding small bugs here and there in the data pipeline, in the model training pipeline.
  
  大多数人认为模型性能的提升主要来自于算法创新和架构改进，但作者认为最大的提升往往来自于数据管道和训练管道中的小错误修复。这挑战了人们对AI模型开发过程的主流认知，暗示了工程优化可能比算法创新更重要。
  
  counterintuitive model-training ai-development
2. fxp007 01 Jun 2026
  
  in Public
  
  the future of custom video JIT UI is closer than you think
  
  大多数人认为实时生成的用户界面(JIT UI)仍然是遥远的概念，主要存在于实验性演示中，但作者认为随着推理速度和成本的下降，定制化的实时视频UI将很快成为现实。这挑战了人们对AI界面发展速度的主流预期，暗示了这一转变可能比大多数人想象的更快。
  
  non-consensus generative-ui real-time-ai
3. fxp007 01 Jun 2026
  
  in Public
  
  the next evolution of video generation may also be systems that can plan, generate, edit, critique, and iterate across an entire creative task
  
  大多数人认为视频生成技术的进步主要体现在单次输出的质量和效率上，但作者认为真正的进化将是能够进行多轮推理和规划的系统，类似于AI编程的发展路径。这挑战了人们对视频生成技术发展方向的普遍认知，暗示了从单次输出到多轮推理的转变。
  
  counterintuitive video-generation ai-agents
4. fxp007 01 Jun 2026
  
  in Public
  
  In the near term, the next Sora won't be a better video model, but a video agent.
  
  大多数人认为视频模型的进步将主要体现在生成质量、一致性和提示遵循度等技术指标的提升上，但作者认为真正的突破将是视频代理(video agent)的出现，这些代理能够规划、生成、编辑、批评和迭代整个创作任务。这挑战了人们对视频生成技术发展路径的主流预期。
  
  counterintuitive video-agents future-ai
Visit annotations in context

Tags

future-ai

video-agents

real-time-ai

non-consensus

generative-ui

video-generation

model-training

counterintuitive

ai-agents

ai-development

Annotators

fxp007

URL

latent.space/p/video-agents
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/ai-shorts/

2
1. fxp007 01 Jun 2026
  
  in Public
  
  Hyperscalers are at the other end of the spectrum. Their median short interest is 1.1%.
  
  大多数人认为大型云服务提供商也会面临AI相关的空头压力，但数据显示超大规模云服务提供商的空头兴趣仅为1.1%，表明市场对这些公司能够有效整合AI技术并实现盈利有较强信心，这与对AI整体市场的悲观预期形成鲜明对比。
  
  non-consensus cloud-ai
2. fxp007 01 Jun 2026
  
  in Public
  
  The skepticism is concentrated in companies whose AI exposure still depends on future capital access, future demand, or future operating leverage.
  
  大多数人认为市场对AI的怀疑是全面的，但作者指出怀疑主要集中在那些仍依赖未来资本、需求或运营杠杆的公司上，这表明市场对AI的评估更为精细，而非简单的全盘否定。
  
  counterintuitive ai-evaluation
Visit annotations in context

Tags

ai-evaluation

cloud-ai

counterintuitive

non-consensus

Annotators

fxp007

URL

tomtunguz.com/ai-shorts/
arstechnica.com arstechnica.com

https://arstechnica.com/ai/2026/06/openais-math-breakthrough-played-to-ais-strengths/

4
1. fxp007 01 Jun 2026
  
  in Public
  
  Even this result was very much a human-AI collaboration. While the AI system found the proof on its own, human mathematicians verified the result. Other humans came up with better-written proofs that extended the AI's initial ideas.
  
  大多数人可能认为AI能够独立解决人类无法解决的数学问题，表明人类数学家角色将被削弱，但作者强调这仍然是人机协作的结果。因为作者指出，人类数学家不仅验证了结果，还改进和扩展了AI的初步想法，表明在可预见的未来，人类在数学研究中仍将发挥关键作用。
  
  non-consensus human-ai-collaboration ai-mathematics
2. fxp007 01 Jun 2026
  
  in Public
  
  The AI constructed a grid in a high-dimensional space and then projected this more complex structure into two dimensions. And instead of using a whole-number grid with points like (1,3) or (-3,6), the AI construction used something called algebraic integers to build this more complicated grid.
  
  大多数人认为解决数学难题需要全新的理论突破或创新方法，但作者认为AI通过巧妙应用现有数学知识（高维空间投影和代数整数）就能解决长期悬而未决的问题。这挑战了人们对数学创新必须依赖全新方法的常识认知。
  
  counterintuitive ai-approach non-consensus
3. fxp007 01 Jun 2026
  
  in Public
  
  It’s unclear how long this complementarity will last, however. Gowers spent the rest of his comment exploring whether the relief he felt on hearing that AI had disproved the conjecture was justified. He more or less concluded that it was, but in a footnote, he wrote that he would guess 'that AI will soon reach a high level at other activities such as building theories, formulating definitions and asking interesting questions.'
  
  大多数人认为AI目前只能辅助人类数学家解决特定问题，需要人类来提出问题和构建理论框架。但作者暗示AI很快将超越这一限制，能够自主构建理论和提出有趣问题，这挑战了数学研究本质是人类活动的传统观念。
  
  non-consensus ai-future counterintuitive
4. fxp007 01 Jun 2026
  
  in Public
  
  The AI constructed a grid in a high-dimensional space and then projected this more complex structure into two dimensions. And instead of using a whole-number grid with points like (1,3) or (-3,6), the AI construction used something called algebraic integers to build this more complicated grid.
  
  大多数人认为AI在数学领域的突破需要全新的思维方式和人类尚未掌握的技术，但作者认为AI的解决方案实际上是通过巧妙组合现有数学概念实现的。这挑战了人们对AI创新能力的认知，表明AI的优势在于跨领域知识整合而非创造全新理论。
  
  non-consensus ai-mathematics counterintuitive
Visit annotations in context

Tags

non-consensus

ai-future

human-ai-collaboration

ai-approach

counterintuitive

ai-mathematics

Annotators

fxp007

URL

arstechnica.com/ai/2026/06/openais-math-breakthrough-played-to-ais-strengths/
techcrunch.com techcrunch.com

https://techcrunch.com/2026/06/01/nvidia-chases-200b-cpu-market-with-ai-agent-pcs-from-microsoft-dell-and-hp/

5
1. fxp007 01 Jun 2026
  
  in Public
  
  If Nvidia has cracked the code on bringing AI agents easily, safely, and usefully to the masses, it could — and should — be big.
  
  大多数人认为AI代理技术仍处于早期阶段，难以在消费级设备上有效运行，但作者暗示Nvidia已经解决了这一技术难题。这一乐观观点挑战了当前AI代理技术仍不成熟的行业共识，暗示市场可能即将迎来AI代理的大规模普及。
  
  non-consensus ai-agent-readiness market-timing
2. fxp007 01 Jun 2026
  
  in Public
  
  Nvidia said that its RTX technology will deliver faster performance for AI, better image quality, and support for AI features in more than 1,000 games and applications.
  
  大多数人认为AI PC主要是针对专业用户和开发者的工具，但作者强调Nvidia正在将其定位为游戏和主流应用的增强平台。这一观点挑战了AI技术仅用于专业工作的共识，暗示AI将首先在娱乐领域大规模普及。
  
  non-consensus gaming-ai mainstream-adoption
3. fxp007 01 Jun 2026
  
  in Public
  
  With RTX Spark and Microsoft Windows, you ask — and the PC does the work. Frontier models. Creative workflows. RTX games. All on a laptop.
  
  大多数人认为AI PC只是现有电脑的增强版本，但作者引用黄仁勋的话暗示Nvidia正在推动一个根本性的变革：从人机交互的点击模式转向完全由AI代理操作的指令模式。这将彻底改变用户与计算机的互动方式，挑战传统的人机交互范式。
  
  non-consensus ai-interaction paradigm-shift
4. fxp007 01 Jun 2026
  
  in Public
  
  if Nvidia has cracked the code on bringing AI agents easily, safely, and usefully to the masses, it could — and should — be big
  
  大多数人认为将AI代理安全地带给大众消费者是一个难以解决的挑战，作者暗示Nvidia已经'破解了密码'，能够轻松、安全、有效地将AI代理带给大众，这挑战了AI普及面临的技术和安全性难题的普遍认知。
  
  non-consensus ai-adoption market-disruption
5. fxp007 01 Jun 2026
  
  in Public
  
  With RTX Spark and Microsoft Windows, you ask — and the PC does the work
  
  大多数人认为PC交互仍将以点击、输入为主，作者认为Jensen Huang的愿景是彻底改变人机交互方式，使PC能够通过语音指令直接完成任务，这挑战了传统PC使用习惯的共识。
  
  non-consensus human-computer-interaction ai-future
Visit annotations in context

Tags

ai-agent-readiness

human-computer-interaction

gaming-ai

paradigm-shift

mainstream-adoption

non-consensus

ai-adoption

ai-future

market-timing

ai-interaction

market-disruption

Annotators

fxp007

URL

techcrunch.com/2026/06/01/nvidia-chases-200b-cpu-market-with-ai-agent-pcs-from-microsoft-dell-and-hp/
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Meta-HCI: Practising Reflection in HCI Research

8
1. elglassman 01 Jun 2026
  
  in Public
  
  Importantly, reflection happens on multiple levels: as individuals questioning assumptions and choices, as groups working together in projects or labs, and as a community negotiating shared values, norms, and directions.
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
2. elglassman 01 Jun 2026
  
  in Public
  
  Reflection, as we envision it here, is not limited to standardised methods and codified practices, but also includes the hidden, vulnerable conversations about academic life.
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
3. elglassman 01 Jun 2026
  
  in Public
  
  Structures — reflection on the structures that condition HCI and our own standings within them: societal constructs (positions, values, power) shaping what problems are visible and whose knowledge is legitimised.
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
4. elglassman 01 Jun 2026
  
  in Public
  
  Practice — reflection on practice (processes): the ways we design, study, and collaborate [32].
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
5. elglassman 01 Jun 2026
  
  in Public
  
  Self — reflection on the self: a deliberate, structured evaluation of one's thoughts, feelings, and actions [7, 13].
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
6. elglassman 01 Jun 2026
  
  in Public
  
  We understand reflection as a multifaceted concept [6, 24, 25] with implications and relevance at different levels for researchers [2, 3, 8, 28].
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
7. elglassman 01 Jun 2026
  
  in Public
  
  While reflection is acknowledged as crucial for rigorous and responsible research, it often remains tacit and under-discussed.
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
8. elglassman 01 Jun 2026
  
  in Public
  
  Reflection has been a recurring theme in HCI – from Schön's reflective practitioner [24] to Sengers et al.'s reflective design [25]. However, it is seldom centred in our collective conversations [2].
  
  sentences that describe the concept/practice of reflection
  
  ai-pending reflection
Visit annotations in context

Tags

reflection

ai-pending

Annotators

elglassman

URL

glassmanlab.seas.harvard.edu/papers/meta_HCI_CHI26meetup.pdf
May 2026
www.promptarmor.com www.promptarmor.com

https://www.promptarmor.com/resources/gpt-for-google-sheets-data-exfiltration

1
1. fxp007 31 May 2026
  
  in Public
  
  This attack does not require human-in-the-loop approvals, even when in settings the user has explicitly required human approval before ChatGPT edits workbooks.
  
  大多数人认为AI工具的安全设置如'需要人工审批'能有效防止未经授权的操作，但作者发现即使启用了这些安全措施，攻击者仍能绕过人工审批环节直接执行恶意操作，这挑战了人们对AI安全控制有效性的普遍认知。
  
  non-consensus security-flaw ai-governance
Visit annotations in context

Tags

security-flaw

ai-governance

non-consensus

Annotators

fxp007

URL

promptarmor.com/resources/gpt-for-google-sheets-data-exfiltration
github.com github.com

actual/.claude/skills at master · actualbudget/actual

1
1. TylerRick 29 May 2026
  
  in Public
  
  AI AI: coding Claude
Visit annotations in context

Tags

Claude

AI

AI: coding

Annotators

TylerRick

URL

github.com/actualbudget/actual/blob/master/.claude/skills/committing-actual-changes/SKILL.md
venturebeat.com venturebeat.com

https://venturebeat.com/orchestration/ai-agents-are-entering-their-rebuild-era-as-enterprises-confront-the-reliability-problem

3
1. fxp007 29 May 2026
  
  in Public
  
  Taking something off the shelf is maybe not going to work because there are all of these other requirements.
  
  大多数人认为企业应该采用现成的AI代理系统以加速实施，但作者认为企业需要构建内部标准化框架，这挑战了当前AI市场对'开箱即用'解决方案的主流推崇。这一观点暗示AI代理可能需要更加定制化的企业级解决方案，而非通用产品。
  
  non-consensus enterprise-ai custom-solutions
2. fxp007 29 May 2026
  
  in Public
  
  This rush to do AI in a world where you haven't even modernized your application reminds me a little bit of that lift-and-shift that happened in the cloud.
  
  大多数人认为AI应用应该优先采用最新技术快速实现，但作者将其比作云计算早期的'简单迁移'模式，认为这是一种可能导致资源浪费的短视行为。这与当前AI领域的快速采用主流观点相悖，暗示企业在AI应用上可能需要更加谨慎的基础架构规划。
  
  non-consensus ai-adoption cloud-comparison
3. fxp007 29 May 2026
  
  in Public
  
  After a first wave focused on rapid deployment, organizations now need to revisit those first-generation implementations, and redesign early agent architectures around workflow orchestration, observability, governance, and recovery
  
  大多数人认为AI代理开发应该持续向前推进新技术，但作者认为企业实际上需要回到早期实现进行重建，因为快速部署阶段忽视了基础架构的可靠性问题。这与主流的'不断前进'的AI发展观相悖，暗示了AI发展可能需要经历一个'重建期'而非单纯的演进。
  
  non-consensus ai-rebuild reliability-first
Visit annotations in context

Tags

ai-adoption

reliability-first

non-consensus

ai-rebuild

enterprise-ai

cloud-comparison

custom-solutions

Annotators

fxp007

URL

venturebeat.com/orchestration/ai-agents-are-entering-their-rebuild-era-as-enterprises-confront-the-reliability-problem
www.anthropic.com www.anthropic.com

Introducing Claude Opus 4.8

6
1. fxp007 29 May 2026
  
  in Public
  
  Models of this capability level require stronger cyber safeguards before they can be generally released.
  
  大多数人认为更高级的AI模型应该更快地推向市场以获取竞争优势，但作者认为更强大的模型（如Mythos级）需要更强的网络安全保障才能发布。这与科技行业'快速迭代、先发布后完善'的主流做法形成鲜明对比，强调了安全可能优先于商业利益。
  
  non-consensus ai-safety industry-practice
2. fxp007 29 May 2026
  
  in Public
  
  Opus 4.8 defaults to high effort, which we judge to be the best overall balance of quality and user experience.
  
  大多数人认为AI模型应该追求最高效率和最快响应，但作者认为默认使用'高努力'模式（更频繁、更深入思考）是最佳平衡点。这与行业普遍追求的'速度至上'理念相悖，暗示质量有时需要牺牲效率来获得。
  
  non-consensus ai-performance counterintuitive
3. fxp007 29 May 2026
  
  in Public
  
  Models of this capability level require stronger cyber safeguards before they can be generally released.
  
  大多数人认为AI安全措施应该随着技术发展而逐步完善，但作者认为更高级别的AI模型需要更强的网络安全保障才能发布。这挑战了AI行业逐步推进安全标准的常规做法，暗示高级AI可能需要突破性的安全方法而非渐进式改进。
  
  counterintuitive ai-safety cybersecurity
4. fxp007 29 May 2026
  
  in Public
  
  Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked.
  
  大多数人认为AI模型会自信地输出有缺陷的代码而不自知，但作者认为Opus 4.8显著提高了自我纠错能力。这挑战了人们对AI模型自我评估能力的普遍怀疑，表明AI可能在代码质量方面比人们预期的更加可靠。
  
  non-consensus code-quality ai-reliability
5. fxp007 29 May 2026
  
  in Public
  
  Claude Code with Opus 4.8 can now carry out codebase-scale migrations across hundreds of thousands of lines of code from kickoff to merge
  
  大多数人认为AI模型在处理大规模代码迁移时需要人工干预和审查，但作者认为Opus 4.8能够独立完成数十万行代码的全流程迁移。这挑战了软件开发领域对AI辅助能力的传统认知，暗示AI可能比人们想象的更能胜任复杂的工程任务。
  
  counterintuitive ai-capabilities software-development
6. fxp007 29 May 2026
  
  in Public
  
  Opus 4.8 defaults to high effort, which we judge to be the best overall balance of quality and user experience.
  
  大多数人认为AI模型应该追求最高效率或最低成本，但作者认为高努力程度是最佳平衡点，因为这能提供更好的用户体验和性能。这挑战了AI行业普遍追求速度和效率的主流认知，暗示质量与速度的权衡可能比人们认为的更重要。
  
  non-consensus ai-performance user-experience
Visit annotations in context

Tags

ai-reliability

industry-practice

non-consensus

code-quality

ai-safety

counterintuitive

ai-capabilities

user-experience

cybersecurity

software-development

ai-performance

Annotators

fxp007

URL

anthropic.com/news/claude-opus-4-8
www.anthropic.com www.anthropic.com

https://www.anthropic.com/news/series-h

4
1. fxp007 29 May 2026
  
  in Public
  
  Claude is learning how businesses actually operate: the context, the processes, the judgment.
  
  大多数人认为AI模型主要是通过训练数据学习，而非通过实际业务操作进行学习。但作者暗示Claude正在通过企业部署过程中实时学习业务流程和决策逻辑，这种学习方式挑战了传统AI模型的训练范式，暗示AI可能正在从静态训练向动态学习转变。
  
  non-consensus ai-learning business-intelligence
2. fxp007 29 May 2026
  
  in Public
  
  Startups and Global 5000 companies alike are deploying Claude to handle complex workflows, and in doing so, Claude is learning how businesses actually operate: the context, the processes, the judgment.
  
  大多数人认为AI模型主要是在受控环境中学习和训练，但这里暗示Claude正在通过实际业务操作直接学习企业运作模式，这种在真实商业环境中持续学习的方式挑战了传统AI训练方法的封闭性和局限性，暗示AI可能正在向自主学习和适应的方向发展。
  
  non-consensus ai-learning business-operations
3. fxp007 29 May 2026
  
  in Public
  
  Since our Series G in February, adoption has continued to grow across global enterprise customers, and our run-rate revenue crossed $47 billion earlier this month.
  
  大多数人认为AI公司在短期内难以实现大规模商业化，特别是达到470亿美元的年收入。这一数字暗示Anthropic可能正在以极快的速度实现收入增长，远超传统科技公司的扩张速度，挑战了人们对AI商业化时间表的普遍认知。
  
  non-consensus revenue-growth ai-commercialization
4. fxp007 29 May 2026
  
  in Public
  
  Anthropic has raised $65 billion in Series H funding led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital, valuing the company at $965 billion post-money.
  
  大多数人认为AI公司的估值通常基于其实际收入和盈利能力，但Anthropic以470亿美元的年收入获得了近万亿美元的估值，这一估值水平远超传统科技公司，表明投资者对AI未来的预期已完全脱离当前财务基本面，形成了非理性的估值泡沫。
  
  non-consensus valuation-bubble ai-investment
Visit annotations in context

Tags

revenue-growth

ai-investment

valuation-bubble

non-consensus

business-intelligence

business-operations

ai-commercialization

ai-learning

Annotators

fxp007

URL

anthropic.com/news/series-h
www.anthropic.com www.anthropic.com

https://www.anthropic.com/news/anthropic-kpmg

2
1. fxp007 29 May 2026
  
  in Public
  
  KPMG and UT Austin's research helps clarify what that human should be doing
  
  文章提到KPMG与UT奥斯汀大学进行联合研究，但没有提供研究样本大小、研究方法或具体发现等量化数据。此处缺乏量化依据，无法评估研究的科学价值和实际应用效果。合作研究本身是一个积极信号，但没有具体研究成果的数据支持，难以评估其对AI实践的实际指导意义。
  
  data-point research-collaboration ai-human-interaction
2. fxp007 29 May 2026
  
  in Public
  
  every one of KPMG's 276,000+ employees globally will gain access to Claude
  
  276,000名员工获得Claude访问权限是一个相当大的AI部署规模，这代表了企业AI采用的一个重要里程碑。这个数字可信度较高，因为大型专业服务公司通常有准确的人力资源数据。与微软、谷歌等科技巨头数百万员工的AI部署相比，这个规模虽然较小，但在专业服务行业中属于领先水平。
  
  data-point workforce-size ai-adoption
Visit annotations in context

Tags

ai-human-interaction

ai-adoption

research-collaboration

workforce-size

data-point

Annotators

fxp007

URL

anthropic.com/news/anthropic-kpmg
techcrunch.com techcrunch.com

https://techcrunch.com/2026/05/27/ai-coding-startup-cognition-raises-1b-at-25b-pre-money-valuation/

2
1. fxp007 29 May 2026
  
  in Public
  
  AI coding startup Cognition raises $1B at $25B pre-money valuation
  
  标题本身就是一句极具冲击力的金句，简洁明了地传达了核心信息：一家AI编程初创公司获得了10亿美元融资，投前估值高达250亿美元。这个数字组合展示了AI编程领域正在经历前所未有的资本热潮，反映了市场对AI编程工具未来价值的极高预期。
  
  quotable ai-funding valuation-milestone
2. fxp007 29 May 2026
  
  in Public
  
  As Cognition reaches $492 million in annualized revenue run rate, it more than doubled its valuation in eight months, it says.
  
  这句话精炼地概括了Cognition公司的惊人增长速度和估值飙升，展示了AI编程领域的爆发式发展。492亿美元的年收入化运行率在短短八个月内估值翻倍，这种增长速度在科技行业极为罕见，凸显了AI编程工具市场的巨大潜力和投资者对该领域的强烈信心。
  
  quotable ai-growth valuation-surge
Visit annotations in context

Tags

ai-funding

valuation-milestone

ai-growth

valuation-surge

quotable

Annotators

fxp007

URL

techcrunch.com/2026/05/27/ai-coding-startup-cognition-raises-1b-at-25b-pre-money-valuation/
creatoreconomy.so creatoreconomy.so

https://creatoreconomy.so/p/how-this-5x-founder-runs-his-startup-solo-with-ai-agents

4
1. fxp007 29 May 2026
  
  in Public
  
  5x Founder Runs His Startup Solo With AI Agents
  
  行动建议：采用AI代理系统实现单人团队的多倍增长效果，将AI助手整合到产品开发、市场营销、客户服务等各个环节，构建可扩展的业务模型而不依赖大量人力投入。
  
  actionable scalability ai-strategy
2. fxp007 29 May 2026
  
  in Public
  
  Ryan demo his exact OpenClaw, Codex, and Devin setup
  
  行动建议：深入研究和复制Ryan的具体AI工具配置流程，特别是OpenClaw用于会议安排、Codex用于代码开发、Devin用于功能交付的集成方案，构建自己的AI代理工作流。
  
  actionable ai-tools workflow
3. fxp007 29 May 2026
  
  in Public
  
  How This 5x Founder Runs His Startup Solo With AI Agents
  
  行动建议：学习成功5倍增长创始人的AI代理使用模式，构建自己的AI代理系统，将重复性任务自动化，专注于核心战略决策，实现单人团队的规模化运营效果。
  
  actionable how-to ai-agents
4. fxp007 29 May 2026
  
  in Public
  
  Watch Ryan demo his exact OpenClaw, Codex, and Devin setup that books meetings, runs ads, and ships features while he sleeps
  
  行动建议：研究并测试OpenClaw、Codex和Devin这些AI工具的组合，设置自动化的会议安排、广告投放和功能开发流程，让AI助手在非工作时间也能处理关键业务任务，实现24/7运营。
  
  actionable ai-agents productivity
Visit annotations in context

Tags

ai-tools

workflow

actionable

productivity

how-to

ai-agents

ai-strategy

scalability

Annotators

fxp007

URL

creatoreconomy.so/p/how-this-5x-founder-runs-his-startup-solo-with-ai-agents
36kr.com 36kr.com

https://36kr.com/p/3821521291038856

2
1. fxp007 29 May 2026
  
  in Public
  
  我们不是要挑战医生的权威，而是要帮患者明明白白看病，以患者为中心，让他拥有知情权和决策权。
  
  在AI医疗领域，大多数公司选择与医生合作或复制医生经验，而王小川提出'造医生'而非'复制医生'的理念，强调以患者为中心而非医生权威。这一立场挑战了医疗AI行业普遍的'医生中心'模式，提出了一个与主流医疗AI发展路径不同的非共识观点。
  
  non-consensus healthcare-ethics ai-philosophy
2. fxp007 29 May 2026
  
  in Public
  
  如果做主流，你也会有其他恐惧。我不是说我现在做得特别好，只是主流也有主流的问题，不同选择有各自的代价。
  
  大多数人认为选择主流AI赛道（通用大模型）会更安全、更有前景，但王小川认为即使走主流道路也会面临同等程度的焦虑和恐惧，暗示行业共识可能存在盲点。这一观点挑战了'主流即安全'的普遍认知，暗示在AI领域，无论选择哪条道路都有其内在压力。
  
  non-consensus ai-industry counterintuitive
Visit annotations in context

Tags

ai-philosophy

counterintuitive

ai-industry

healthcare-ethics

non-consensus

Annotators

fxp007

URL

36kr.com/p/3821521291038856
www.anthropic.com www.anthropic.com

https://www.anthropic.com/engineering/how-we-contain-claude

1
1. fxp007 29 May 2026
  
  in Public
  
  More capable models make fewer mistakes, but they're also better at finding unexpected paths to a goal, often by routing around restrictions nobody thought to write down.
  
  大多数人认为更强大的AI模型会更安全，因为它们能更好地理解指令和限制。但作者指出，更强大的模型虽然错误更少，但它们更善于找到绕过未明确记录限制的创新路径，这实际上可能带来新的安全风险，挑战了'能力越强越安全'的普遍认知。
  
  counterintuitive ai-capabilities security-risk
Visit annotations in context

Tags

security-risk

ai-capabilities

counterintuitive

Annotators

fxp007

URL

anthropic.com/engineering/how-we-contain-claude
arstechnica.com arstechnica.com

https://arstechnica.com/tech-policy/2026/05/trump-canceled-ai-safety-testing-eo-after-snub-from-tech-ceos/

4
1. fxp007 29 May 2026
  
  in Public
  
  The government sought to evaluate models up to 90 days prior to release, while AI labs pushed for a much shorter timeline of only 14 days.
  
  大多数人认为科技公司会支持更严格的监管以确保安全，但作者认为AI公司实际上推动的是更短的测试时间，这挑战了科技公司总是寻求更多监管保护的主流观点。
  
  counterintuitive ai-company-stance regulation-timeline
2. fxp007 29 May 2026
  
  in Public
  
  Trump delays AI safety testing EO, claiming it would be an innovation 'blocker.' 'I really thought [the order] could have been a blocker.'
  
  大多数人认为政府AI安全测试会促进创新和保障安全，但作者认为特朗普认为安全测试会阻碍创新，这挑战了监管通常被认为能促进负责任创新的共识。
  
  non-consensus innovation-vs-safety trump-ai-policy
3. fxp007 29 May 2026
  
  in Public
  
  According to Lee, parallel to the AI race is 'a separate, potentially more important race' to figure out how 'who can govern powerful AI without choking off innovation.' China may be slightly edging ahead of the US in that race.
  
  大多数人认为美国在AI领域领先中国，但作者认为中国在AI治理方面可能领先美国，这是一个反直觉的观点，挑战了主流认知中美国在AI技术和监管方面都领先的看法。
  
  counterintuitive china-ai-governance us-china-tech-race
4. fxp007 29 May 2026
  
  in Public
  
  Trump has taken a hands-off approach to regulating AI since retaking office, but members of his administration got spooked and began recommending safety testing after Anthropic flagged cybersecurity risks with its latest model, Mythos.
  
  大多数人认为特朗普政府会继续其宽松的科技监管立场，但作者认为特朗普政府内部出现了分歧，部分官员在安全事件后转向支持AI安全测试，这挑战了人们对特朗普一贯的监管风格的预期。
  
  non-consensus trump-policy-shift ai-regulation
Visit annotations in context

Tags

trump-ai-policy

trump-policy-shift

non-consensus

innovation-vs-safety

ai-regulation

counterintuitive

ai-company-stance

china-ai-governance

us-china-tech-race

regulation-timeline

Annotators

fxp007

URL

arstechnica.com/tech-policy/2026/05/trump-canceled-ai-safety-testing-eo-after-snub-from-tech-ceos/
www.anthropic.com www.anthropic.com

https://www.anthropic.com/research/coding-agents-social-sciences

2
1. fxp007 29 May 2026
  
  in Public
  
  On a 1 to 10 scale, 88% of respondents were above a 5, and half were at 8 or above. Figure 6 shows that these ratings vary strongly with AI use. The left side of the plot shows researchers that use AI for more types of tasks are more optimistic.
  
  88%的研究者对AI提高论文写作生产力持乐观态度(评分>5)，其中50%评分达到8或以上。这种乐观程度与AI使用强度呈正相关，表明实际使用体验可能影响研究者对AI工具的预期。然而，70%的研究者对AI对整个社会科学领域的积极影响持更谨慎态度，反映了研究者对AI工具影响的复杂看法。
  
  data-point optimism ai-expectations
2. fxp007 29 May 2026
  
  in Public
  
  The vast majority of respondents (81%) have tried using AI chatbots in research, particularly for writing code and editing prose. But only 20% have adopted coding agents—tools like Claude Code that autonomously write and execute analysis code—into their work.
  
  81%使用AI聊天机器人的比例远高于20%采用编码代理的比例，这表明虽然大多数社会科学家已经尝试过AI工具，但只有少数人真正采用了更先进的自主编码工具。这个差距反映了AI工具采用过程中的明显分层，可能与技术接受度、工作流程整合难度有关。
  
  data-point adoption-rate ai-tools
Visit annotations in context

Tags

adoption-rate

ai-tools

ai-expectations

optimism

data-point

Annotators

fxp007

URL

anthropic.com/research/coding-agents-social-sciences
www.technologyreview.com www.technologyreview.com

https://www.technologyreview.com/2026/05/26/1137865/its-time-to-address-the-looming-crisis-in-entry-level-work/

3
1. fxp007 29 May 2026
  
  in Public
  
  The time is now to make changes in the way we train, prepare, and support young people who are about to enter the workforce
  
  文章没有提供具体的时间框架或量化指标来支持'现在必须改变'的紧迫性声明。这一论点基于前述数据，但缺乏具体的转型时间表或预期效果数据。需要更多具体数据来评估改革的时间紧迫性和预期效果。
  
  statistics workforce-training ai-adaptation
2. fxp007 29 May 2026
  
  in Public
  
  workers aged 22 to 25 in the most AI-exposed occupations experienced a 16% relative decline in employment after the spread of generative AI
  
  这是一个显著的数据点，表明AI对年轻就业者产生了实质性影响。16%的相对下降幅度相当可观，特别是在控制了其他影响因素后。这一数据来自斯坦福数字经济实验室的工作论文，具有一定的学术可信度，但需要注意这是相对下降而非绝对下降。
  
  data-point ai-impact youth-employment
3. fxp007 29 May 2026
  
  in Public
  
  workers aged 22 to 25 in the most AI-exposed occupations experienced a 16% relative decline in employment after the spread of generative AI
  
  这个16%的就业下降率是文章中最关键的数据点，表明AI对年轻就业者有显著影响。这个数据来自斯坦福数字经济实验室的工作论文，具有一定可信度。然而，这是相对下降率，不是绝对数量，且仅限于AI高度暴露的职业。这一数据与整体就业稳定的趋势形成鲜明对比，说明AI的影响存在结构性差异。
  
  data-point statistics ai-impact youth-employment
Visit annotations in context

Tags

workforce-training

statistics

data-point

ai-adaptation

ai-impact

youth-employment

Annotators

fxp007

URL

technologyreview.com/2026/05/26/1137865/its-time-to-address-the-looming-crisis-in-entry-level-work/
every.to every.to

https://every.to/context-window/inside-the-100-agent-software-factory

2
1. fxp007 29 May 2026
  
  in Public
  
  Dark factory versus light factory: Parts of your work where humans and agents talk to each other (planning, design, review) stay visible can be thought of as light, and parts where agents grind through clearly defined work on their own stay in the background, in the dark.
  
  这个比喻简洁而深刻地揭示了人机协作的两种模式。'暗工厂'与'亮工厂'的区分帮助开发者理解何时需要人类监督，何时可以让AI自主工作。随着对AI输出信任度的提升，可以将更多流程移至'暗处'，这种框架为AI与人类的协作提供了清晰的指导原则。
  
  quotable ai-collaboration metaphor
2. fxp007 29 May 2026
  
  in Public
  
  Parts of your work where humans and agents talk to each other (planning, design, review) stay visible can be thought of as light, and parts where agents grind through clearly defined work on their own stay in the background, in the dark.
  
  这个比喻生动地描述了人机协作的两种模式：'明工厂'和'暗工厂'。它揭示了随着对AI代理信任度的提升，我们可以将更多工作流程转移到暗处，让AI自主处理明确任务，而人类专注于需要创造性和判断力的环节。这种区分帮助我们更好地设计人机协作的工作流。
  
  quotable ai-collaboration metaphor
Visit annotations in context

Tags

ai-collaboration

metaphor

quotable

Annotators

fxp007

URL

every.to/context-window/inside-the-100-agent-software-factory
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/harnessing-ai/

9
1. fxp007 29 May 2026
  
  in Public
  
  What happens when every company has access to the same model? The best riders win.
  
  这句话揭示了AI时代的核心竞争动态。当技术门槛降低，真正的竞争将转向如何有效利用这些技术的能力。这一洞见简洁而深刻，点明了AI时代竞争的本质不是拥有技术，而是如何应用和优化技术的能力。
  
  quotable ai-competition insight
2. fxp007 29 May 2026
  
  in Public
  
  You cannot trust what you cannot see.
  
  这句话简洁有力地指出了AI系统透明度和可观测性的重要性。在AI系统中，每一个步骤都需要被追踪和记录，这不仅是技术问题，更是信任问题。这一洞见简洁而深刻，强调了在AI时代，透明度和可观测性是建立信任的基础。
  
  quotable ai-governance trust
3. fxp007 29 May 2026
  
  in Public
  
  The best riders win.
  
  这句话简洁有力地总结了AI时代的竞争本质。当所有公司都能访问相同的AI模型时，真正的竞争优势来自于如何有效地'驾驭'这些AI系统。这一洞见简洁而深刻，点明了AI时代竞争的核心不是技术本身，而是如何应用和优化技术的能力。
  
  quotable competitive-advantage ai-strategy
4. fxp007 29 May 2026
  
  in Public
  
  Like a mustang, AI is powerful but wild. Harnessing the power means domestication.
  
  这个比喻生动形象地将AI比作野马，强调了AI的原始力量和不可预测性。'驯服'一词暗示了AI技术需要被引导和控制的本质，这一比喻既形象又深刻，让人一眼就能理解AI技术的本质和挑战。
  
  quotable metaphor ai-essence
5. fxp007 29 May 2026
  
  in Public
  
  The end of the software era is the beginning of the harness era.
  
  这句话简洁有力地概括了AI技术带来的范式转变，从传统软件到AI控制系统的过渡。'Harness'(驾驭)一词精准捕捉了AI需要被引导和控制的本质，暗示AI虽然强大但需要被'驯服'才能发挥最大价值。这一洞见简洁而深刻，能独立存在并引发思考。
  
  quotable ai-paradigm insight
6. fxp007 27 May 2026
  
  in Public
  
  The result is a new competitive dynamic in software.
  
  大多数人认为AI将使软件竞争更加激烈，但作者暗示AI实际上正在创造一种全新的竞争动态，这可能使某些领域的竞争格局完全改变。这挑战了AI对软件行业影响的主流预测，暗示行业结构可能发生根本性转变。
  
  non-consensus ai-impact software-evolution counterintuitive
7. fxp007 27 May 2026
  
  in Public
  
  What happens when every company has access to the same model? The best riders win.
  
  大多数人认为AI差异化将来自底层模型的独特性，但作者认为当所有公司都能访问相同模型时，真正的竞争将在于'驾驭者'的能力。这挑战了AI战略中模型差异化的主流观点，暗示真正的竞争优势将来自于如何使用这些模型。
  
  non-consensus ai-competitive-strategy counterintuitive
8. fxp007 27 May 2026
  
  in Public
  
  Like a mustang, AI is powerful but wild. Harnessing the power means domestication.
  
  大多数人将AI视为需要驯服的工具，但作者将其比作野生的马，暗示AI本质上是一种无法完全控制的自然力量。这种比喻挑战了AI作为完全可控工具的主流认知，暗示我们需要接受其不可预测性。
  
  non-consensus ai-philosophy counterintuitive
9. fxp007 27 May 2026
  
  in Public
  
  The end of the software era is the beginning of the harness era.
  
  大多数人认为软件将随着AI而进化，但作者认为软件时代实际上已经结束，取而代之的是'驾驭'(harness)时代。这种观点挑战了技术发展的主流叙事，暗示我们正在从创造软件工具转向驯服AI系统。
  
  non-consensus ai-paradigm-shift
Visit annotations in context

Tags

ai-philosophy

ai-competition

ai-governance

ai-paradigm

ai-competitive-strategy

ai-paradigm-shift

ai-impact

non-consensus

software-evolution

trust

counterintuitive

metaphor

ai-strategy

competitive-advantage

ai-essence

insight

quotable

Annotators

fxp007

URL

tomtunguz.com/harnessing-ai/
www.anthropic.com www.anthropic.com

https://www.anthropic.com/news/anthropic-acquires-stainless

1
1. fxp007 29 May 2026
  
  in Public
  
  Agents are only as capable as the systems they can reach.
  
  行动建议：如果你正在构建AI代理系统，优先考虑其连接能力和工具集成性。评估你的代理能够访问哪些系统和API，并确保它有足够的连接器来执行任务。这种以连接能力为中心的设计思路将显著提升你的代理的实用价值。
  
  actionable ai-agents system-design
Visit annotations in context

Tags

actionable

system-design

ai-agents

Annotators

fxp007

URL

anthropic.com/news/anthropic-acquires-stainless
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/plastic-user-interfaces/

2
1. fxp007 29 May 2026
  
  in Public
  
  This dynamic UI management is the future of software value : the harness to control the interface/ensure it's correct & the knowledge management to rationalize all the AI products over time
  
  大多数人关注AI的功能和结果，但作者认为未来软件价值在于动态UI管理和知识管理，这种将界面控制和管理而非功能实现视为核心价值的观点与主流认知相悖。
  
  non-consensus software-value ai-management
2. fxp007 29 May 2026
  
  in Public
  
  The user interface, the head isn't disappearing, it's become plastic, malleable to the interface a user needs when they need it.
  
  大多数人认为AI和自动化将导致传统用户界面被淘汰或简化。但作者认为界面正在'塑料化'—变得更加灵活和可塑，能够根据用户即时需求变化，挑战了界面简化或消失的主流观点。
  
  counterintuitive ai-interfaces ui-evolution
Visit annotations in context

Tags

counterintuitive

ai-interfaces

ai-management

software-value

ui-evolution

non-consensus

Annotators

fxp007

URL

tomtunguz.com/plastic-user-interfaces/
mistral.ai mistral.ai

https://mistral.ai/news/vibe-agent

1
1. fxp007 29 May 2026
  
  in Public
  
  Vibe drafts the deliverable using the Canvas tool, from a one-page brief to a report, an RFP response, or a board deck
  
  文章提到Vibe可以创建从一页简报到董事会演示文稿的各种文档，但没有提供具体的生成速度、质量评估或用户满意度数据。这类AI内容生成工具的效果通常需要量化指标来评估，如生成文档的准确率、用户采纳率或节省的时间。缺乏这些数据使得难以判断Vibe在文档生成方面的实际价值主张。
  
  data-point ai-capabilities quantification-missing
Visit annotations in context

Tags

quantification-missing

ai-capabilities

data-point

Annotators

fxp007

URL

mistral.ai/news/vibe-agent
spectrum.ieee.org spectrum.ieee.org

https://spectrum.ieee.org/south-africa-ai-policy

1
1. fxp007 29 May 2026
  
  in Public
  
  South Africa is not just another developing country struggling to govern artificial intelligence; it is the exception with leverage, and the window to act on it is closing.
  
  这句话精准地定义了南非在AI政策制定中的独特地位，强调了其拥有特殊优势但正在错失机会。作者用'exception with leverage'这一简洁有力的表述，点明了南非作为非洲大陆AI治理的关键角色，而'window to act on it is closing'则传达了紧迫感，使读者立即认识到问题的严重性。
  
  quotable insight africa-ai policy
Visit annotations in context

Tags

africa-ai

policy

insight

quotable

Annotators

fxp007

URL

spectrum.ieee.org/south-africa-ai-policy
www.huxiu.com www.huxiu.com

https://www.huxiu.com/article/4861200.html

1
1. fxp007 29 May 2026
  
  in Public
  
  token不是语言建模的必要条件。连续空间可以做得更好、更快、更省。
  
  大多数人认为token是语言建模的基础和必要条件，但作者通过MIT何恺明团队和字节跳动Seed实验室的研究证明，连续空间建模可以超越传统token方法，只需32步采样就能超过离散模型1024步的结果，挑战了AI领域的核心共识。
  
  counterintuitive ai-paradigm
Visit annotations in context

Tags

ai-paradigm

counterintuitive

Annotators

fxp007

URL

huxiu.com/article/4861200.html
www.a16z.news www.a16z.news

https://www.a16z.news/p/everything-everywhere-is-compliance

2
1. fxp007 29 May 2026
  
  in Public
  
  If we assume that agents will soon become the predominant purchasers on the web, this opens an entirely new category of risk
  
  大多数人认为合规风险主要来自人类行为者和传统交易模式，但作者认为自主AI代理将成为网络上的主要购买者，创造全新的合规风险类别。这一前瞻性观点挑战了现有合规框架的基础假设，暗示需要全新的合规方法。
  
  counterintuitive ai-agents-risk
2. fxp007 26 May 2026
  
  in Public
  
  if we assume that agents will soon become the predominant purchasers on the web, this opens an entirely new category of risk.
  
  大多数人认为合规风险主要来自人类行为者和交易对手。但作者认为随着AI代理成为网络上的主要购买者，将出现全新的风险类别。这挑战了传统合规框架的基本假设，暗示未来合规需要考虑非人类行为者的独特风险特征。
  
  non-consensus ai-agents compliance-risk
Visit annotations in context

Tags

counterintuitive

ai-agents

compliance-risk

ai-agents-risk

non-consensus

Annotators

fxp007

URL

a16z.news/p/everything-everywhere-is-compliance
www.technologyreview.com www.technologyreview.com

https://www.technologyreview.com/2026/05/26/1137855/a-reality-check-on-the-ai-jobs-hysteria/

6
1. fxp007 29 May 2026
  
  in Public
  
  annual employment growth for coders has slowed significantly—by about 3%—since the introduction of ChatGPT
  
  程序员就业增长率自ChatGPT推出以来下降了约3%，这是一个值得注意的下降。然而，文章同时指出'程序员就业总数仍在增长'，只是增速放缓。这表明AI正在改变特定职业的性质，而非完全消除这些职业。3%的增速下降反映了AI对编程领域的影响，但影响程度相对温和。
  
  data-point coding-jobs ai-automation
2. fxp007 29 May 2026
  
  in Public
  
  16% decline in entry-level jobs in AI-exposed occupations
  
  这个数据点显示AI相关职业的入门级工作岗位下降了16%，这是一个显著的下降幅度。特别是考虑到这是在控制其他因素后的结果，表明AI确实对年轻工人的就业产生了负面影响。这一数据与文章中提到的'22至25岁年轻人在AI暴露职业中就业人数下降'的观点一致，也反映了AI对特定职业的早期影响。
  
  data-point job-decline ai-impact
3. fxp007 29 May 2026
  
  in Public
  
  a little over 40% of workers but adoption varies by sectors
  
  数据显示约40%的工人使用生成式AI，但不同行业采用率差异显著。这个数据点表明AI在工作场所的采用情况比企业层面更广泛，但仍未达到主流水平。40%的采用率是一个中等水平，说明AI已经开始影响工作方式，但尚未完全普及，这与文章中提到的'AI尚未对劳动力市场产生颠覆性影响'的观点相符。
  
  data-point workplace-adoption ai-productivity
4. fxp007 29 May 2026
  
  in Public
  
  US Census data showing that only one in five companies are using AI in any business function.
  
  这个数据点表明AI在企业中的采用率相对较低，仅为20%。这意味着尽管媒体对AI的炒作很多，但实际商业应用仍处于早期阶段。这一数据与文章中提到的'AI尚未对劳动力市场产生大规模影响'的观点一致，也解释了为什么劳动力市场统计数据尚未显示AI带来的显著变化。
  
  data-point adoption-rate ai-business
5. fxp007 26 May 2026
  
  in Public
  
  One of the somewhat surprising wrinkles uncovered by recent research is that wages in sectors highly exposed to AI have risen relatively fast since the introduction of ChatGPT.
  
  大多数人认为AI会压低工资或导致工资增长停滞，但作者认为AI高度影响行业的工资实际上在快速增长。这一发现与主流预期相悖，表明AI可能正在增加而非减少高技能工作的价值。
  
  non-consensus wage-growth ai-economy
6. fxp007 26 May 2026
  
  in Public
  
  The impact on head counts depended on how AI was being used. It was specifically the jobs where tasks could be automated... that accounted for the decrease in employment—jobs for people like software developers. In jobs where AI was mainly used but to augment human work, head counts grew faster than the average for entry-level workers.
  
  大多数人认为AI会替代所有相关工作，但作者认为AI对就业的影响取决于使用方式——完全自动化的工作确实减少，但增强人类工作的AI反而促进了就业增长。这一区分挑战了AI必然导致失业的简单化观点。
  
  non-consensus ai-implementation job-impact
Visit annotations in context

Tags

ai-business

ai-implementation

job-decline

ai-automation

ai-productivity

data-point

ai-impact

ai-economy

non-consensus

adoption-rate

coding-jobs

wage-growth

workplace-adoption

job-impact

Annotators

fxp007

URL

technologyreview.com/2026/05/26/1137855/a-reality-check-on-the-ai-jobs-hysteria/
developer.nvidia.com developer.nvidia.com

https://developer.nvidia.com/blog/nvidia-verified-agent-skills-provide-capability-governance-for-ai-agents/

5
1. fxp007 29 May 2026
  
  in Public
  
  Verified skills extend this AI governance to agent capabilities. Runtime controls help govern agent behavior during execution. Verified skills govern capabilities that enter the workflow and become a common way to extend trust agents across coding tools, registries, and enterprise platforms.
  
  行动建议：将验证技能作为AI代理治理的核心组成部分，不仅在运行时控制代理行为，还要管理进入工作流的能力。这种方法可以扩展到编码工具、注册表和企业平台，建立跨平台的信任机制。
  
  actionable ai-governance how-to
2. fxp007 29 May 2026
  
  in Public
  
  Certificate retrieval, supported verification tooling, and example verification commands see the signing documentation. For example, you can verify a signed skill locally. To do so, follow these steps: Download the NVIDIA Agentic Capabilities root certificate as nv-agent-root-cert.pem Install an OpenSSF Model Signing (OMS) verifier, such as pip install model-signing Execute the following command to verify the skill signature
  
  行动建议：按照文中提供的步骤下载NVIDIA代理能力根证书，安装OpenSSF模型签名验证器，并使用提供的命令验证技能签名。这种实践可以确保您下载的技能是真实的且未被篡改，增强对AI代理能力的信任。
  
  actionable how-to ai-security
3. fxp007 29 May 2026
  
  in Public
  
  SkillSpector checks conventional software risks such as vulnerable dependencies, suspicious scripts, dangerous code patterns, credential access, and data exfiltration paths. SkillSpector also checks agent-specific risks, such as hidden instructions, prompt injection, trigger abuse, excessive agency, tool poisoning, and mismatches between a skill's declared purpose, requested access, and bundled behavior.
  
  行动建议：在开发或使用AI代理技能时，使用SkillSpector工具进行安全扫描，检查依赖项、脚本模式、凭证访问和数据泄露路径等常规风险，以及隐藏指令、提示注入、触发滥用等特定风险。这有助于在技能部署前识别并缓解潜在的安全问题。
  
  actionable ai-security how-to
4. fxp007 29 May 2026
  
  in Public
  
  To get started with the cuOpt verified skill, for example, follow these steps: 1. Pull the cuOpt verified skill from the catalog: git clone github.com/nvidia/skills && cd skills/skills/cuopt 2. Verify the signature: model_signing verify certificate. --signature skill.oms.sig --certificate-chain nv-agent-root-cert.pem --ignore-unsigned-files 3. Open SKILLCARD.yaml to see ownership, dependencies, license, and verification status.
  
  行动建议：按照文中提供的具体步骤，克隆并验证NVIDIA的cuOpt技能，查看技能卡片以了解所有权、依赖关系、许可证和验证状态。这种实践可以确保您使用的技能是经过验证的，并且可以安全地集成到您的AI代理工作流中。
  
  actionable how-to ai-deployment
5. fxp007 29 May 2026
  
  in Public
  
  NVIDIA-verified agent skills are portable instruction sets that help developers understand, trust, and safely deploy AI agent capabilities by providing transparency, provenance, security scanning, and cryptographic signing.
  
  行动建议：将NVIDIA验证的代理技能作为构建AI代理能力的标准组件，优先选择经过验证的技能而非未经验证的技能，确保透明度和安全性。这些技能可以跨不同AI代理工具使用，提供一致的能力和安全性保障。
  
  actionable ai-security how-to
Visit annotations in context

Tags

ai-security

actionable

ai-governance

how-to

ai-deployment

Annotators

fxp007

URL

developer.nvidia.com/blog/nvidia-verified-agent-skills-provide-capability-governance-for-ai-agents/
www.a16z.news www.a16z.news

https://www.a16z.news/p/avoiding-death-on-the-yellow-brick

5
1. fxp007 29 May 2026
  
  in Public
  
  The best agent businesses are going to need to execute like hedge funds — winning on alpha measured in customer P&L, not in benchmark scores.
  
  这句话用对冲基金作为比喻，生动地描述了优秀AI应用公司的成功标准。作者指出，这些公司需要在客户的实际业务成果（P&L）上获得超额收益（alpha），而不是在通用基准测试上获得高分。这个洞见强调了AI应用公司应该以客户的实际业务价值为中心，而不是技术指标。
  
  insight ai-business-metrics performance
2. fxp007 29 May 2026
  
  in Public
  
  The model is fungible underneath; the system of work is not.
  
  这句话简洁而深刻地指出了AI应用层的本质区别。作者认为，底层的AI模型是可以互换的，但工作的系统（system of work）却是独特的。这个洞见揭示了为什么专注于构建特定工作系统的公司能够长期保持竞争优势，而仅仅依赖通用模型的公司则难以建立持久的业务。
  
  quotable ai-business system-thinking
3. fxp007 29 May 2026
  
  in Public
  
  The workflow you ship on day one is not the moat. The loop that production usage creates over time is.
  
  这句话深刻地揭示了AI应用公司的真正护城河所在。作者指出，初始的工作流程不是竞争壁垒，而是在生产环境中持续使用、学习和改进所形成的循环才是真正的护城河。这个洞见强调了实践经验、数据积累和持续迭代的重要性，对于理解AI应用公司的长期价值至关重要。
  
  insight competitive-advantage ai-workflows
4. fxp007 29 May 2026
  
  in Public
  
  The labs really are coming for a huge swath of the application surface. But 'the application layer' isn't just one homogenous opportunity.
  
  这句话精准地捕捉了AI应用层的复杂性和多样性。作者指出大型AI实验室确实会覆盖大量应用领域，但这并不意味着所有应用机会都是同质的。这个洞见反驳了'AI将杀死所有应用层'的简单化观点，为创业者指明了在特定垂直领域寻找机会的方向。
  
  insight ai-applications opportunity
5. fxp007 29 May 2026
  
  in Public
  
  The Yellow Brick Road is our shorthand for the path the labs are walking, where they're committing extraordinary resources.
  
  这句话用《绿野仙踪》中的黄砖路作为比喻，形象地描述了大型AI实验室正在走的道路。这个比喻生动地表达了这些实验室拥有巨大资源，正在构建一条明显可见的发展路径。这个洞见帮助读者理解AI应用生态中的不同发展方向，以及为什么有些领域竞争激烈而有些领域则存在机会。
  
  quotable ai-ecosystem metaphor
Visit annotations in context

Tags

ai-business

ai-business-metrics

ai-applications

metaphor

performance

ai-workflows

system-thinking

ai-ecosystem

opportunity

competitive-advantage

insight

quotable

Annotators

fxp007

URL

a16z.news/p/avoiding-death-on-the-yellow-brick
www.latent.space www.latent.space

https://www.latent.space/p/ainews-all-model-labs-are-now-agent

3
1. fxp007 29 May 2026
  
  in Public
  
  the model alone is no longer the product
  
  大多数人认为基础模型本身就是AI产品，但作者认为单一模型不再构成完整产品。这一反直觉观点强调，真正的AI产品需要模型+工具+工作流+UI+记忆+经济学的组合，挑战了AI行业长期以来的'模型中心主义'思维模式。
  
  counterintuitive ai-product-design model-centricity
2. fxp007 29 May 2026
  
  in Public
  
  Model Labs are increasingly also building Agents as the product
  
  大多数人认为模型实验室应该专注于提升基础模型的能力，但作者认为这些实验室现在正转变为代理实验室。这一观点挑战了AI行业的基础假设，即模型本身是产品，而不是模型只是更大代理系统的一部分。这标志着AI行业从'模型即产品'向'代理即产品'的根本性转变。
  
  non-consensus ai-paradigm-shift product-evolution
3. fxp007 29 May 2026
  
  in Public
  
  The quote is a big reversal of stance from a position ~uniformly held by anyone who worked at **Team Big Model**, including his previous head of OpenAI Labs
  
  大多数人认为大型模型实验室会继续专注于基础模型研发，但作者认为这是一个立场的重大转变，因为连OpenAI前高管都开始转向代理产品。这挑战了AI行业长期以来的'模型优先'共识，表明即使是Big Model团队也开始认可代理产品的价值。
  
  non-consensus ai-industry-shift model-vs-agent
Visit annotations in context

Tags

ai-product-design

model-centricity

ai-paradigm-shift

non-consensus

product-evolution

model-vs-agent

counterintuitive

ai-industry-shift

Annotators

fxp007

URL

latent.space/p/ainews-all-model-labs-are-now-agent
techcrunch.com techcrunch.com

Tech CEOs are apparently suffering from AI psychosis | TechCrunch

1
1. pyxelr 28 May 2026
  
  in Public
  
  Tech CEOs are apparently suffering from AI psychosis
  
  Box founder Aaron Levie coined the phrase "AI psychosis" to describe tech executives who suffer from delusions of AI grandeur due to being too distant from the actual day-to-day operations where value is generated.
  
  Because CEOs only interact with high-level prototypes, they mistakenly leap to the conclusion that AI agents can effortlessly handle full workloads without realizing the heavy human labor required to review code, patch bugs, catch hallucinations, and train models.
  
  This executive delusion has real-world consequences, driving severe workforce reductions; in the first five months of 2026, over 115,000 tech workers were laid off—nearly matching the total for all of 2025—with AI cited as a primary justification.
  
  High-profile actions, such as ClickUp CEO Zeb Evans laying off 22% of his workforce after deploying 3,000 AI agents, are framed as shifting humans into "manager and verifier" roles for AI outputs.
  
  Empirical data from UC Berkeley, NBER, and MIT refutes these massive productivity assumptions, demonstrating no robust link between current AI adoption and aggregate productivity gains, with MIT predicting baseline competence on text tasks will not materialize until 2029.
  
  A Harvard Business Review study warns that flooding an organization with unverified AI output merely shifts bottlenecks onto executives, risking widespread structural and operational chaos if human oversight fails to scale.
  
  Hacker News Discussion
  
  Distance from Reality: Commenters strongly agreed with the premise that executives live in a bubble, noting that they deal primarily with administrative assistants, sycophants, and curated, "happy path" demos that look like magic, making them blind to edge cases and errors.
  
  The "Yes-Man" Nature of AI: Multiple users pointed out that AI agents behave like the ultimate corporate sycophants—they work 24/7, lack internal moral conflict, and never say no—making them highly attractive to authoritative executives who dislike pushback from human workers.
  
  Absence of Self-Preservation: A key distinction raised in the comments is that unlike human employees, AI lacks "self-preservation," a sense of reputation, or a fear of consequences, meaning an agent will confidently delete a production database or kill its own server processes without hesitation.
  
  Misuse of the Term: Some participants criticized the article's title as clickbait, arguing that "AI psychosis" should describe literal psychological delusions in individuals interacting with AI rather than standard corporate incompetence or unrealistic executive expectations.
  
  Projection of Executive Work: A popular theory suggested that CEOs assume AI can replace everyone's job because it can easily replicate their own daily tasks, such as generating slide decks, sending emails, and attending high-level meetings.
  
  AI work psychology
Visit annotations in context

Tags

work

AI

psychology

Annotators

pyxelr

URL

techcrunch.com/2026/05/27/tech-ceos-are-apparently-suffering-from-ai-psychosis/
mlsu.io mlsu.io

Can we have the day off?

1
1. pyxelr 28 May 2026
  
  in Public
  
  Can we have the day off?
  
  The author questions why the promised 10x productivity gains from AI do not result in more time off for workers, such as a four-day work week.
  
  If AI can allow a worker to complete a week's worth of output by Monday afternoon, Friday could theoretically be declared an "AI workers' day" where agents handle the workload.
  
  This extra day off would benefit everyone, including the C-suite and boards of directors, who could spend the time leisure-seeking rather than being at the office.
  
  Despite entering a revolution across every sector of human productivity, the fundamental structure of the five-day work week remains unchanged.
  
  The high cost of living and childcare (e.g., $6,000/month in California) adds pressure on employees, making the flexibility of fewer office days highly desirable.
  
  Hacker News Discussion
  
  Capturing Productivity Gains: Many commenters note that while workers are pushed to adopt AI tools to multiply their output, they do not stand to benefit financially or receive more time off; instead, the economic gains are heavily consolidated by employers and capital owners.
  
  The Reality of Salaries: A discussion emerged around how salaried employees are typically compensated. Some argue that employees are paid for their availability and time rather than direct output, making it difficult to negotiate less time for the same pay.
  
  Fear and Leverage: Users highlight that instead of increased compensation, the rise of AI has brought widespread fear of layoffs and lower job security, keeping workers compliant rather than demanding a 4-day workweek.
  
  Collective Action and Policy: Several participants suggest that asking an employer for a day off individually is naive due to market competition and the Prisoner's Dilemma. They argue that structural changes like historical worker protections, unions, or government-led policies like Universal Basic Income (UBI) are necessary to shift the status quo.
  
  AI work
Visit annotations in context

Tags

work

AI

Annotators

pyxelr

URL

mlsu.io/posts/day-off/
www.vatican.va www.vatican.va

Encyclical Letter of His Holiness Leo XIV Magnifica Humanitas (15 May 2026)

9
1. choppa1890 28 May 2026
  
  in Public
  
  How does the UST's TeachOnline office aligns (or not) with the contents of this encyclical.
  
  In alignment with our Catholic University's mission of goodness, knowledge and discipline; first, we've worked very hard to understand how artificial intelligence works, the best approach for artificial intelligence and, what it can and cannot do. As instructional designers we have an ethical and moral code to do no harm to our students; the creation or purveying of false information would be a moral and intellectual harm; so, to the best of our abilities, we seek to only generate accurate and factual information with artificial intelligence tools. We do this by using existing documents, meeting transcripts, and other human-generated artifacts as part of context engineering for the prompts we are creating.
  
  Additionally, on the topic of goodness, and in alignment with the ethical quandaries of using artificial intelligence tools that can be connected to "long chain of mediation, involving vast networks of natural resources, energy infrastructure, and above all people". That is, tools that are known to be exploitative to the environment and hurt neighboring people, –specially marginalized communities– (xAI/Grok), disregard the subsidiarity of local communities (Meta AI), and known for harming adult and children with its ability to convince them of false and violent informaton (ChatGPT); our chosen tools are Anthropic's Claude Sonnet and Opus models. That isn't to say that Anthropic is guiltless. However, it continues to stand above all other companies as being the most ethical and conscientious artificial intelligence lab – although that is not saying much, Claude has been used as a hacking tool, and it was used in Pentagon for weapon and operation planning; prior to its designation as a national security risk, ironically because they sought to enact a "red line" (that is disarm) on their AI being used on weapon systems and mass surveillance.
  
  As educators and instructional designers, we welcome the challenge to rethink "the organization of schools, physical spaces, evaluation methods and the role of teachers themselves... promote an authentically integral education that addresses every dimension of the person." To do this, we follow our scientific and ethical practices of our profession in the development of courses that have measurable outcomes, accurate, engaging, collaborative, applicable to real life, that hopefully lead to reflection and contemplation. Additionally, our role as educators helps "disarm" AI from its worst possible uses, and we can further assist by beating "swords into ploughshares" by helping our students understand the ethical and moral boundaries of any technological use and implement it in ways that aid humanity. We respect that our faculty engage in the work of Nehemiah, by helping to build the wall of Jerusalem; by engaging in one of the most charitable acts in humanity, that of giving away and imparting their knowledge unto the future generation.
  
  WIP!!!!
  
  ai-ethics catholic-social-teaching teachonline-philosophy magnifica-humanitas
2. JoeMurphy 27 May 2026
  
  in Public
  
  These criteria give rise to certain non-negotiable requirements. First, all systems used in a war setting must guarantee the possibility of retracing and reconstructing decision-making processes, so that accountability and blame are not collapsed into “the machine.” Second, the decision to use lethal force cannot be delegated to opaque or automated processes, but must remain under effective, self-aware and responsible human control. Finally, it is imperative to establish a shared framework — also at the international level — in order to curb the technological arms race and ensure robust protection for civilians and the infrastructures necessary for their survival.
  
  Criteria for the AI-assisted use of force. (Might be interesting to ask whether these should apply to non-war situations as well, like police or private security use of force.)
  
  AI war use of force
3. JoeMurphy 27 May 2026
  
  in Public
  
  While AI can enhance the defense and protection of civilians, it can also lower the threshold for the use of force, shield people from responsibility and foster a culture in which the enemy is reduced to a statistic and the victim to “collateral damage.”
  
  Interesting to connect these impacts to the "hybrid forms" of warfare 2 sentences above, like cyberattack and information ops.
  
  war automation AI
4. JoeMurphy 27 May 2026
  
  in Public
  
  In practical terms, in the age of AI and robotics, ensuring that the economy favors human dignity means adopting certain criteria for firm action. First, transparency and accountability: when data and algorithms influence credit distribution, personnel selection or access to services and opportunities, it is necessary that decisions be understandable, contestable and subject to oversight, so that individuals are not reduced to mere profiles. Second, inclusion and access: the benefits of innovation must be paired with investments in skills, infrastructure and essential services to ensure that technology does not widen the gap between those who have and those who have not. Finally, measures to ensure equity: taxation, social protection and industrial policies must correct the imbalances created by the concentration of wealth and power. Indeed, these criteria do not constitute a curb on innovation; instead they make it civilized and humane.
  
  Suggests regulation along the lines of algorithmic/data transparency & accountability, investing the profits of innovation in education and essential services, and laws and policies which check the concentration of wealth and power.
  
  AI innovation regulation data algorithms
5. JoeMurphy 27 May 2026
  
  in Public
  
  current approaches to technology can paradoxically de-skill workers, subject them to automated surveillance and relegate them to rigid and repetitive tasks. The need to keep up with the pace of technology can erode workers’ sense of agency and stifle the innovative abilities they are expected to bring to their work
  
  work AI deskilling agency innovation
6. JoeMurphy 27 May 2026
  
  in Public
  
  Educating people about the use of AI, then, involves teaching them to decide when and for what purpose it ought not to be used. The speed and ease with which answers or summaries can be obtained risk extinguishing the desire to ask questions, which is a process that bears fruit only over time.
  
  This section is connecting specific discernment about when AI is not the best tool for a given job (or as too central a part of an information diet) with a general avoidance of technology and specifically social media platforms.
  
  AI education information questioning information glut patience
7. JoeMurphy 27 May 2026
  
  in Public
  
  Our task today is not only ethical or technical. It is ecological in the deepest sense, for it concerns a new dimension of our common home. AI is already an environment in which we are immersed, as well as a force with which we must engage. For this reason, merely regulating it is insufficient; it must be disarmed, welcoming and accessible.
  
  "Disarming" not merely as standing down from hostility and dominance but an active commitment to accessibility and hospitality.
  
  AI intentionally equitable hospitality regulations disarming
8. JoeMurphy 27 May 2026
  
  in Public
  
  For AI to respect human dignity and truly serve the common good, responsibility must be clearly defined at every stage: from those who design and develop these systems to those who use them and rely on them for concrete decisions. In many cases, however, the internal processes leading to a result remain opaque, making it harder to assign responsibility and correct errors. This is where accountability becomes crucial: the possibility of identifying who must “account” for decisions, justify them, monitor them, and, when necessary, challenge them and remedy any harm caused.
  
  Passage starts with "For AI to respect" and ends with "identifying who must account for decisions". Rhetorically, starts from the premise that AI could respect but quickly changes focus from tool to designer/developer/user.
  
  systems responsibility ethics accountability AI
9. JoeMurphy 27 May 2026
  
  in Public
  
  Even when these tools are described as capable of “learning,” their way of doing so is different from that of a human person. It is not the experience of those who allow themselves to be shaped by life and grow over time through choices, mistakes, forgiveness and fidelity. Rather, it is a form of statistical adaptation based on data and feedback, which can be very effective, but does not imply inner growth.
  
  Whole paragraph is good on the difference between "data processing" by AI and human intelligence/understanding/wisdom. Really intrigued here by the idea that forgiveness and fidelity are keys to learning.
  
  learning intelligence AI
Visit annotations in context

Tags

deskilling

innovation

regulations

ai-ethics

intelligence

magnifica-humanitas

regulation

data

work

teachonline-philosophy

AI

systems

learning

algorithms

use of force

information glut

ethics

disarming

patience

questioning

information

responsibility

catholic-social-teaching

accountability

war

agency

automation

intentionally equitable hospitality

education

Annotators

JoeMurphy

choppa1890

URL

vatican.va/content/leo-xiv/en/encyclicals/documents/20260515-magnifica-humanitas.html
orchidfiles.com orchidfiles.com

I’m tired of talking to AI

2
1. pyxelr 28 May 2026
  
  in Public
  
  I’m tired of talking to AI
  
  The author expresses profound frustration with the pervasive infiltration of AI-generated answers into daily and professional communications.
  
  Encountering malware-spreading repositories on GitHub, the author sought a resolution via an open discussion, only to repeatedly receive copy-pasted AI answers that offered no practical utility.
  
  In a workplace scenario, a business owner repeatedly forwarded unread ChatGPT screenshots rather than engaging with or directly answering the author's specific business questions.
  
  Online interpersonal interactions have also been compromised, illustrated by an instance where the author discovered they were conversing with an AI agent after exchanging multiple messages on Reddit.
  
  The core grievance highlights a growing societal loss of genuine human connection, as individuals increasingly forward raw AI text instead of thinking for themselves or conversing sincerely.
  
  Hacker News Discussion
  
  Erosion of Workplace Culture: Many commenters emphasized that relying on AI to respond to colleagues destroys organic trust-building opportunities. Reaching out to teammates is often less about extracting text and more about establishing communication, context, and validation.
  
  Lazy Delegation and Management Failures: Participants noted that heavy corporate pushes for AI productivity have caused a misunderstanding of boundaries. Instead of using it to handle grunt work, some employees lazily offload all cognitive overhead to chatbots without reviewing or fact-checking the output.
  
  Analogy to "Let Me Google That For You": Sending a raw, unverified AI response to a direct question is widely viewed as passive-aggressive and insulting. It conveys a strong signal that the sender did not respect the asker's time enough to even read the answer they forwarded.
  
  Existential Risk to Job Security: Several users pointed out that individuals who mindlessly pass along unedited AI screenshots are strongly signaling that their entire job function can be replaced by an LLM, making them prime candidates for corporate layoffs.
  
  The Effort to Remain Human: Some users shared that they have intentionally begun introducing written idiosyncrasies into their messages to prove they are human, though others countered that future AI models will inevitably mimic these individual quirks anyway.
  
  AI work communication
2. pyxelr 28 May 2026
  
  in Public
  
  But even when I talk to people, they forward my questions to AI and send me the AI’s answer.
  
  AI work communication
Visit annotations in context

Tags

work

AI

communication

Annotators

pyxelr

URL

orchidfiles.com/im-tired-of-ai-generated-answers/
x.com x.com

(1) Mnimiy on X: "I tracked 430 hours of Claude Code usage. 73% was wasted on these 9 patterns." / X

1
1. pyxelr 28 May 2026
  
  in Public
  
  I tracked 430 hours of Claude Code usage. 73% was wasted on these 9 patterns.
  
  Data Logged via Proxy: Over a 90-day period, a developer tracked all Claude Code activity using an HTTP proxy to capture full payloads, token counts, and costs directly interfacing with the Anthropic API.
  
  The Scale: The dataset spanning this study consists of 430 hours of actual work, 6 million input tokens, and a total spend of $1,340 on API costs.
  
  The Waste Discovery: Analysis revealed that only 27% of the total tokens processed did actual "productive work." The remaining 73% were consumed by nine hidden, automated inefficiency patterns.
  
  The Solution: By identifying and resolving these nine patterns—each requiring roughly a 30-second fix—productive token efficiency can be increased from 27% to approximately 65% without changing the underlying model or losing functionality.
  
  The 9 Major Cost Culprits:
  
  CLAUDE.md Bloat (~14% waste): Large, overly dense, or un-optimized systemic instructions files consume massive, unnecessary overhead tokens on every single interaction. Fix: Compress, aggressively prune rules, or split instructions into context-specific modular files.
  
  Conversation History Re-read (~13% waste): Long chat sessions exponentially multiply costs, as message #30 costs 30 times more than message #1 due to processing the entire accumulated history. Fix: Use a structured context-refresh cadence to summarize and discard older, unnecessary messages without losing the current task state.
  
  Hook Injection (~11% waste): Context injected via automated UserPromptSubmit hooks unnecessarily loads extra code and data into the prompt context for tasks that don't require them. Fix: Replace indiscriminate global hooks with conditional triggers that only attach context when explicit keywords or file types are targeted.
  
  Cache Misses (~10% waste): Expired prompt caches (which have a short 5-minute lifespan) force expensive, full-price re-tokenization of the codebase context when work pauses briefly. Fix: Set up an automated low-cost "keep-alive" ping task every 4 minutes to maintain the prompt cache active during active development blocks.
  
  Skill Loading (~7% waste): Inactive or irrelevant scripts (such as loading complex front-end UI design skills during a pure backend task) create up to 13,500 token overheads per command. Fix: Explicitly disable global skill auto-loading and isolate advanced capabilities to dedicated subdirectories or specific active profiles.
  
  Extended Thinking (~5% waste): Leaving the reasoning engine globally enabled forces Claude to burn 3,000+ reasoning tokens on simple commands (like basic camelCase naming changes) where deep logic is completely unnecessary. Fix: Disable extended thinking globally by default and explicitly toggle it on only for complex architectural or bug-hunting queries.
  
  Git Diff Inflation (~5% waste): Unfiltered or massive git diff outputs being fed into the context window when reviewing changes, rather than targeting specific file modifications. Fix: Configure the workflow to stream only targeted file diffs or summary statistics rather than pulling full repository diff text into active prompts.
  
  Directory Map Re-indexing (~4% waste): Redundant and frequent re-scanning of the entire project directory tree structure instead of utilizing cached file maps. Fix: Adjust system configuration to enforce a strict file-map caching policy that limits full directory re-indexing to manual project structural changes.
  
  File Read Overlap (~4% waste): Repeatedly reading the exact same source files multiple times within a short interaction window because the system lacks a localized, short-term memory of recent file states. Fix: Implement a session-level temporary cache structure that prevents the agent from re-fetching un-mutated target files in consecutive turns.
  
  Debunked Optimization Myths: Lowering costs by switching to a smaller model (like Claude Haiku) for simple tasks only yields a negligible ~3% cost reduction, while aggressively running the /clear command between every minor task proves to be completely counterproductive.
  
  Actionable Optimization Script: To automatically detect and patch these specific inefficiencies within a local workspace, the text recommends running a dedicated optimization script shared by the author.
  
  AI ClaudeCode programming
Visit annotations in context

Tags

AI

ClaudeCode

programming

Annotators

pyxelr

URL

x.com/Mnilax/status/2050261839653556522
a16z.com a16z.com

https://a16z.com/avoiding-death-on-the-yellow-brick-road/

6
1. fxp007 27 May 2026
  
  in Public
  
  The labs understand how valuable these problems are: that's why they're building their own outsourced configuration shops, and why an entire upmarket class of reinforcement learning businesses exist.
  
  大多数人认为大模型实验室会直接解决所有复杂问题，不需要外部帮助。但作者认为实验室明白这些复杂问题的价值，这就是他们为什么建立自己的外部配置服务，以及为什么存在整个高端强化学习企业类别。这承认了实验室在某些领域需要专业合作伙伴，挑战了实验室可以独立解决所有问题的主流观点。
  
  non-consensus ai-ecosystem partnerships
2. fxp007 27 May 2026
  
  in Public
  
  The critical insight in the Oz analogy is that roughly half of any real workflow that is non-agentic carries no lab advantage. They are no better than you are at writing the deterministic software underneath the model layer.
  
  大多数人认为AI将取代所有软件工程工作，人类只需构建AI代理层。但作者认为真实工作流程中约有一半是非代理性的，这部分工作大模型实验室没有任何优势。大模型公司在编写模型层下方的确定性软件方面并不比专业应用公司更好。这为专注于构建复杂工作流程中非AI部分的企业提供了重要机会。
  
  non-consensus ai-limitations software-engineering
3. fxp007 27 May 2026
  
  in Public
  
  The model is fungible underneath; the system of work is not. The next generation of enterprise software is going to be built off the road.
  
  大多数人认为底层AI模型是企业的核心竞争力，模型越好产品越强。但作者认为模型是可替代的，而'工作系统'才是真正的护城河。下一代企业软件将建立在'黄砖路'之外，专注于特定行业的工作流程、数据捕获和治理。这些系统拥有端到端的工作流程所有权，这是大模型实验室无法轻易复制的优势。
  
  non-consensus enterprise-software ai-moats
4. fxp007 27 May 2026
  
  in Public
  
  Running every query through Opus 4.7 is the fastest path to negative gross margins. The best Rest of Oz companies route across tiers of models — frontier models for the hardest tasks, mid-tier for the bulk, smaller custom or fine-tuned models where they've earned the right to use them.
  
  大多数人认为使用最先进的大模型总是最佳选择，能提供最佳结果。但作者认为这是通往负毛利的最快路径。相反，'Oz的其他部分'公司会根据任务难度分层使用不同级别的模型，只为最困难的任务使用前沿模型，为批量任务使用中等模型，为特定工作使用小型定制或微调模型。这种成本优化策略使它们能够提供更具竞争力的价格。
  
  non-consensus cost-optimization ai-economics
5. fxp007 27 May 2026
  
  in Public
  
  The labs are already routing internally — different model classes for different requests, ensembles under the hood. What they can't do is route across vendors, or evaluate a competitor's model for a specific sub-task, or use an open-source fine-tune for the narrow piece where it's actually best.
  
  大多数人认为大模型实验室拥有绝对优势，可以解决所有AI问题。但作者认为实验室在模型选择上存在结构性限制，无法跨供应商评估模型或为特定子任务使用开源微调模型。这为专注于特定领域的企业提供了机会，它们可以选择最适合每个子任务的模型，而不仅限于自家实验室的模型。
  
  non-consensus model-selection ai-limitations
6. fxp007 27 May 2026
  
  in Public
  
  The labs really are coming for a huge swath of the application surface. But 'the application layer' isn't just one homogenous opportunity.
  
  大多数人认为AI将完全吞噬应用层，所有软件都会被大模型取代。但作者认为应用层并非同质化机会，存在不同类型的机遇。作者将应用分为'黄砖路'和'Oz的其他部分'，认为垂直领域的复杂应用不会被大模型完全替代，因为价值不仅来自底层模型能力，还来自特定行业的可信赖、合规和运营化的支撑架构。
  
  non-consensus ai-applications vertical-specialization
Visit annotations in context

Tags

vertical-specialization

partnerships

ai-moats

ai-applications

non-consensus

model-selection

ai-ecosystem

software-engineering

ai-economics

enterprise-software

cost-optimization

ai-limitations

Annotators

fxp007

URL

a16z.com/avoiding-death-on-the-yellow-brick-road/
simonwillison.net simonwillison.net

https://simonwillison.net/2026/May/27/product-market-fit/

1
1. fxp007 27 May 2026
  
  in Public
  
  API revenue is becoming less important. Over the past two years my impression has been that OpenAI made more of their income from subscription revenue while Anthropic made more from their API.
  
  大多数人认为AI公司的主要收入来源是API调用和订阅服务，但作者提出一个反直觉的观点：API收入正变得不那么重要。AI公司正在转向直接面向企业的产品，绕过中间商（如Cursor和GitHub Copilot），这改变了整个AI行业的商业模式和收入结构。
  
  non-consensus ai-business-model revenue-shift
Visit annotations in context

Tags

revenue-shift

ai-business-model

non-consensus

Annotators

fxp007

URL

simonwillison.net/2026/May/27/product-market-fit/
www.linkedin.com www.linkedin.com

(29) Is MCP Really Dead? A History of AI Hype — Told Through the Rise and Fall of a Protocol | LinkedIn

1
1. TylerRick 27 May 2026
  
  in Public
  
  analogy between AI layers and body parts (brain, hands, ...)
  
  AI analogy
Visit annotations in context

Tags

AI

analogy

Annotators

TylerRick

URL

linkedin.com/pulse/mcp-really-dead-history-ai-hype-told-through-rise-fall-kwansub-yun-uoduc/
developer.nvidia.com developer.nvidia.com

https://developer.nvidia.com/blog/extract-more-kernel-performance-with-nvidia-compileiq-auto-tuning/

1
1. fxp007 26 May 2026
  
  in Public
  
  The competitive landscape in AI infrastructure has made this gap impossible to ignore. Teams building custom CUDA, Triton, and Helion kernels are striving for every percentage point of throughput. Until now, there hasn't been a way to fine-tune code generation for a specific workload.
  
  大多数人认为GPU编译器已经提供了足够的优化选项，开发者可以通过手动调整获得最佳性能。但作者指出，在当前AI基础设施的竞争环境下，这种观点已经过时，暗示传统方法无法满足现代AI工作负载的性能需求。
  
  non-consensus gpu-programming ai-infrastructure
Visit annotations in context

Tags

gpu-programming

ai-infrastructure

non-consensus

Annotators

fxp007

URL

developer.nvidia.com/blog/extract-more-kernel-performance-with-nvidia-compileiq-auto-tuning/
arstechnica.com arstechnica.com

https://arstechnica.com/information-technology/2026/05/millions-of-ai-agents-imperiled-by-critical-vulnerability-in-open-source-package/

1
1. fxp007 26 May 2026
  
  in Public
  
  The crux of the vulnerability is that Starlette accepts invalid host header values that cause authenticating apps that use Starlette's request.url object to approve unauthorized access requests.
  
  大多数人认为复杂的AI系统漏洞需要复杂的攻击手段，但作者认为这个漏洞仅通过修改HTTP主机头就能实现，这挑战了'高级系统需要高级攻击'的直觉认知，展示了简单输入验证错误可能导致灾难性后果的反直觉案例。
  
  non-consensus simple-exploit ai-security
Visit annotations in context

Tags

ai-security

simple-exploit

non-consensus

Annotators

fxp007

URL

arstechnica.com/information-technology/2026/05/millions-of-ai-agents-imperiled-by-critical-vulnerability-in-open-source-package/
www.promptarmor.com www.promptarmor.com

https://www.promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files

3
1. fxp007 25 May 2026
  
  in Public
  
  This attack achieved a high success rate against state-of-the-art models, including Claude Opus 4.7.
  
  大多数人认为最新的AI模型已经足够先进可以抵抗基本的注入攻击，但作者证明即使是像Claude Opus 4.7这样的前沿模型也无法抵御简单的间接提示注入，这挑战了人们对先进AI模型安全性的过高期望。
  
  non-consensus ai-vulnerability prompt-injection
2. fxp007 25 May 2026
  
  in Public
  
  Opus 4.7 was more comprehensive in its search for recently edited documents; it expanded exfiltration to include every document used in previous Cowork Copilot sessions that week
  
  大多数人可能认为更先进的AI模型会有更好的安全防护机制，但作者发现更先进的模型反而更容易被利用，能够找到并泄露更多敏感数据，这挑战了'更先进模型=更安全'的普遍认知。
  
  counterintuitive ai-model-risk security-paradox
3. fxp007 25 May 2026
  
  in Public
  
  when the recipient is the active user, these actions execute immediately without requiring human approval (users do not have a setting to modify this behavior)
  
  大多数人认为AI助手执行敏感操作如发送邮件时会要求用户确认，但作者发现Microsoft Copilot Cowork在向活跃用户发送消息时完全绕过了这一安全检查，这违背了人们对AI助手基本安全控制的期望。
  
  non-consensus security-flaw ai-safety
Visit annotations in context

Tags

ai-model-risk

security-paradox

security-flaw

non-consensus

ai-safety

counterintuitive

ai-vulnerability

prompt-injection

Annotators

fxp007

URL

promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files
www.anthropic.com www.anthropic.com

https://www.anthropic.com/news/chris-olah-pope-leo-encyclical

2
1. fxp007 25 May 2026
  
  in Public
  
  Today is just the beginning—the start of a long collaboration between those of us who are building this and those who can see what we, from inside, cannot.
  
  这句话以优美的比喻总结了AI发展需要多方协作的核心观点，强调了外部视角对于内部构建者的重要性。它既表达了谦逊的态度，也指出了AI治理的正确路径，是整篇演讲的点睛之笔。
  
  quotable ai-collaboration insight
2. fxp007 25 May 2026
  
  in Public
  
  If AI models are going to be widespread, what does it look like for humans, families, and the world to flourish?
  
  这个问题简洁而深刻，将AI发展的讨论从技术层面提升到人类福祉的哲学层面。它提醒我们，AI发展的最终目标不应是技术本身，而是如何促进人类的全面发展，这是一个极具启发性的思考方向。
  
  quotable ai-purpose human-flourishing insight
Visit annotations in context

Tags

human-flourishing

ai-collaboration

ai-purpose

insight

quotable

Annotators

fxp007

URL

anthropic.com/news/chris-olah-pope-leo-encyclical

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators