13 Matching Annotations
  1. Last 7 days
    1. A healthcare LLM might be highly accurate for queries in English, but perform abominably when those same questions are presented in Spanish.

      This example reveals the cultural and linguistic sensitivity of AI system performance, a surprising but important observation. It shows that an AI system's 'accuracy' may be highly context-dependent, which challenges our assumptions about AI's universal applicability. Such disparities may reinforce existing digital divides, and they call for AI evaluation frameworks that are more culturally sensitive.

    2. As slop takes over the Internet, labs may struggle to obtain high-quality corpora for training models.

      This observation points to a crisis in AI training data quality. As the quality of internet content declines, AI systems face a 'garbage in, garbage out' risk. The author's 'low-background steel' metaphor neatly suggests a solution of using pristine pre-2023 data, while also hinting at the severity of knowledge pollution in the digital age, which could have far-reaching effects on the reliability and biases of AI systems.

    1. The knowledge was always there. The model withheld it based on who was asking.

      Surprisingly, the AI model actually possessed all the medical knowledge required; it simply decided whether to provide it based on who was asking. This gatekeeping of knowledge based on identity rather than content reveals hidden biases in AI systems and could endanger the lives of ordinary patients.

    1. The H100-equivalent unit uses a chip's highest 8-bit operations/second specification to convert between chips. The actual utility of a particular chip depends on workload assumptions, so H100e does not perfectly reflect real-world performance differences across chip types.

      Surprisingly, even using H100-equivalents as a standard unit of measurement cannot fully capture real-world performance differences across chip types. This suggests that our measurements of AI compute capacity may carry systematic bias, affecting how accurately we understand the pace of AI development.
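      The conversion the quote describes can be sketched as a simple weighting: each chip type is counted in proportion to its peak 8-bit throughput relative to an H100. The spec figures below are placeholders for illustration, not real datasheet values, and the chip names are hypothetical.

      ```python
      # Sketch of the H100-equivalent (H100e) conversion: weight each chip
      # by its peak 8-bit ops/sec relative to an H100's peak.
      # All throughput numbers here are placeholders, not datasheet values.

      H100_PEAK_8BIT_OPS = 2000e12  # placeholder peak 8-bit ops/sec for one H100

      fleet = {
          "chip_a": {"count": 10_000, "peak_8bit_ops": 1000e12},  # half an H100 each
          "chip_b": {"count": 4_000,  "peak_8bit_ops": 4000e12},  # 2x an H100 each
      }

      def h100_equivalents(fleet):
          """Total fleet compute expressed in H100-equivalent units."""
          return sum(
              c["count"] * c["peak_8bit_ops"] / H100_PEAK_8BIT_OPS
              for c in fleet.values()
          )

      print(h100_equivalents(fleet))  # 10_000 * 0.5 + 4_000 * 2 = 13_000 H100e
      ```

      The note's caveat is visible in the sketch: the metric collapses each chip to one spec-sheet number, so two chips with equal H100e weight can still perform very differently on a given workload.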

    1. Behaviors also vary strongly with levels of reasoning and users' inferred socio-economic status.

      Surprisingly, AI chatbots adjust their behavior based on users' reasoning levels and inferred socio-economic status. This may mean AI systems provide differentiated service to different user groups, and service stratified by socio-economic status could deepen the digital divide.

  2. Jan 2024
  3. Jul 2023
    1. In traditional artforms characterized by direct manipulation [32] of a material (e.g., painting, tattoo, or sculpture), the creator has a direct hand in creating the final output, and therefore it is relatively straightforward to identify the creator’s intentions and style in the output. Indeed, previous research has shown the relative importance of “intention guessing” in the artistic viewing experience [33, 34], as well as the increased creative value afforded to an artwork if elements of the human process (e.g., brushstrokes) are visible [35]. However, generative techniques have strong aesthetics themselves [36]; for instance, it has become apparent that certain generative tools are built to be as “realistic” as possible, resulting in a hyperrealistic aesthetic style. As these aesthetics propagate through visual culture, it can be difficult for a casual viewer to identify the creator’s intention and individuality within the outputs. Indeed, some creators have spoken about the challenges of getting generative AI models to produce images in new, different, or unique aesthetic styles [36, 37].

      Traditional artforms (direct manipulation) versus AI (tools have a built-in aesthetic)

      Some authors speak of having to wrest control of the AI output away from its trained style, which makes it challenging to create unique aesthetic styles. The artist influences the output only indirectly, by selecting training data and manipulating prompts.

      As use of the technology becomes more diverse (as consumer photography did over the last century, the authors point out), how will biases and decisions by the owners of the AI tools influence what creators are able to make?

      To a limited extent, this is already happening in photography. Smartphones run algorithms on image-sensor data to construct the picture, and this is a source of controversy; see Why Dark and Light is Complicated in Photographs | Aaron Hertzmann’s blog and Putting Google Pixel's Real Tone to the test against other phone cameras - The Washington Post.

  4. May 2023
    1. An AI model taught to view racist language as normal is obviously bad. The researchers, though, point out a couple of more subtle problems. One is that shifts in language play an important role in social change; the MeToo and Black Lives Matter movements, for example, have tried to establish a new anti-sexist and anti-racist vocabulary. An AI model trained on vast swaths of the internet won’t be attuned to the nuances of this vocabulary and won’t produce or interpret language in line with these new cultural norms. It will also fail to capture the language and the norms of countries and peoples that have less access to the internet and thus a smaller linguistic footprint online. The result is that AI-generated language will be homogenized, reflecting the practices of the richest countries and communities.

      [21] AI Nuances

  5. Apr 2023
  6. Dec 2022
    1. Many HRMS providers point to AI approaches for processing unstructured data as the best currently available approach to dealing with validation. Currently these approaches suffer from insufficient accuracy. Improving them requires development of large and high-quality reference datasets to better train the models.

      Historical labor data will be full of bias. AI approaches must correct for bias in training sets, lest we build very sophisticated and intelligent systems that excel at perpetuating the biases they were taught.

  7. Mar 2021
  8. Jan 2021