2 Matching Annotations
  1. Last 7 days
    1. The 100:1 loss trick. In a 33-token sequence, only 2 positions change per step, so a model that simply copies its input scores ~94% token accuracy while learning nothing. Without adjusting the loss accordingly (nothing exotic, just weighting output tokens differently), copying is the easy optimum; up-weighting the positions that actually change by 100× forces the model to learn the computation we want it to learn.

      Most people assume all output positions should be treated equally during training, but the author found that assigning 100× weight to the output positions that actually change forces the model to learn the computation rather than simply copy. This challenges the standard training recipe and suggests that loss-function design can matter more than architecture choice.

  2. May 2026
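
The 100:1 trick from the first annotation can be sketched as a per-position weighted cross-entropy. This is a minimal NumPy illustration, not the author's actual code; the function name, shapes, and the choice of normalizing by the weight sum are assumptions.

```python
import numpy as np

def weighted_ce_loss(logits, targets, changed_mask, weight=100.0):
    """Cross-entropy where positions flagged in `changed_mask` (the ones
    that actually change this step) get `weight`; all others get 1.0.
    Hypothetical sketch: logits is (seq_len, vocab), targets is (seq_len,).
    """
    # Numerically stable log-softmax over the vocab axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    log_probs = z - np.log(np.exp(z).sum(axis=-1, keepdims=True))
    # Negative log-likelihood of the target token at each position.
    nll = -log_probs[np.arange(len(targets)), targets]
    # 100x weight on changed positions, 1x on copy positions.
    w = np.where(changed_mask, weight, 1.0)
    return (w * nll).sum() / w.sum()
```

With 2 changed positions out of 33, a copier that is right everywhere except the changed positions still gets 31/33 ≈ 94% token accuracy, but under this loss the 2 changed positions carry 2·100 / (2·100 + 31) ≈ 87% of the gradient signal, so copying no longer pays.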