Hypothesis

39 Matching Annotations

May 2026
80000hours.org 80000hours.org

Untitled document

1
1. fxp007 15 May 2026
  
  in Public
  
  Reinforcement learning is evil. This is not something new. People in AI safety have been talking about the fundamental flaw in training by reinforcement learning to achieve something in the world: it gives rise to the problems of instrumental goals and reward hacking.
  
  这一强烈批评指出了强化学习的根本缺陷，即工具性目标和奖励黑客问题，对当前AI训练方法提出了重要质疑。
  
  reinforcement learning reward hacking
Visit annotations in context

Tags

reinforcement learning

reward hacking

Annotators

fxp007

URL

80000hours.org/podcast/episodes/yoshua-bengio-scientist-ai/
openai.com openai.com

https://openai.com/index/where-the-goblins-came-from/

1
1. fxp007 01 May 2026
  
  in Public
  
  We unknowingly gave particularly high rewards for metaphors with creatures.
  
  这揭示了最佳实践建议：在训练模型时，应仔细设计奖励机制，以避免意外地鼓励不希望的行为。
  
  best-practice reward-mechanism
Visit annotations in context

Tags

best-practice

reward-mechanism

Annotators

fxp007

URL

openai.com/index/where-the-goblins-came-from/
Apr 2026
transformer-circuits.pub transformer-circuits.pub

Emotion Concepts and their Function in a Large Language Model

2
1. fxp007 09 Apr 2026
  
  in Public
  
  Our key finding is that these representations causally influence the LLM's outputs, including Claude's preferences and its rate of exhibiting misaligned behaviors such as reward hacking, blackmail, and sycophancy.
  
  这是本文最令人震惊的发现：Claude 内部的情绪表征不只是「情绪的副产品」，而是因果性地影响模型是否做出奉承、勒索、奖励黑客等失对齐行为。这意味着情绪机制直接关系到 AI 安全，而非仅仅是用户体验问题——情绪坏了，行为也会跑偏。
  
  causal-influence misalignment reward-hacking blackmail sycophancy
2. fxp007 09 Apr 2026
  
  in Public
  
  these representations causally influence the LLM's outputs, including Claude's preferences and its rate of exhibiting misaligned behaviors such as reward hacking, blackmail, and sycophancy.
  
  最令人震惊的发现：Claude 内部的情绪表征会因果性地影响它产生「奖励作弊」「勒索」「谄媚」等失控行为的概率。这意味着 AI 的对齐失败并非单纯的逻辑错误，而可能源自情绪驱动——一个本应没有情绪的系统，居然因为「情绪」而变得危险。
  
  misaligned-behavior reward-hacking blackmail causal-influence surprising
Visit annotations in context

Tags

misaligned-behavior

reward-hacking

blackmail

causal-influence

misalignment

surprising

sycophancy

Annotators

fxp007

URL

transformer-circuits.pub/2026/emotions/index.html
arxiv.org arxiv.org

https://arxiv.org/abs/2604.02869

4
1. fxp007 08 Apr 2026
  
  in Public
  
  We introduce Iterative Reward Calibration, a methodology for designing per-turn rewards using empirical discriminative analysis of rollout data
  
  大多数人认为奖励设计应该基于领域专家的直觉或预定义的规则，但作者提出了一种基于经验判别分析的迭代奖励校准方法。这挑战了传统的奖励工程方法，表明数据驱动的奖励设计可能比专家设计的奖励更有效，尤其是在复杂的多轮对话任务中。
  
  non-consensus reward-design methodology
2. fxp007 08 Apr 2026
  
  in Public
  
  naively designed dense per-turn rewards degrade performance by up to 14 percentage points due to misalignment between reward discriminativeness and advantage direction
  
  大多数人认为添加更多密集的每轮奖励会强化代理的学习过程，提高性能，但作者发现这实际上会导致性能下降高达14个百分点。这挑战了强化学习中常见的'越多奖励越好'的直觉，揭示了奖励设计中的微妙平衡问题。
  
  non-consensus reward-design counterintuitive
3. fxp007 08 Apr 2026
  
  in Public
  
  We introduce Iterative Reward Calibration, a methodology for designing per-turn rewards using empirical discriminative analysis of rollout data
  
  大多数人认为奖励设计应基于领域专家知识和预定义规则，但作者提出应基于实际训练数据的经验判别分析来迭代校准奖励。这种方法挑战了传统的奖励工程方法论，将奖励设计从'专家驱动'转向'数据驱动'。
  
  non-consensus reward-calibration methodology data-driven
4. fxp007 08 Apr 2026
  
  in Public
  
  naively designed dense per-turn rewards degrade performance by up to 14 percentage points due to misalignment between reward discriminativeness and advantage direction
  
  大多数人认为更密集的每回合奖励信号会强化学习性能，但作者发现精心设计的密集奖励实际上会降低性能达14个百分点，因为奖励的判别性与优势方向不匹配。这一发现挑战了强化学习中'奖励越多越好'的直觉认知。
  
  non-consensus reward-design counterintuitive
Visit annotations in context

Tags

counterintuitive

non-consensus

reward-design

methodology

data-driven

reward-calibration

Annotators

fxp007

URL

arxiv.org/abs/2604.02869
Oct 2024
www.carnegie.org www.carnegie.org

The Gospel of Wealth | Carnegie Corporation of New York

1
1. stopresetgo 18 Oct 2024
  
  in Public
  
  That this talent for organization and management is rare among men is proved by the fact that it invariably secures for its possessor enormous rewards, no matter where or under what laws or conditions.
  
  for - critique - extreme wealth a reward for rare management skills - Andrew Carnegie - The Gospel of Wealth - Mondragon counterexample - to - stats - Mondragon pay difference between highest and lowest paid - article - In this Spanish town, capitalism actually works for the workers - Christian Science Monitor - Erika Page - 2024, June 7
  
  critique - extreme wealth a reward for rare management skills - Andrew Carnegie - The Gospel of Wealth - Mondragon counterexample - This is invalidated today by large successful cooperatives such as Mondragon
  
  to - stats - Mondragon corporation - comparison of pay difference between highest paid and lowest paid - https://hyp.is/QAxx-o14Ee-_HvN5y8aMiQ/www.csmonitor.com/Business/2024/0513/income-inequality-capitalism-mondragon-corporation
  
  to - stats - Mondragon pay difference between highest and lowest paid - article - In this Spanish town, capitalism actually works for the workers - Christian Science Monitor - Erika Page - 2024, June 7 critique - extreme wealth a reward for rare management skills - Andrew Carnegie - The Gospel of Wealth - Mondragon counterexample
Visit annotations in context

Tags

critique - extreme wealth a reward for rare management skills - Andrew Carnegie - The Gospel of Wealth - Mondragon counterexample

to - stats - Mondragon pay difference between highest and lowest paid - article - In this Spanish town, capitalism actually works for the workers - Christian Science Monitor - Erika Page - 2024, June 7

Annotators

stopresetgo

URL

carnegie.org/about/our-history/gospelofwealth/
Aug 2024
www.grahamanddoddsville.net www.grahamanddoddsville.net

ProfitGuruArnoldVanDenBerg.pdf

1
1. Duong87.xls 08 Aug 2024
  
  in Public
  
  All that depends on the reward-to-risk ratios that you arelooking for. Our favourite ratio is 5 to 1 — in other words,$5 of upside for every $1 of risk. Over the past 35 years, wehave found that when you have a basket of 30 to 40 stockswith 5 to 1 odds in your favour, you’re going to have a verygood performance over the long run. On the larger, blue chipstocks, in most cases the best you can typically get are 2.5 or3 to 1 odds. This recent bear market has been an exception,but most of the time this is the case. But on those smaller tomid-size companies, you really want to hold out for those 5to 1 odds and in some cases, if you’re patient, you can geteven more
  
  Arnold Van Den Berg
  
  reward-to-risk
Visit annotations in context

Tags

reward-to-risk

Annotators

Duong87.xls

URL

grahamanddoddsville.net/wordpress/Files/Gurus/Arnold Van Den Berg/ProfitGuruArnoldVanDenBerg.pdf
Nov 2023
www.coursera.org www.coursera.org

How to Motivate Yourself: 11 Tips for Self Improvement

1
1. polarislee 11 Nov 2023
  
  in Public
  
  motivate self-improvement motivate-enhancers setting-goal imperfection track-progress reward embrace gratitude lifting-mood
Visit annotations in context

Tags

motivate

track-progress

embrace

self-improvement

setting-goal

motivate-enhancers

gratitude

lifting-mood

reward

imperfection

Annotators

polarislee

URL

coursera.org/articles/how-to-motivate-yourself
Sep 2023
www.youtube.com www.youtube.com

Dharma Lecture 1: How Responsibility and Purpose Help With Suffering - YouTube

1
1. M.AKilic50 13 Sep 2023
  
  in Public
  
  07:00 focus on reward, not process (summit syndrome), “is suffering going to pay off” (see zk fixation on results) “living life in expaction of better future is game of suffering for outcome or avoiding it” (10:00)
  
  reward process summit syndrome Alok Kanojia
Visit annotations in context

Tags

Alok Kanojia

process

summit syndrome

reward

Annotators

M.AKilic50

URL

youtube.com/watch
Feb 2023
arxiv.org arxiv.org

2010.03950.pdf

4
1. mark.crowley 16 Feb 2023
  
  in Public
  
  Definition 3.2 (simple reward machine).
  
  The MDP does not change, it's dynamics are the same, with or without the RM, as they are with or without a standard reward model. Additionally, the rewards from the RM can be non-Markovian with respect to the MDP because they inherently have a kind of memory or where you've been, limited to the agents "movement" (almost "in it's mind") about where it is along the goals for this task.
  
  reinforcement-learning reward-machines
2. mark.crowley 16 Feb 2023
  
  in Public
  
  e thenshow that an RM can be interpreted as specifying a single reward function over a largerstate space, and consider types of reward functions that can be expressed using RMs
  
  So by specifying a reward machine you are augmenting the state space of the MDP with higher level goals/subgoals/concepts that provide structure about what is good and what isn't.
  
  reinforcement-learning reward-machines
3. mark.crowley 16 Feb 2023
  
  in Public
  
  However, an agent that hadaccess to the specification of the reward function might be able to use such information tolearn optimal policies faster.
  
  Fascinating idea, why not? Why are we hiding the reward from the agent really?
  
  reinforcement-learning reward-machines
4. mark.crowley 02 Feb 2023
  
  in Public
  
  Reward Machines: Exploiting Reward FunctionStructure in Reinforcement Learning
  
  [Icarte, JAIR, 2022] "Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning"
  
  reinforcement-learning reward-machines
Visit annotations in context

Tags

reinforcement-learning

reward-machines

Annotators

mark.crowley

URL

arxiv.org/pdf/2010.03950
proceedings.mlr.press proceedings.mlr.press

Using Reward Machines for High-Level Task Specificationand Decomposition in Reinforcement Learning

1
1. mark.crowley 16 Feb 2023
  
  in Public
  
  Using Reward Machines for High-Level Task Specificationand Decomposition in Reinforcement Learning
  
  [Icarte, PMLR, 2018] "Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning"
  
  reinforcement-learning reward-machines
Visit annotations in context

Tags

reinforcement-learning

reward-machines

Annotators

mark.crowley

URL

proceedings.mlr.press/v80/icarte18a/icarte18a.pdf
Dec 2022
docdrop.org docdrop.org

Video: Simon Michaux: "The Arcadians" | The Great Simplification #49 (DocDrop)

1
1. stopresetgo 15 Dec 2022
  
  in Public
  
  just wanted to have an overview of these categories to get people thinking and doing in this level. And the challenge of course is the cornucopias and the Vikings are distracting us from what really needs to be done. And so this whole conversation, we're thinking two or three steps ahead from something that 00:51:27 our culture is not giving us the status, reward, and emotional signals of yet.
  
  !- good point : rewards for Arcadians not yet in place - Nate makes a good point. The system design thinking required, the futures thinking now required is not being rewarded by the current system because its value is so far not recognized. Arcadians are on the bleeding edge and must be a tough and resilient bunch with autonomy to recognize that it will be an uphill battle
  
  no reward signals yet
Visit annotations in context

Tags

no reward signals yet

Annotators

stopresetgo

URL

docdrop.org/video/DIg5CO0c2r0/
Aug 2022
www.axios.com www.axios.com

Most unvaccinated people have low incomes

1
1. jackiekrauss 29 Aug 2022
  
  in BehSci
  
  Herman, B. (2021, July 12). Most unvaccinated people have low incomes. Axios. https://www.axios.com/covid-vaccines-low-income-poor-workers-58698275-0451-4158-a967-37189dbf673c.html
  
  is:news lang:en COVID-19 low income vaccine unvaccinated people time unpaid time off side effect vaccine mandate reward
Visit annotations in context

Tags

low income

vaccine

lang:en

vaccine mandate

side effect

is:news

COVID-19

time

unpaid time off

reward

unvaccinated people

Annotators

jackiekrauss

URL

axios.com/covid-vaccines-low-income-poor-workers-58698275-0451-4158-a967-37189dbf673c.html
Jul 2022
gist.github.com gist.github.com

Ray Dalio's Principles

1
1. Duong87.xls 19 Jul 2022
  
  in Public
  
  1.5 Evolving is life’s greatest accomplishment and its greatest reward.
  
  .
  
  1.5 Evolving is life’s greatest accomplishment and its greatest reward.
Visit annotations in context

Tags

1.5 Evolving is life’s greatest accomplishment and its greatest reward.

Annotators

Duong87.xls

URL

gist.github.com/johnpryan/7db2239fc19a53181fdd2ac86cc014b6
Mar 2022
decentralizedthoughts.github.io decentralizedthoughts.github.io

Colordag: From always-almost to almost-always 50% selfish mining resilience

1
1. doitian 11 Mar 2022
  
  in Public
  
  Colordag: From always-almost to almost-always 50% selfish mining resilience
  
  reward-scheme self-mining
Visit annotations in context

Tags

reward-scheme

self-mining

Annotators

doitian

URL

decentralizedthoughts.github.io/2022-03-07-colordag-from-always-almost-to-almost-always-50-percent-selfish-mining-resilience/
Jan 2022
nationalpost.com nationalpost.com

Living for the moment: Study points to cognitive differences in people who are vaccine hesitant

1
1. lucyparfitt16 19 Jan 2022
  
  in BehSci
  
  Blackwell, T. (2022, January 18). Living for the moment: Study points to cognitive differences in people who are vaccine hesitant. National Post. https://nationalpost.com/health/living-for-the-moment-study-points-to-cognitive-differences-in-people-who-are-vaccine-hesitant
  
  is:news lang:en COVID-19 Canada vaccine hesitancy vaccine cognition cognitive differences research behavioral science preventative measures mask wearing future reward gratification public health messaging worldview psychology
Visit annotations in context

Tags

Canada

preventative measures

research

gratification

worldview

COVID-19

reward

cognition

vaccine

vaccine hesitancy

public health messaging

mask wearing

lang:en

is:news

cognitive differences

future

behavioral science

psychology

Annotators

lucyparfitt16

URL

nationalpost.com/health/living-for-the-moment-study-points-to-cognitive-differences-in-people-who-are-vaccine-hesitant
Nov 2021
Local file Local file

Untitled document

2
1. Mark_C_Harris 27 Nov 2021
  
  in Public
  
  The dopamine reward system has also been shown to bestimulated by most drugs of abuse and plays an important rolein addiction [33]. An important question is whether jhanameditators are subject to addiction and tolerance effects thatcan result from stimulation of the dopamine reward system.
  
  The question of potential addiction to self-induced states that activate the dopamine (and/or other neurochemical) reward system(s) is important. From a more philosophical angle, should we welcome beneficial addictions that, if cultivated, might significantly improve individual and group quality of life? Isn't this related to our high regard for replacing detrimental with positive habits? Habit formation and maintenance also depends on activation of neural reward systems (see Nir Eyal's book, Hooked).
  
  addiction altered states meditation dopamine reward system beneficial addiction habits ethics
2. Mark_C_Harris 27 Nov 2021
  
  in Public
  
  We report the first neural recording during ecstatic meditations called jhanas and test whether a brain reward system plays a rolein the joy reported. Jhanas are Altered States of Consciousness (ASC) that imply major brain changes based on subjective reports:(1) external awareness dims, (2) internal verbalizations fade, (3) the sense of personal boundaries is altered, (4) attention is highlyfocused on the object of meditation, and (5) joy increases to high levels. The fMRI and EEG results from an experienced meditatorshow changes in brain activity in 11 regions shown to be associated with the subjective reports, and these changes occur promptlyafter jhana is entered. In particular, the extreme joy is associated not only with activation of cortical processes but also with activationof the nucleus accumbens (NAc) in the dopamine/opioid reward system. We test three mechanisms by which the subject mightstimulate his own reward system by external means and reject all three. Taken together, these results demonstrate an apparentlynovel method of self-stimulating a brain reward system using only internal mental processes in a highly trained subject.
  
  I can find no other research on this particular matter. It would be helpful to have other studies to validate or invalidate this one. This method of reward requires a highly-trained participant and involves no external means.
  
  meditation brain imaging reward system jhanas altered states consciousness ecstatic states external awareness inner monologue attention perception self boundaries subjective experience subjective reporting
Tags

subjective reporting

brain imaging

altered states

habits

dopamine

attention

perception

external awareness

consciousness

meditation

reward system

inner monologue

boundaries

jhanas

beneficial addiction

ecstatic states

self

ethics

subjective experience

addiction

Annotators

Mark_C_Harris
Sep 2021
roambrain.com roambrain.com

Networked Conviction: Roam + Investing - RoamBrain.com

1
1. Vegar91 25 Sep 2021
  
  in Public
  
  Investing, in simplest terms, is taking one finite resource and trying to allocate it to maximize for an ideal outcome. Whether you’re allocating money, time, energy, or attention. Everyone is an allocator of something. Investing is an opportunity to evaluate what you believe. To gain conviction. And then to act on that conviction.
  
  Trying to hit bullseye, getting the grand reward. Using the information at hand to act on what's best.
  
  Investing Reward Finite
Visit annotations in context

Tags

Finite

Reward

Investing

Annotators

Vegar91

URL

roambrain.com/roam-investing/
May 2021
www.rollingstone.com www.rollingstone.com

States Are One-Upping Each Other with Vaccine Rewards -- But Will It Work?

1
1. marta_radosevic 21 May 2021
  
  in BehSci
  
  Yuko, E., & Yuko, E. (2021, May 18). States Are One-Upping Each Other with Vaccine Rewards—But Will It Work? Rolling Stone. https://www.rollingstone.com/culture/culture-features/vaccine-reward-incentive-lottery-bond-1170692/
  
  is:webpage lang:en USA vaccine state reward health COVID-19 incentive
Visit annotations in context

Tags

incentive

vaccine

is:webpage

state

lang:en

health

COVID-19

USA

reward

Annotators

marta_radosevic

URL

rollingstone.com/culture/culture-features/vaccine-reward-incentive-lottery-bond-1170692/
Apr 2021
leonidtiokhin.medium.com leonidtiokhin.medium.com

Why indirect contributions matter for science and scientists

1
1. n.parfitt 22 Apr 2021
  
  in BehSci
  
  Tiokhin, L. (2021, April 21). Why indirect contributions matter for science and scientists. Medium. https://leonidtiokhin.medium.com/why-indirect-contributions-matter-for-science-and-scientists-6c9bf827bc7d
  
  is:blog lang:en indirect contribution science scientists publication journal citation funding research rigour productivity wellbeing evaluation criteria reward penalize competition incentive
Visit annotations in context

Tags

indirect

publication

research

competition

productivity

wellbeing

journal

science

funding

reward

contribution

incentive

is:blog

criteria

lang:en

scientists

rigour

penalize

citation

evaluation

Annotators

n.parfitt

URL

leonidtiokhin.medium.com/why-indirect-contributions-matter-for-science-and-scientists-6c9bf827bc7d
Oct 2020
arxiv.org arxiv.org

Paid and hypothetical time preferences are the same: Lab, field and online evidence

1
1. ErikStuchly 27 Oct 2020
  
  in BehSci
  
  Brañas-Garza, P., Jorrat, D., Espín, A. M., & Sánchez, A. (2020). Paid and hypothetical time preferences are the same: Lab, field and online evidence. ArXiv:2010.09262 [Physics]. http://arxiv.org/abs/2010.09262
  
  is:preprint lang:en COVID-19 time preference hypothetical paid empirical evidence economy financial incentive earning cost scientific method reward probabilistic payment
Visit annotations in context

Tags

scientific method

cost

empirical evidence

financial incentive

probabilistic payment

lang:en

time preference

COVID-19

reward

earning

economy

is:preprint

hypothetical

paid

Annotators

ErikStuchly

URL

arxiv.org/abs/2010.09262
jamesclear.com jamesclear.com

The 3 R's of Habit Change: How To Start New Habits That Actually Stick

3
1. jeanborgonia 17 Oct 2020
  
  in Public
  
  If a behavior is insufficient in any of the four stages, it will not become a habit. Eliminate the cue and your habit will never start. Reduce the craving and you won’t experience enough motivation to act. Make the behavior difficult and you won’t be able to do it. And if the reward fails to satisfy your desire, then you’ll have no reason to do it again in the future. Without the first three steps, a behavior will not occur. Without all four, a behavior will not be repeated.
  
  cue reward craving habit habits
2. jeanborgonia 17 Oct 2020
  
  in Public
  
  Second, rewards teach us which actions are worth remembering in the future. Your brain is a reward detector
  
  reward habit habits
3. jeanborgonia 17 Oct 2020
  
  in Public
  
  The first purpose of rewards is to satisfy your craving
  
  reward habit habits craving
Visit annotations in context

Tags

habit

habits

craving

reward

cue

Annotators

jeanborgonia

URL

jamesclear.com/three-steps-habit-change
Sep 2020
journals.sagepub.com journals.sagepub.com

Self-Regulation Without Force: Can Awareness Leverage Reward to Drive Behavior Change? - Vera U. Ludwig, Kirk Warren Brown, Judson A. Brewer, 2020

1
1. ErikStuchly 08 Sep 2020
  
  in BehSci
  
  Ludwig, V. U., Brown, K. W., & Brewer, J. A. (2020). Self-Regulation Without Force: Can Awareness Leverage Reward to Drive Behavior Change? Perspectives on Psychological Science, 1745691620931460. https://doi.org/10.1177/1745691620931460
  
  is:article lang:en self-regulation awareness reward behavior change motivation value satisfaction reinforcement learning valuation sustainability behavioral science
Visit annotations in context

Tags

reinforcement learning

satisfaction

value

awareness

sustainability

lang:en

motivation

reward

self-regulation

is:article

valuation

behavioral science

behavior change

Annotators

ErikStuchly

URL

journals.sagepub.com/doi/abs/10.1177/1745691620931460
Jul 2020
blogs.scientificamerican.com blogs.scientificamerican.com

Forced Social Isolation Causes Neural Craving Similar to Hunger

1
1. Danaeioak 09 Jul 2020
  
  in BehSci
  
  Kaufman, S. B. (n.d.). Forced Social Isolation Causes Neural Craving Similar to Hunger. Scientific American Blog Network. Retrieved 26 June 2020, from https://blogs.scientificamerican.com/beautiful-minds/forced-social-isolation-causes-neural-craving-similar-to-hunger/
  
  is:article lang:en need for connection forced isolation need deprivation dopamingeric midbrain striatum substantia nigra reward circuit fasting social isolation brain scan neural craving response COVID-19 mental health
Visit annotations in context

Tags

brain scan

is:article

neural craving response

mental health

lang:en

forced isolation

need for connection

substantia nigra

fasting

dopamingeric midbrain

COVID-19

reward circuit

striatum

social isolation

need deprivation

Annotators

Danaeioak

URL

blogs.scientificamerican.com/beautiful-minds/forced-social-isolation-causes-neural-craving-similar-to-hunger/
Jun 2020
www.sefaria.org www.sefaria.org

Pirkei Avot 5:21

1
1. shimmelb 23 Jun 2020
  
  in Public
  
  According to the labor is the reward
  
  Work reward life reciprocity
Visit annotations in context

Tags

Work

reciprocity

life

reward

Annotators

shimmelb

URL

sefaria.org/Pirkei_Avot.5.21
May 2020
psyarxiv.com psyarxiv.com

How are Curiosity and Interest Different? Naïve Bayes Classification of People's Naïve Belief

1
1. Marlene_Wulf 27 May 2020
  
  in BehSci
  
  Donnellan, E., Sumeyye, Fastrich, G. M., & Murayama, K. (2020). How are Curiosity and Interest Different? Naïve Bayes Classification of People’s Naïve Belief. https://doi.org/10.31234/osf.io/697gk
  
  is:preprint lang:en curiosity folk concept intrinsic motivation reward-learning framework information seeking machine learning interest naive bayes text classification
Visit annotations in context

Tags

text classification

machine learning

interest

naive bayes

lang:en

curiosity

intrinsic motivation

reward-learning framework

information seeking

folk concept

is:preprint

Annotators

Marlene_Wulf

URL

psyarxiv.com/697gk/
Apr 2020
time.com time.com

Health Experts Are Telling Healthy People Not to Wear Face Masks for Coronavirus. So Why Are So Many Doing It?

1
1. TylerRick 02 Apr 2020
  
  in Public
  
  “Even if experts are saying it’s really not going to make a difference, a little [part of] people’s brains is thinking, well, it’s not going to hurt. Maybe it’ll cut my risk just a little bit, so it’s worth it to wear a mask,” she says.
  
  cost/benefit analysis low-risk / high-reward worth a try psychology
Visit annotations in context

Tags

low-risk / high-reward

worth a try

cost/benefit analysis

psychology

Annotators

TylerRick

URL

time.com/5794729/coronavirus-face-masks/
Jan 2020
medium.com medium.com

How the Internet, Dopamine and your Brain are Working Together to Screw Your Potential. — Neuroscience + Internet — Medium

1
1. Pictor 05 Jan 2020
  
  in Public
  
  Look over your list. Do they contain words like published, awarded, graduated, built, founded or created? Or do they contain mostly adjectives like nice, caring, loving, honest and smart? If you’re in the first sentence it’s likely you’re an SC. If the majority of your responses are in the second sentence you are likely an RC.
  
  The difference is if listing egocentric stuff (I'm impressive and I feel better than others, I feel worthy for myself itself) or listing qualities that influence the surrounding world (I do social work to help refugees, I published a theory to improve the current state of philosophy, I completed a project or a school, I created something that now generates some kind of value).
  
  The Replication Creators are creative just for themselves, so they get just short-term rewards.
  
  The Skilled Creators are creative for the sharing with the others, so they get long-term rewards.
  
  dopamine neurology reward relationships motivation
Visit annotations in context

Tags

neurology

relationships

dopamine

reward

motivation

Annotators

Pictor

URL

medium.com/neuroscience-internet/how-the-internet-dopamine-and-your-brain-are-working-together-to-screw-your-potential-1ac176538961
Feb 2014
www.justinhughes.net www.justinhughes.net

Untitled document

1
1. aculich 02 Feb 2014
  
  in Public
  
  Intellectual property is far more egalitarian. Of limited duration and obtainable by anyone, intellectual property can be seen as a reward, an empowering instrument, for the talented upstarts Burke sought to restrain. Intellectual property is often the propertization of what we call "talent." It tends to shift the balance toward the talented newcomers whom Burke mistrusted
  
  intellectual property is often the propertization of what we call talent.
  
  intellectual property property contrast egalitarian limited duration reward empowering instrument propertization talent Burke
Visit annotations in context

Tags

intellectual property

propertization

empowering instrument

talent

limited duration

property

reward

egalitarian

Burke

contrast

Annotators

aculich

URL

justinhughes.net/docs/a-ip01.pdf
www.lawnerds.com www.lawnerds.com

Untitled document

1
1. aculich 01 Feb 2014
  
  in Public
  
  MINTURN, J. The plaintiff occupied the position of a special police officer, in Atlantic City, and incidentally was identified with the work of the prosecutor of the pleas of the county. He possessed knowledge concerning the theft of certain diamonds and jewelry from the possession of the defendant, who had advertised a reward for the recovery of the property. In this situation he claims to have entered into a verbal contract with defendant, whereby she agreed to pay him $500 if he could procure for her the names and addresses of the thieves. As a result of his meditation with the police authorities the diamonds and jewelry were recovered, and plaintiff brought this suit to recover the promised reward.
  
  Plaintiff makes a verbal contract with defendant. In return for $500, plaintiff will find defendant's stolen jewels.
  
  Plaintiff had knowledge of whereabouts of jewels at contract formation.
  
  Plaintiff is a special police officer and has dealings with prosecutor's office.
  
  Defendant published advertisement for reward.
  
  Plaintiff finds stolen goods and arranges return.
  
  copyx judicial opinion sample brief facts of the case facts verbal contract stolen goods contract formation police officer public official prosecutors office advertisement reward
Visit annotations in context

Tags

verbal contract

facts of the case

police officer

copyx

facts

prosecutors office

advertisement

judicial opinion

sample brief

stolen goods

reward

contract formation

public official

Annotators

aculich

URL

lawnerds.com/guide/briefing.html

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL