4 Matching Annotations
  1. Nov 2022
    1. Extractive summarization may be regarded as a contextual bandit as follows. Each document is a context, and each ordered subset of a document's sentences is a different action

      We can represent extractive summarization as a contextual bandit by treating each document as the context and each ordered subset of its sentences as an action the agent could take.

    2. A bandit is a decision-making formalization in which an agent repeatedly chooses one of several actions, and receives a reward based on this choice.

      Definition of a contextual bandit: an agent that repeatedly chooses one of several actions and receives a reward based on this choice (a minimal sketch of this setup follows below).
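
      A minimal sketch of this framing, not taken from the annotated paper: the document is the context, each ordered subset of its sentences is an action, and a toy word-overlap reward plus an epsilon-greedy agent stand in for whatever reward and learning method the paper actually uses.

      ```python
      # Hypothetical sketch: extractive summarization as a contextual bandit.
      # The reward function and epsilon-greedy agent are illustrative assumptions.
      import random
      from itertools import permutations

      def candidate_summaries(sentences, k=2):
          """Each ordered subset of k sentences is one action."""
          return list(permutations(sentences, k))

      def reward(summary, reference):
          """Toy reward: fraction of reference words covered by the summary."""
          summary_words = set(" ".join(summary).lower().split())
          reference_words = set(reference.lower().split())
          return len(summary_words & reference_words) / len(reference_words)

      def epsilon_greedy(values, epsilon=0.1):
          """Explore with probability epsilon, otherwise exploit the best estimate."""
          if random.random() < epsilon:
              return random.randrange(len(values))
          return max(range(len(values)), key=lambda i: values[i])

      # One document = one context; its candidate sentence orderings are the actions.
      document = [
          "Bandit agents trade off exploration and exploitation.",
          "Extractive summaries reuse sentences from the source document.",
          "A reward signal tells the agent which sentences to keep.",
      ]
      reference = "extractive summaries reuse sentences and a reward signal guides the agent"

      actions = candidate_summaries(document, k=2)
      values = [0.0] * len(actions)   # running reward estimate per action
      counts = [0] * len(actions)

      for _ in range(500):            # repeated choices with bandit feedback
          i = epsilon_greedy(values)
          r = reward(actions[i], reference)
          counts[i] += 1
          values[i] += (r - values[i]) / counts[i]   # incremental mean update

      best = max(range(len(actions)), key=lambda i: values[i])
      print("Best-valued summary:", " ".join(actions[best]))
      ```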

  2. Oct 2020
    1. Most people seem to follow one of two strategies - and these strategies come under the umbrella of tree-traversal algorithms in computer science.

      Deciding whether to go deep into one topic or to explore more topics can be seen as a choice between two tree-traversal algorithms: depth-first and breadth-first search, as sketched below.

      This also reminds me of the Explore-Exploit problem in machine learning, which I believe is related to the Multi-Armed Bandit Problem.
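
      A minimal sketch of the two strategies on an invented topic tree (the topics and structure are assumptions for illustration): depth-first reading follows one branch to the bottom before backing out, while breadth-first reading surveys every topic at one level before going deeper.

      ```python
      # Hypothetical topic tree: which subtopics each topic links to.
      from collections import deque

      topics = {
          "machine learning": ["bandits", "deep learning"],
          "bandits": ["explore-exploit", "contextual bandits"],
          "deep learning": ["transformers"],
          "explore-exploit": [],
          "contextual bandits": [],
          "transformers": [],
      }

      def depth_first(root):
          """Go deep into one topic before backing out (stack-based DFS)."""
          order, stack = [], [root]
          while stack:
              topic = stack.pop()
              order.append(topic)
              stack.extend(reversed(topics.get(topic, [])))
          return order

      def breadth_first(root):
          """Survey all topics at one level before going deeper (queue-based BFS)."""
          order, queue = [], deque([root])
          while queue:
              topic = queue.popleft()
              order.append(topic)
              queue.extend(topics.get(topic, []))
          return order

      print("Depth-first:  ", depth_first("machine learning"))
      print("Breadth-first:", breadth_first("machine learning"))
      ```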