9 Matching Annotations
  1. Sep 2023
    1. BERT only swaps 10% of the 15% tokens selected for masking (in total 1.5% of all tokens) and leaves 10% of the tokens intact

      Noise is deliberately introduced by swapping some of the selected words for random ones instead of masking them; this prevents the model from overfitting on the [MASK] token.
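
The 80/10/10 split described above can be sketched as follows. This is an illustrative toy, not code from any BERT implementation; the function name and vocabulary are made up.

```python
import random

def mask_tokens(tokens, vocab, select_prob=0.15, seed=0):
    """Sketch of BERT-style masking: select ~15% of tokens; of those,
    replace 80% with [MASK], swap 10% for a random token, keep 10%."""
    rng = random.Random(seed)
    out = []
    for tok in tokens:
        if rng.random() < select_prob:
            r = rng.random()
            if r < 0.8:
                out.append("[MASK]")           # 80% of selected: masked
            elif r < 0.9:
                out.append(rng.choice(vocab))  # 10% of selected: random swap
            else:
                out.append(tok)                # 10% of selected: left intact
        else:
            out.append(tok)
    return out
```

Because only 10% of the selected 15% are swapped, random-swap noise touches about 1.5% of all tokens, matching the figure in the highlight.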

    2. given a context, a language model predicts the probability of a word occurring in that context

      useful definition of "language model"
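
That definition can be made concrete with a toy bigram model (my own minimal example, unrelated to BERT): estimate the probability of a word given the previous word from counts in a tiny corpus.

```python
from collections import Counter, defaultdict

def train_bigram(corpus):
    """Count how often each word follows each preceding word."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.split()
        for prev, cur in zip(words, words[1:]):
            counts[prev][cur] += 1
    return counts

def prob(counts, prev, word):
    """P(word | prev) as a relative frequency."""
    total = sum(counts[prev].values())
    return counts[prev][word] / total if total else 0.0

counts = train_bigram(["the cat sat", "the dog sat", "the cat ran"])
# "the" is followed by "cat" twice and "dog" once, so P(cat | the) = 2/3
```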

    3. Instead of just training a model to map a single vector for each word, these methods train a complex, deep neural network to map a vector to each word based on the entire sentence/surrounding context.

      the difference between simple, context-free word embeddings and contextual representations from models such as BERT
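
The contrast can be sketched in a few lines. The vectors and the "encoder" below are stand-ins I invented for illustration, not output of a real model: a static embedding table returns the same vector for a word everywhere, while a contextual encoder returns a different vector depending on the sentence.

```python
# Static embedding: one fixed vector per word type, regardless of context.
static = {"bank": [0.2, 0.7]}

def contextual(word, sentence):
    # Stand-in for a deep encoder like BERT: here we merely shift the
    # static vector by a crude context signal, purely for illustration.
    signal = 1.0 if "river" in sentence else -1.0
    return [v + 0.1 * signal for v in static[word]]

v1 = contextual("bank", ["river", "bank"])  # "bank" as in riverbank
v2 = contextual("bank", ["bank", "loan"])   # "bank" as in financial bank
# v1 != v2: the representation depends on the surrounding sentence
```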

  2. Aug 2023
  3. citeseerx.ist.psu.edu
    1. Compositionality is defined here as the property whereby “the meaning of an expression is a monotonic function of the meaning of its parts and the way they are put together.” (Cann 1993:4) Recursion is “the phenomenon by which a constituent of a sentence dominates another instance of the same syntactic category . . . recursion is the principal reason that the number of sentences in a natural language is normally taken to be infinite” (Trask 1993:229-230)

      Kirby defines two fundamental properties of language. For one, language is compositional. The meaning of each part of a sentence, as well as the order in which these individual pieces appear, defines the overall meaning of that sentence. Language is also recursive (and therefore infinite). Each constituent in a phrase may contain one or more child constituents of the same syntactic category.
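
The recursion property can be illustrated with a toy grammar rule (my own example, not Kirby's): a noun phrase may contain another noun phrase, so phrases of unbounded depth exist, which is why the set of sentences is taken to be infinite.

```python
def noun_phrase(depth):
    """Expand NP -> "the" N (near NP)? recursively, to the given depth."""
    nouns = ["cat", "dog", "house"]
    np = "the " + nouns[depth % len(nouns)]
    if depth == 0:
        return np
    # An NP dominating another instance of the same category (NP):
    return np + " near " + noun_phrase(depth - 1)

print(noun_phrase(2))  # the house near the dog near the cat
```

Each extra level of `depth` yields a new, longer noun phrase, so no finite list can contain them all.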

  4. Aug 2022
    1. authoring organizations realized the old copies were sticking around but they did not want that history available (presumably to hide the fact they were changing things without notice or public disclosure). So they started forcing the removal of PDF and similar documents off the Archive

      A commenter noted that the links in their own field's reference list gradually had to be replaced with Internet Archive snapshots, until the authoring institutions forced the removal of even those snapshots.

  5. Jul 2022
    1. unfiltered

      filtered

    2. image regurgitation

      If clusters of images in the training dataset are too similar to one another, the model ends up reproducing a blend of those images instead of generalizing from the rest of its inputs.

    3. various guardrails