Hypothesis

37 Matching Annotations

Dec 2024
www.youtube.com www.youtube.com

Ajit Narayanan: A word game to communicate in any language

1
1. stopresetgo 23 Dec 2024
  
  in Public
  
  when you want to use Google, you go into Google search, and you type in English, and it matches the English with the English. What if we could do this in FreeSpeech instead? I have a suspicion that if we did this, we'd find that algorithms like searching, like retrieval, all of these things, are much simpler and also more effective, because they don't process the data structure of speech. Instead they're processing the data structure of thought
  
  for - indyweb dev - question - alternative to AI Large Language Models? - Is indyweb functionality the same as Freespeech functionality? - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan - data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan
  
  data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan indyweb dev - question - alternative to AI Large Language Models? - Is indyweb functionality the same as Freespeech functionality? - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan
Visit annotations in context

Tags

indyweb dev - question - alternative to AI Large Language Models? - Is indyweb functionality the same as Freespeech functionality? - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan

data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan

Annotators

stopresetgo

URL

youtube.com/watch
Sep 2024
metagov.org metagov.org

KOI Pond | Metagov

1
1. chrisaldrich 25 Sep 2024
  
  in Public
  
  https://metagov.org/projects/koi-pond
  
  Metagov's KOI (Knowledge Organization Infrastructure) is a graph database that supports relationships between knowledge objects, users, and groups within Metagov. via JM
  
  Friends of the Link 2024-09-25 Knowledge Organization Infrastructure (KOI) knowledge objects Large Language Models (LLM) communities self-governance
Visit annotations in context

Tags

knowledge objects

communities

Large Language Models (LLM)

Friends of the Link 2024-09-25

self-governance

Knowledge Organization Infrastructure (KOI)

Annotators

chrisaldrich

URL

metagov.org/projects/koi-pond
Jul 2024
ai.meta.com ai.meta.com

Introducing Llama 3.1: Our most capable models to date

1
1. chrisaldrich 24 Jul 2024
  
  in Public
  
  Introducing Llama 3.1: Our most capable models to date by [[Meta]]
  
  Meta Llama large language models Friends of the Link 2024-07-24
Visit annotations in context

Tags

Friends of the Link 2024-07-24

Llama

large language models

Meta

Annotators

chrisaldrich

URL

ai.meta.com/blog/meta-llama-3-1/
Mar 2024
research.ibm.com research.ibm.com

What is retrieval-augmented generation? | IBM Research Blog

1
1. chrisaldrich 30 Mar 2024
  
  in Public
  
  https://research.ibm.com/blog/retrieval-augmented-generation-RAG
  
  PK indicates that folks using footnotes in AI are using rag methods.
  
  retrieval augmented generation (RAG) artificial intelligence large language models Friends of the Link 2024-03-27
Visit annotations in context

Tags

artificial intelligence

retrieval augmented generation (RAG)

large language models

Friends of the Link 2024-03-27

Annotators

chrisaldrich

URL

research.ibm.com/blog/retrieval-augmented-generation-RAG
Jan 2024
arxiv.org arxiv.org

2401.05566.pdf

1
1. mark.crowley 26 Jan 2024
  
  in Public
  
  Hubinger, et. al. "SLEEPER AGENTS: TRAINING DECEPTIVE LLMS THAT PERSIST THROUGH SAFETY TRAINING". Arxiv: 2401.05566v3. Jan 17, 2024.
  
  Very disturbing and interesting results from team of researchers from Anthropic and elsewhere.
  
  large-language-models transformers rdgrp rdgrp-w24
Visit annotations in context

Tags

rdgrp-w24

large-language-models

transformers

rdgrp

Annotators

mark.crowley

URL

arxiv.org/pdf/2401.05566
cdn.openai.com cdn.openai.com

gpt-4-system-card.pdf

1
1. mark.crowley 06 Jan 2024
  
  in Public
  
  GPT-4 System CardOpenAIMarch 23, 2023
  
  chat-gpt large-language-models openai system-cards transformers toread reading_group_crowley
Visit annotations in context

Tags

toread

openai

chat-gpt

transformers

system-cards

reading_group_crowley

large-language-models

Annotators

mark.crowley

URL

cdn.openai.com/papers/gpt-4-system-card.pdf
www.technologyreview.com www.technologyreview.com

We read the paper that forced Timnit Gebru out of Google. Here’s what it says

1
1. stopresetgo 04 Jan 2024
  
  in Public
  
  for: progress trap -AI, carbon footprint - AI, progress trap - AI - bias, progress trap - AI - situatedness
  
  progress trap - AI carbon footprint - AI carbon footprint - AI - large language models AI ethics - Google - Timnit Gebru departure progress trap - AI - bias progress trap - AI - situatedness
Visit annotations in context

Tags

carbon footprint - AI

AI ethics - Google - Timnit Gebru departure

progress trap - AI - bias

carbon footprint - AI - large language models

progress trap - AI - situatedness

progress trap - AI

Annotators

stopresetgo

URL

technologyreview.com/2020/12/04/1013294/google-ai-ethics-research-paper-forced-out-timnit-gebru/
Oct 2023
arxiv.org arxiv.org

RoBERTa: A Robustly Optimized BERT Pretraining Approach

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  Introduction of the RoBERTa improved analysis and training approach to BERT NLP models.
  
  large-language-models nlp transformers rdgrp-s23 reading_group_crowley
Visit annotations in context

Tags

nlp

transformers

reading_group_crowley

large-language-models

rdgrp-s23

Annotators

mark.crowley

URL

arxiv.org/pdf/1907.11692
arxiv.org arxiv.org

2305.15486.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  Wu, Prabhumoye, Yeon Min, Bisk, Salakhutdinov, Azaria, Mitchell and Li. "SPRING: GPT-4 Out-performs RL Algorithms byStudying Papers and Reasoning". Arxiv preprint arXiv:2305.15486v2, May, 2023.
  
  reinforcement-learning nlp large-language-models chatgpt minecraft evaluation-methods rdgrp-f23
Visit annotations in context

Tags

chatgpt

large-language-models

nlp

minecraft

evaluation-methods

rdgrp-f23

reinforcement-learning

Annotators

mark.crowley

URL

arxiv.org/pdf/2305.15486.pdf
arxiv.org arxiv.org

2308.13067.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  Zecevic, Willig, Singh Dhami and Kersting. "Causal Parrots: Large Language Models May Talk Causality But Are Not Causal". In Transactions on Machine Learning Research, Aug, 2023.
  
  transformers large-language-models nlp reading_group_crowley rdgrp-f23
Visit annotations in context

Tags

nlp

transformers

reading_group_crowley

rdgrp-f23

large-language-models

Annotators

mark.crowley

URL

arxiv.org/pdf/2308.13067.pdf
www.gatesnotes.com www.gatesnotes.com

The Age of AI has begun

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  "The Age of AI has begun : Artificial intelligence is as revolutionary as mobile phones and the Internet." Bill Gates, March 21, 2023. GatesNotes
  
  aig chatgpt large-language-models
Visit annotations in context

Tags

chatgpt

large-language-models

aig

Annotators

mark.crowley

URL

gatesnotes.com/The-Age-of-AI-Has-Begun
www.inc.com www.inc.com

Bill Gates Says We're Witnessing a 'Stunning' New Technology Age. 5 Ways You Must Prepare Now

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  Minda Zetlin. "Bill Gates Says We're Witnessing a 'Stunning' New Technology Age. 5 Ways You Must Prepare Now". Inc.com, March 2023.
  
  chatgpt openai large-language-models
Visit annotations in context

Tags

chatgpt

openai

large-language-models

Annotators

mark.crowley

URL

inc.com/minda-zetlin/bill-gates-says-were-witnessing-a-stunning-new-technology-age-5-ways-to-prepare.html
arxiv.org arxiv.org

2212.05032.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  Feng, 2022. "Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis"
  
  Shared and found via: Gowthami Somepalli @gowthami@sigmoid.social Mastodon > Gowthami Somepalli @gowthami StructureDiffusion: Improve the compositional generation capabilities of text-to-image #diffusion models by modifying the text guidance by using a constituency tree or a scene graph.
  
  chatgpt large-language-models nlp transformers ece657a
Visit annotations in context

Tags

chatgpt

nlp

transformers

large-language-models

ece657a

Annotators

mark.crowley

URL

arxiv.org/pdf/2212.05032.pdf
arxiv.org arxiv.org

2203.02155.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  Training language models to follow instructionswith human feedback
  
  Original Paper for discussion of the Reinforcement Learning with Human Feedback algorithm.
  
  large-language-models reinforcement-learning chatgpt
Visit annotations in context

Tags

chatgpt

large-language-models

reinforcement-learning

Annotators

mark.crowley

URL

arxiv.org/pdf/2203.02155
cdn.openai.com cdn.openai.com

Language Models are Unsupervised Multitask Learners

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  GPT-2 Introduction paper
  
  Language Models are Unsupervised Multitask Learners A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever, (2019).
  
  large-language-models nlp machine-learning transformers gpt reading_group_crowley rdgrp-s23
Visit annotations in context

Tags

machine-learning

nlp

transformers

reading_group_crowley

large-language-models

gpt

rdgrp-s23

Annotators

mark.crowley

URL

cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
arxiv.org arxiv.org

1706.03762.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  "Attention is All You Need" Foundational paper introducing the Transformer Architecture.
  
  transformers reading_group_crowley rdgrp-s23 large-language-models nlp
Visit annotations in context

Tags

nlp

transformers

reading_group_crowley

large-language-models

rdgrp-s23

Annotators

mark.crowley

URL

arxiv.org/pdf/1706.03762
papers.nips.cc papers.nips.cc

NeurIPS-2020-language-models-are-few-shot-learners-Paper.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  GPT-3 introduction paper
  
  large-language-models nlp machine-learning transformers gpt reading_group_crowley rdgrp-s23
Visit annotations in context

Tags

machine-learning

nlp

transformers

reading_group_crowley

large-language-models

gpt

rdgrp-s23

Annotators

mark.crowley

URL

papers.nips.cc/paper_files/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf
arxiv.org arxiv.org

2105.03322.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  "Are Pre-trained Convolutions Better than Pre-trained Transformers?"
  
  transformers deep-learning nlp large-language-models reading_group_crowley rdgrp-s23
Visit annotations in context

Tags

rdgrp-s23

nlp

transformers

reading_group_crowley

large-language-models

deep-learning

Annotators

mark.crowley

URL

arxiv.org/pdf/2105.03322.pdf
arxiv.org arxiv.org

2201.08239.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  LaMDA: Language Models for Dialog Application
  
  "LaMDA: Language Models for Dialog Application" Meta's introduction of LaMDA v1 Large Language Model.
  
  transformers reading_group_crowley rdgrp-s23 large-language-models nlp
Visit annotations in context

Tags

nlp

transformers

reading_group_crowley

large-language-models

rdgrp-s23

Annotators

mark.crowley

URL

arxiv.org/pdf/2201.08239.pdf
osf.io osf.io

Attention Mechanism, Transformers, BERT, and GPT: Tutorial and Survey

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  Benyamin GhojoghAli Ghodsi. "Attention Mechanism, Transformers, BERT, and GPT: Tutorial and Survey"
  
  reading_group_crowley transformers reading_group_crowley rdgrp-s23 nlp large-language-models
Visit annotations in context

Tags

nlp

transformers

reading_group_crowley

large-language-models

rdgrp-s23

Annotators

mark.crowley

URL

osf.io/m6gcn/
Jul 2023
arxiv.org arxiv.org

2307.09288.pdf

1
1. mark.crowley 19 Jul 2023
  
  in Public
  
  LLAMA 2 Release Paper
  
  large-language-models transformers
Visit annotations in context

Tags

large-language-models

transformers

Annotators

mark.crowley

URL

arxiv.org/pdf/2307.09288
arxiv.org arxiv.org

2001.09977.pdf

1
1. mark.crowley 19 Jul 2023
  
  in Public
  
  Daniel Adiwardana Minh-Thang Luong David R. So Jamie Hall, Noah Fiedel Romal Thoppilan Zi Yang Apoorv Kulshreshtha, Gaurav Nemade Yifeng Lu Quoc V. Le "Towards a Human-like Open-Domain Chatbot" Google Research, Brain Team
  
  Defined the SSI metric for chatbots used in LAMDA paper by google.
  
  large-language-models ssi nlp
Visit annotations in context

Tags

large-language-models

ssi

nlp

Annotators

mark.crowley

URL

arxiv.org/pdf/2001.09977.pdf
Apr 2023
srush.github.io srush.github.io

The Annotated S4

1
1. mark.crowley 18 Apr 2023
  
  in Public
  
  The Annotated S4 Efficiently Modeling Long Sequences with Structured State Spaces Albert Gu, Karan Goel, and Christopher Ré.
  
  A new approach to transformers
  
  transformers large-language-models
Visit annotations in context

Tags

large-language-models

transformers

Annotators

mark.crowley

URL

srush.github.io/annotated-s4/
arxiv.org arxiv.org

Efficiently Modeling Long Sequences with Structured State Spaces

1
1. mark.crowley 18 Apr 2023
  
  in Public
  
  Efficiently Modeling Long Sequences with Structured State SpacesAlbert Gu, Karan Goel, and Christopher R ́eDepartment of Computer Science, Stanford University
  
  transformers large-language-models
Visit annotations in context

Tags

large-language-models

transformers

Annotators

mark.crowley

URL

arxiv.org/pdf/2111.00396
arxiv.org arxiv.org

Eight Things to Know about Large Language Models

1
1. peter_murray 16 Apr 2023
  
  in Public
  
  Bowman, Samuel R.. "Eight Things to Know about Large Language Models." arXiv, (2023). https://doi.org/https://arxiv.org/abs/2304.00612v1.
  
  Abstract
  
  The widespread public deployment of large language models (LLMs) in recent months has prompted a wave of new attention and engagement from advocates, policymakers, and scholars from many fields. This attention is a timely response to the many urgent questions that this technology raises, but it can sometimes miss important considerations. This paper surveys the evidence for eight potentially surprising such points: 1. LLMs predictably get more capable with increasing investment, even without targeted innovation. 2. Many important LLM behaviors emerge unpredictably as a byproduct of increasing investment. 3. LLMs often appear to learn and use representations of the outside world. 4. There are no reliable techniques for steering the behavior of LLMs. 5. Experts are not yet able to interpret the inner workings of LLMs. 6. Human performance on a task isn't an upper bound on LLM performance. 7. LLMs need not express the values of their creators nor the values encoded in web text. 8. Brief interactions with LLMs are often misleading.
  
  Found via: Taiwan's Gold Card draws startup founders, tech workers | Semafor
  
  large language models
Visit annotations in context

Tags

large language models

Annotators

peter_murray

URL

arxiv.org/pdf/2304.00612.pdf
time.com time.com

Exclusive: The $2 Per Hour Workers Who Made ChatGPT Safer

1
1. avner 06 Apr 2023
  
  in Public
  
  It was only by building an additional AI-powered safety mechanism that OpenAI would be able to rein in that harm, producing a chatbot suitable for everyday use.
  
  This isn't true. The Stochastic Parrots paper outlines other avenues for reining in the harms of language models like GPT's.
  
  large language models AI Artificial Intelligence Stochastic Parrots
Visit annotations in context

Tags

Stochastic Parrots

large language models

Artificial Intelligence

AI

Annotators

avner

URL

time.com/6247678/openai-chatgpt-kenya-workers/
Mar 2023
arxiv.org arxiv.org

2302.07459.pdf

1
1. peter_murray 29 Mar 2023
  
  in Public
  
  Ganguli, Deep, Askell, Amanda, Schiefer, Nicholas, Liao, Thomas I., Lukošiūtė, Kamilė, Chen, Anna, Goldie, Anna et al. "The Capacity for Moral Self-Correction in Large Language Models." arXiv, (2023). https://doi.org/https://arxiv.org/abs/2302.07459v2.
  
  Abstract
  
  We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so. We find strong evidence in support of this hypothesis across three different experiments, each of which reveal different facets of moral self-correction. We find that the capability for moral self-correction emerges at 22B model parameters, and typically improves with increasing model size and RLHF training. We believe that at this level of scale, language models obtain two capabilities that they can use for moral self-correction: (1) they can follow instructions and (2) they can learn complex normative concepts of harm like stereotyping, bias, and discrimination. As such, they can follow instructions to avoid certain kinds of morally harmful outputs. We believe our results are cause for cautious optimism regarding the ability to train language models to abide by ethical principles.
  
  computing ethics large language models
Visit annotations in context

Tags

computing ethics

large language models

Annotators

peter_murray

URL

arxiv.org/pdf/2302.07459.pdf
web.archive.org web.archive.org

Ancient Egyptian Dictionary

2
1. chrisaldrich 28 Mar 2023
  
  in Public
  
  Dass das ägyptische Wort p.t (sprich: pet) "Himmel" bedeutet, lernt jeder Ägyptologiestudent im ersten Semester. Die Belegsammlung im Archiv des Wörterbuches umfaßt ca. 6.000 Belegzettel. In der Ordnung dieses Materials erfährt man nun, dass der ägyptische Himmel Tore und Wege hat, Gewässer und Ufer, Seiten, Stützen und Kapellen. Damit wird greifbar, dass der Ägypter bei dem Wort "Himmel" an etwas vollkommen anderes dachte als der moderne westliche Mensch, an einen mythischen Raum nämlich, in dem Götter und Totengeister weilen. In der lexikographischen Auswertung eines so umfassenden Materials geht es also um weit mehr als darum, die Grundbedeutung eines banalen Wortes zu ermitteln. Hier entfaltet sich ein Ausschnitt des ägyptischen Weltbildes in seinem Reichtum und in seiner Fremdheit; und naturgemäß sind es gerade die häufigen Wörter, die Schlüsselbegriffe der pharaonischen Kultur bezeichnen. Das verbreitete Mißverständnis, das Häufige sei uninteressant, stellt die Dinge also gerade auf den Kopf.
  
  Google translation:
  
  Every Egyptology student learns in their first semester that the Egyptian word pt (pronounced pet) means "heaven". The collection of documents in the dictionary archive comprises around 6,000 document slips. In the order of this material one learns that the Egyptian heaven has gates and ways, waters and banks, sides, pillars and chapels. This makes it tangible that the Egyptians had something completely different in mind when they heard the word "heaven" than modern Westerners do, namely a mythical space in which gods and spirits of the dead dwell.
  
  This is a fantastic example of context creation for a dead language as well as for creating proper historical context.
  
  context collapse contextual clues large language models key word in context heaven historical context historical method examples Egyptian philology
2. chrisaldrich 28 Mar 2023
  
  in Public
  
  In looking at the uses of and similarities between Wb and TLL, I can't help but think that these two zettelkasten represented the state of the art for Large Language Models and some of the ideas behind ChatGPT
  
  Thesaurus Linguae Latinae Wörterbuch der ägyptischen Sprache zettelkasten large language models information theory ChatGPT
Visit annotations in context

Tags

key word in context

contextual clues

heaven

philology

examples

Thesaurus Linguae Latinae

ChatGPT

context collapse

zettelkasten

Wörterbuch der ägyptischen Sprache

historical method

large language models

historical context

information theory

Egyptian

Annotators

chrisaldrich

URL

web.archive.org/web/20180627163317/https://aaew.bbaw.de/wbhome/Broschuere/index.html
www.inc.com www.inc.com

Bill Gates Says We're Witnessing a 'Stunning' New Technology Age. 5 Ways You Must Prepare Now

1
1. mark.crowley 27 Mar 2023
  
  in Public
  
  "There is a robust debate going on in the computing industry about how to create it, and whether it can even be created at all."
  
  Is there? By whom? Why industry only and not government, academia and civil society?
  
  ai-for-good aigpt20230326 large-language-models chat open
Visit annotations in context

Tags

open

ai-for-good

chat

aigpt20230326

large-language-models

Annotators

mark.crowley

URL

inc.com/minda-zetlin/bill-gates-says-were-witnessing-a-stunning-new-technology-age-5-ways-to-prepare.html
dl.acm.org dl.acm.org

On the Dangers of Stochastic Parrots | Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency

1
1. chrisaldrich 23 Mar 2023
  
  in Public
  
  Bender, Emily M., Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell. “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜” In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 610–23. FAccT ’21. New York, NY, USA: Association for Computing Machinery, 2021. https://doi.org/10.1145/3442188.3445922.
  
  Would the argument here for stochastic parrots also potentially apply to or could it be abstracted to Markov monkeys?
  
  artificial intelligence references large language models stochastic parrots Markov monkey tools for thought
Visit annotations in context

Tags

tools for thought

large language models

artificial intelligence

Markov monkey

references

stochastic parrots

Annotators

chrisaldrich

URL

dl.acm.org/doi/10.1145/3442188.3445922
Feb 2023
wordcraft-writers-workshop.appspot.com wordcraft-writers-workshop.appspot.com

Wordcraft Writers Workshop

1
1. chrisaldrich 12 Feb 2023
  
  in Public
  
  language models are incredible "yes, and" machines, allowing writers to quickly explore seemingly unlimited variations on their ideas.
  
  Wordcraft improvisation programmed creativity tools for creativity yes and language models LaMDA combinatorial creativity
Visit annotations in context

Tags

language models

Wordcraft

LaMDA

tools for creativity

combinatorial creativity

improvisation

programmed creativity

yes and

Annotators

chrisaldrich

URL

wordcraft-writers-workshop.appspot.com/learn
Jan 2023
inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net

Untitled document

1
1. mark.crowley 10 Jan 2023
  
  in Public
  
  "Talking About Large Language Models" by Murray Shanahan
  
  nlp large-language-models deep-learning transformers
Visit annotations in context

Tags

large-language-models

nlp

transformers

deep-learning

Annotators

mark.crowley

URL

inst-fs-iad-prod.inscloudgate.net/files/4b2a700d-1125-444d-bd72-8045fe274f37/Shanahan2023.pdf
Dec 2022
www.theatlantic.com www.theatlantic.com

The College Essay Is Dead

1
1. wiobyrne 10 Dec 2022
  
  in Public
  
  natural-language processing is going to force engineers and humanists together. They are going to need each other despite everything. Computer scientists will require basic, systematic education in general humanism: The philosophy of language, sociology, history, and ethics are not amusing questions of theoretical speculation anymore. They will be essential in determining the ethical and creative use of chatbots, to take only an obvious example.
  
  gpt-3 language models
Visit annotations in context

Tags

language models

gpt-3

Annotators

wiobyrne

URL

theatlantic.com/technology/archive/2022/12/chatgpt-ai-writing-college-student-essays/672371/
jack-clark.net jack-clark.net

Import AI 310: AlphaZero learned Chess like humans learn Chess; capability emergence in language models; demoscene AI.

1
1. wiobyrne 10 Dec 2022
  
  in Public
  
  Houston, we have a Capability Overhang problem: Because language models have a large capability surface, these cases of emergent capabilities are an indicator that we have a ‘capabilities overhang’ – today’s models are far more capable than we think, and our techniques available for exploring the models are very juvenile. We only know about these cases of emergence because people built benchmark datasets and tested models on them. What about all the capabilities we don’t know about because we haven’t thought to test for them? There are rich questions here about the science of evaluating the capabilities (and safety issues) of contemporary models.
  
  capability overhang ai language models gpt-3
Visit annotations in context

Tags

language models

gpt-3

ai

capability overhang

Annotators

wiobyrne

URL

jack-clark.net/2022/11/28/import-ai-310-alphazero-learned-chess-like-humans-learn-chess-capability-emergence-in-language-models-demoscene-ai/
Apr 2021
yashuseth.blog yashuseth.blog

BERT Explained – A list of Frequently Asked Questions

1
1. mromanello 22 Apr 2021
  
  in Public
  
  tutorial article on BERT
  
  bert deep learning language models
Visit annotations in context

Tags

language models

bert

deep learning

Annotators

mromanello

URL

yashuseth.blog/2019/06/12/bert-explained-faqs-understand-bert-working/
Aug 2020
deponysum.com deponysum.com

Recent advances in Natural Language Processing- Some Woolly speculations

1
1. jessems 19 Aug 2020
  
  in Public
  
  It might be instructive to think about what it would take to create a program which has a model of eighth grade science sufficient to understand and answer questions about hundreds of different things like “growth is driven by cell division”, and “What can magnets be used for” that wasn’t NLP led. It would be a nightmare of many different (probably handcrafted) models. Speaking somewhat loosely, language allows for intellectual capacities to be greatly compressed. From this point of view, it shouldn’t be surprising that some of the first signs of really broad capacity- common sense reasoning, wide ranging problem solving etc., have been found in language based programs- words and their relationships are just a vastly more efficient way of representing knowledge than the alternatives.
  
  DePonySum ask us to consider what you would need to program to be able to answer a wide range of eight grade science level questions (e.g. What can magnets be used for.) The answer is you would need a whole slew of separately trained and optimized models.
  
  Language, they say, is a way to compress intellectual capacities.
  
  It is then no surprise that common sense reasoning, and solving a wide range of problems, is first discovered through language models. Words and their relationships are probably a very efficient way of representing knowledge.
  
  Language Models GPT3
Visit annotations in context

Tags

Language Models

GPT3

Annotators

jessems

URL

deponysum.com/2020/01/16/recent-advances-in-natural-language-processing-some-woolly-speculations/

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Abstract

Tags

Annotators

URL

Tags

Annotators

URL

Abstract