1 Matching Annotations
- Apr 2023
While past work has characterized what kinds of functions ICL can learn (Garg et al., 2022; Laskin et al., 2022) and the distributional properties of pretraining that can elicit in-context learning (Xie et al., 2021; Chan et al., 2022), how ICL learns these functions has remained unclear. What learning algorithms (if any) are implementable by deep network models? Which algorithms are actually discovered in the course of training? This paper takes first steps toward answering these questions, focusing on a widely used model architecture (the transformer) and an extremely well-understood class of learning problems (linear regression).
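A minimal sketch of the linear-regression ICL setup may help make this concrete. Assuming a toy task in the style of Garg et al. (2022): a random linear function generates the in-context examples, and one candidate learning algorithm the transformer could implement is ordinary least squares on those examples. All names and dimensions here are illustrative, not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n_prompt = 8, 20  # input dimension, number of in-context examples

# Sample a random linear task: y = w . x is the function the model
# would have to learn in context (hypothetical toy parameters).
w = rng.normal(size=d)
X = rng.normal(size=(n_prompt, d))
y = X @ w

# One candidate "learning algorithm": ordinary least squares
# fit to the in-context (x_i, y_i) pairs.
w_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

# With noiseless labels and n_prompt > d, OLS recovers w, so its
# prediction on a held-out query point matches the true function.
x_query = rng.normal(size=d)
print(np.allclose(w_hat @ x_query, w @ x_query))  # → True
```

A trained transformer's in-context predictions on such prompts can then be compared against baselines like this one to ask which algorithm, if any, the model has discovered.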