Hypothesis

18 Matching Annotations

May 2025
www.cmarix.com www.cmarix.com

Embedding Intelligence with Vector Search and RAG Models: A Deep Dive

1
1. akelahmed 30 May 2025
  
  in Public
  
  The answer most technocrats are leaning towards is vector search technology and Retrieval-Augmented Generation (RAG) models that improve AI experiences. These intelligent search systems are fundamentally changing how users discover information, interact with applications, and receive personalized experiences across industries.
  
  Explore how embedding intelligence transforms Vector Search and RAG (Retrieval-Augmented Generation) models. Learn the key benefits, use cases, and implementation strategies for smarter AI-driven search systems.
  
  Vector search with embeddings RAG models in AI Embedding intelligence in vector search AI-powered search systems Retrieval-Augmented Generation models
Visit annotations in context

Tags

Retrieval-Augmented Generation models

Embedding intelligence in vector search

AI-powered search systems

Vector search with embeddings

RAG models in AI

Annotators

akelahmed

URL

cmarix.com/blog/embedding-intelligence-in-vector-search-and-rag-models/
Feb 2025
srsergiorodriguez.github.io srsergiorodriguez.github.io

Humanidades digitales en América Latina

1
1. offray 17 Feb 2025
  
  in Public
  
  Las incrustaciones de palabras, o word embeddings, son un método que consiste en derivar, por medio de algoritmos diversos, representaciones numéricas vectoriales de los términos presentes en un corpus, de acuerdo con sus relaciones contextuales7Michael Gavin et al., «Spaces of Meaning: Conceptual HIstory, Vector Semantics, and Close Reading», Debates in the Digital Humanities 2019, ed. Matthew K. Gold y Lauren F. Klein (Minneapolis London: University of Minnesota Press, 2019), 243-67.. Este método sigue los principios de la semántica distribucional8
  
  word embeddings semántica probabilística
Visit annotations in context

Tags

word embeddings

semántica probabilística

Annotators

offray

URL

srsergiorodriguez.github.io/exploraciones-digitales/metodos.html
Jul 2021
www.baeldung.com www.baeldung.com

Euclidean Distance vs Cosine Similarity | Baeldung on Computer Science

1
1. mshook 16 Jul 2021
  
  in Public
  
  Vectors with a small Euclidean distance from one another are located in the same region of a vector space. Vectors with a high cosine similarity are located in the same general direction from the origin.
  
  ml nn embeddings distance angle cosine comparison explanation
Visit annotations in context

Tags

embeddings

cosine

distance

explanation

angle

ml

nn

comparison

Annotators

mshook

URL

baeldung.com/cs/euclidean-distance-vs-cosine-similarity
aylien.com aylien.com

An overview of word embeddings and their connection to distributional semantic models - AYLIEN News API

1
1. mshook 05 Jul 2021
  
  in Public
  
  Recommendations DON'T use shifted PPMI with SVD. DON'T use SVD "correctly", i.e. without eigenvector weighting (performance drops 15 points compared to with eigenvalue weighting with (p = 0.5)). DO use PPMI and SVD with short contexts (window size of (2)). DO use many negative samples with SGNS. DO always use context distribution smoothing (raise unigram distribution to the power of (lpha = 0.75)) for all methods. DO use SGNS as a baseline (robust, fast and cheap to train). DO try adding context vectors in SGNS and GloVe.
  
  ml ai recommendations critique word embeddings
Visit annotations in context

Tags

embeddings

recommendations

ai

ml

word

critique

Annotators

mshook

URL

aylien.com/blog/overview-word-embeddings-history-word2vec-cbow-glove
Jun 2020
link.aps.org link.aps.org

Spatial strength centrality and the effect of spatial embeddings on network architecture

1
1. katietaylor_99 10 Jun 2020
  
  in BehSci
  
  Liu, Andrew, and Mason A. Porter. ‘Spatial Strength Centrality and the Effect of Spatial Embeddings on Network Architecture’. Physical Review E 101, no. 6 (9 June 2020): 062305. https://doi.org/10.1103/PhysRevE.101.062305.
  
  is:article lang:en spatial strength centrality spatial embeddings network architecture nodes latent space adjacent models synthetic network Euclidean smaller probabilities longer edges geographical fitness Gaussian
Visit annotations in context

Tags

smaller probabilities

longer edges

latent space

is:article

lang:en

synthetic network

Gaussian

models

Euclidean

nodes

architecture

network

adjacent

spatial strength centrality

geographical fitness

spatial embeddings

Annotators

katietaylor_99

URL

link.aps.org/doi/10.1103/PhysRevE.101.062305
Dec 2019
nlpoverview.com nlpoverview.com

Modern Deep Learning Techniques Applied to Natural Language Processing by Authors

5
1. vitalwarley 29 Dec 2019
  
  in Public
  
  The quality of word representations is generally gauged by its ability to encode syntactical information and handle polysemic behavior (or word senses). These properties result in improved semantic word representations. Recent approaches in this area encode such information into its embeddings by leveraging the context. These methods provide deeper networks that calculate word representations as a function of its context.
  
  Syntactical information
  
  Polysemic behavior (word senses)
  
  Semantic word representations
  
  Entendo que lidar com word senses significa dizer que a representação das palavras consegue medidas similares para palavras similares.
  
  O que seria informação sintática? E sua relação com representações semânticas da palavra?
  
  embeddings nlp
2. vitalwarley 28 Dec 2019
  
  in Public
  
  Traditional word embedding algorithms assign a distinct vector to each word. This makes them unable to account for polysemy. In a recent work, Upadhyay et al. (2017) provided an innovative way to address this deficit. The authors leveraged multilingual parallel data to learn multi-sense word embeddings.
  
  multilingual parallel data
  
  multi-sense word embeddings
  
  embeddings word2vec
3. vitalwarley 28 Dec 2019
  
  in Public
  
  This is very important as training embeddings from scratch requires large amount of time and resource. Mikolov et al. (2013) tried to address this issue by proposing negative sampling which is nothing but frequency-based sampling of negative terms while training the word2vec model.
  
  Amostragem negativa... termos negativos?
  
  word2vec embeddings
4. vitalwarley 28 Dec 2019
  
  in Public
  
  A general caveat for word embeddings is that they are highly dependent on the applications in which it is used. Labutov and Lipson (2013) proposed task specific embeddings which retrain the word embeddings to align them in the current task space.
  
  Acredito que aplicação aqui se relaciona com contexto, logo word embeddings são dependentes de contexto. Isso é bem óbvio, a princípio. Seria isso o que o autor quis dizer?
  
  Retreinar as incorporações para alinhar à tarefa corrente. Alinhar seria nada mais do que adequar as incorporações prévias no novo contexto, é isso?
  
  word2vec embeddings
5. vitalwarley 28 Dec 2019
  
  in Public
  
  One solution to this problem, as explored by Mikolov et al. (2013), is to identify such phrases based on word co-occurrence and train embeddings for them separately. More recent methods have explored directly learning n-gram embeddings from unlabeled data (Johnson and Zhang, 2015).
  
  Co-ocorrência de palavras eu consigo entender, mas treinar as embeddings separadamente não. Seria supor a co-ocorrência das palavras como unidade na incorporação, em vez da palavra apenas?
  
  embeddings nlp word2vec
Visit annotations in context

Tags

embeddings

nlp

word2vec

Annotators

vitalwarley

URL

nlpoverview.com/
gtw.hypotheses.org gtw.hypotheses.org

Open Data Citation for Social Sciences and Humanities – The companion blog to the Humanities at Scale Winter School in Prague: 24th-28th October 2016

1
1. vitalwarley 28 Dec 2019
  
  in Public
  
  The word vector is the arrow from the point where all three axes intersect to the end point defined by the coordinates.
  
  The three axes gives each one a context.
  
  nlp embeddings
Visit annotations in context

Tags

embeddings

nlp

Annotators

vitalwarley

URL

gtw.hypotheses.org/15401
Jun 2017
w4nderlu.st w4nderlu.st

Word Embeddings | w4nderlust

1
1. taniki 12 Jun 2017
  
  in Public
  
  word embeddings machine learning NLP
Visit annotations in context

Tags

machine learning

word embeddings

NLP

Annotators

taniki

URL

w4nderlu.st/teaching/word-embeddings
Apr 2017
levyomer.files.wordpress.com levyomer.files.wordpress.com

dependency-based-word-embeddings-acl-2014.pdf

4
1. akcool123 19 Apr 2017
  
  in Public
  
  arg maxvw;vcP(w;c)2Dlog11+evcvw
  
  maximise the log probability.
  
  dependency-word-embeddings-paper Skip-gram
2. akcool123 19 Apr 2017
  
  in Public
  
  p(D= 1jw;c)the probability that(w;c)came from the data, and byp(D= 0jw;c) =1p(D= 1jw;c)the probability that(w;c)didnot.
  
  probability of word,context present in text or not.
  
  dependency-word-embeddings-paper Skip-gram
3. akcool123 19 Apr 2017
  
  in Public
  
  Loosely speaking, we seek parameter values (thatis, vector representations for both words and con-texts) such that the dot productvwvcassociatedwith “good” word-context pairs is maximized.
  
  dependency-word-embeddings-paper Skip-gram
4. akcool123 19 Apr 2017
  
  in Public
  
  In the skip-gram model, each wordw2Wisassociated with a vectorvw2Rdand similarlyeach contextc2Cis represented as a vectorvc2Rd, whereWis the words vocabulary,Cis the contexts vocabulary, anddis the embed-ding dimensionality.
  
  Factors involved in the Skip gram model
  
  Skip-gram dependency-word-embeddings-paper NLP
Visit annotations in context

Tags

dependency-word-embeddings-paper

NLP

Skip-gram

Annotators

akcool123

URL

levyomer.files.wordpress.com/2014/04/dependency-based-word-embeddings-acl-2014.pdf
Jun 2016
aclweb.org aclweb.org

Right-truncatable Neural Word Embeddings

2
1. ffbe15b4a7 26 Jun 2016
  
  in Public
  
  Neural Word Embedding Methods
  
  formal-definition word-embeddings
2. ffbe15b4a7 26 Jun 2016
  
  in Public
  
  dimension of embedding vectors strongly dependson applications and uses, and is basically determinedbased on the performance and memory space (orcalculation speed) trade-of
  
  dimensionality-of-word-embeddings
Visit annotations in context

Tags

formal-definition

word-embeddings

dimensionality-of-word-embeddings

Annotators

ffbe15b4a7

URL

aclweb.org/anthology/N/N16/N16-1135.pdf

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL