Questions tagged [embeddings]

76 questions
8 votes, 1 answer

What is the intuition behind the positional cosine encoding in the transformer network?

I don't understand how adding the cosine encodings/functions to each dimension of the word vector embedding enables the network to "understand" where each word is situated in the sentence. What is the intuition behind it? It seems a bit…
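For reference, the encoding in question is the sinusoidal scheme from "Attention Is All You Need" (Vaswani et al., 2017), which interleaves sines and cosines at geometrically spaced frequencies; a minimal NumPy sketch (assuming an even `d_model`):

```python
import numpy as np

def sinusoidal_encoding(max_len, d_model):
    """PE[pos, 2i] = sin(pos / 10000^(2i/d_model)), PE[pos, 2i+1] = cos(...)."""
    pos = np.arange(max_len)[:, None]              # (max_len, 1)
    i = np.arange(0, d_model, 2)[None, :]          # (1, d_model / 2)
    angles = pos / np.power(10000.0, i / d_model)  # one frequency per dim pair
    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angles)                   # even dims: sine
    pe[:, 1::2] = np.cos(angles)                   # odd dims: cosine
    return pe
```

Each position gets a unique fingerprint across frequencies, and a fixed offset $k$ acts as a rotation on each sine/cosine pair, so $PE_{pos+k}$ is a linear function of $PE_{pos}$; that linearity is the usual intuition for why attention can recover relative positions.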
7 votes, 2 answers

How to embed in Euclidean space

I have what I think might be a standard machine learning problem, but I can't find a clear solution. I have lots of vectors of different dimensions. For each pair of vectors I can compute their similarity. I would like to embed these vectors into…
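One standard answer (assuming the similarities can be turned into dissimilarities, e.g. $d_{ij} = 1 - s_{ij}$) is metric MDS on the precomputed matrix; a sketch with scikit-learn, where `D` is a stand-in for the asker's pairwise dissimilarities:

```python
import numpy as np
from sklearn.manifold import MDS

# Stand-in data: any symmetric dissimilarity matrix with a zero diagonal works.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 7))
D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)

mds = MDS(n_components=2, dissimilarity="precomputed", random_state=0)
coords = mds.fit_transform(D)  # (50, 2) Euclidean points minimizing stress
```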
6 votes, 2 answers

Why does BERT use learned positional embeddings?

Compared with the sinusoidal positional encoding used in the Transformer, BERT's learned-lookup-table solution has two drawbacks in my mind: (1) it has a fixed length; (2) it cannot reflect relative distance. Could anyone please tell me the considerations behind such a design?
eric2323223
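For contrast with the sinusoidal scheme, BERT-style positions are just a trainable lookup table added to the token embeddings; a minimal PyTorch sketch (sizes are BERT-base defaults, used here for illustration):

```python
import torch
import torch.nn as nn

class LearnedPositionalEmbedding(nn.Module):
    """A trainable table with one row per position. The fixed max_len and the
    absence of any built-in notion of distance are exactly the two drawbacks
    the question raises."""
    def __init__(self, max_len=512, d_model=768):
        super().__init__()
        self.pos = nn.Embedding(max_len, d_model)

    def forward(self, tok):  # tok: (batch, seq_len, d_model)
        positions = torch.arange(tok.size(1), device=tok.device)
        return tok + self.pos(positions)
```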
6 votes, 2 answers

How to use "IDs" as an input variable to an ML model?

I am trying to include a variable like "account number", which is an "ID", as a predictive variable for a logistic regression model. In fact, there are several columns in my dataset that are "IDs" but are important in predicting the outcome. For…
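A common pattern is to treat each ID column as a categorical feature and learn an entity embedding for it, rather than feeding the raw integer into the model; a hedged Keras sketch (cardinality and embedding size are illustrative):

```python
import tensorflow as tf

n_accounts, emb_dim = 10_000, 16   # illustrative: #distinct IDs, embedding size

id_in = tf.keras.Input(shape=(1,), dtype="int32", name="account_id")
x = tf.keras.layers.Embedding(n_accounts, emb_dim)(id_in)  # trainable lookup
x = tf.keras.layers.Flatten()(x)
out = tf.keras.layers.Dense(1, activation="sigmoid")(x)    # logistic head

model = tf.keras.Model(id_in, out)
model.compile(optimizer="adam", loss="binary_crossentropy")
```

The embedding rows are fit by the task loss, so IDs associated with similar outcomes end up with nearby vectors, which is what makes an otherwise arbitrary identifier usable as a predictor.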
5 votes, 1 answer

If the curse of dimensionality exists, how does embedding search work?

The curse of dimensionality tells us that if the dimension is high, the distance metric stops working, i.e., everyone is close to everyone. However, many machine learning retrieval systems rely on calculating embeddings and retrieving similar data…
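Part of the usual resolution is that learned embeddings concentrate near a low-dimensional manifold, and retrieval only needs the relative ranking of neighbours, not well-separated absolute distances; a minimal cosine-retrieval sketch in NumPy:

```python
import numpy as np

rng = np.random.default_rng(0)
db = rng.normal(size=(100_000, 384))             # stand-in embedding index
db /= np.linalg.norm(db, axis=1, keepdims=True)  # unit-normalize once

def top_k(query, k=5):
    q = query / np.linalg.norm(query)
    scores = db @ q                          # cosine similarity to every item
    return np.argpartition(-scores, k)[:k]   # indices of the k best matches

print(top_k(rng.normal(size=384)))
```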
4 votes, 1 answer

ArcFace - How to compute $\cos(t+m)$ when $t+m > \pi$

I am trying to understand the ArcFace implementation and I am stuck at one condition: if $\cos(t) \le \cos(\pi - m)$, then $t + m \ge \pi$. In this case the way we compute $\cos(t+m)$ is changed to $\cos(t+m) = \cos(t) - m\sin(m)$. Could…
pawols
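For reference, the widely used implementation (e.g. InsightFace) expands $\cos(t+m)$ with the angle-addition formula and, because cosine stops being monotonically decreasing once $t + m$ passes $\pi$, falls back to the linear surrogate $\cos(t) - m\sin(m)$ there; a PyTorch sketch of just that branch:

```python
import math
import torch

def arcface_logit(cos_t, m=0.5):
    """cos(t + m), with the common fallback once t + m would exceed pi."""
    sin_t = torch.sqrt((1.0 - cos_t ** 2).clamp(min=0.0))
    cos_tm = cos_t * math.cos(m) - sin_t * math.sin(m)  # cos(t)cos(m) - sin(t)sin(m)
    # cos(t) <= cos(pi - m) means t + m >= pi, where cos(t + m) is no longer
    # monotonic in t; substitute the decreasing linear term cos(t) - m*sin(m).
    threshold = math.cos(math.pi - m)
    return torch.where(cos_t > threshold, cos_tm, cos_t - m * math.sin(m))
```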
4 votes, 1 answer

What is an embedding? (in the context of dimensionality reduction)

In the context of dimensionality reduction one often uses the word "embedding", which seems to me a rather technical mathematical term that stands out compared to the rest of the discussion, which in the case of PCA, MDS and similar methods is just…
4 votes, 0 answers

Why do researchers use conv1d for embeddings instead of dense layers?

In some papers (like "Reinforcement learning for Vehicle Routing Problem"), researchers use conv1d to embed the problem input into a hyperspace; for example, in solving the TSP, they use conv1d on the (x, y) coordinates of each node, but I don't understand why…
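One observation that may resolve this: a `Conv1d` with `kernel_size=1` is exactly a dense layer applied independently at every node with shared weights, which is convenient when the number of nodes varies; a PyTorch sketch demonstrating the equivalence:

```python
import torch
import torch.nn as nn

coords = torch.randn(8, 2, 20)   # (batch, xy-channels, n_nodes)

conv = nn.Conv1d(in_channels=2, out_channels=128, kernel_size=1)
dense = nn.Linear(2, 128)
dense.weight.data = conv.weight.data.squeeze(-1)  # reuse the same weights
dense.bias.data = conv.bias.data

out_conv = conv(coords)                                    # (8, 128, 20)
out_dense = dense(coords.transpose(1, 2)).transpose(1, 2)  # same values
print(torch.allclose(out_conv, out_dense, atol=1e-6))      # True
```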
4 votes, 1 answer

What is the difference between the latent space of a variational autoencoder and that of a regular autoencoder?

Should VAEs even be used for non-generative tasks? If I were to use both models for embedding images, how would the embedding spaces differ on a structural level?
Daniel
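Structurally, the difference sits at the encoder head: a plain autoencoder emits a point, while a VAE emits the parameters of a distribution whose KL penalty pushes codes toward a smooth, roughly Gaussian region; a sketch of just the two heads (hidden/latent sizes are illustrative):

```python
import torch
import torch.nn as nn

class EncoderHeads(nn.Module):
    def __init__(self, h=256, z=32):
        super().__init__()
        self.ae = nn.Linear(h, z)       # AE: one deterministic code per input
        self.mu = nn.Linear(h, z)       # VAE: mean of q(z|x)
        self.logvar = nn.Linear(h, z)   # VAE: log-variance of q(z|x)

    def forward(self, feats):
        z_ae = self.ae(feats)
        mu, logvar = self.mu(feats), self.logvar(feats)
        z_vae = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparam.
        return z_ae, z_vae
```

For non-generative embedding, people typically take `mu` as the code; the KL term is what tends to make the VAE space more continuous, at some cost in reconstruction sharpness.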
3 votes, 2 answers

Embedding data into a higher-dimensional space

Embeddings, or latent spaces, are vector spaces that we embed our initial data into for further processing. The benefit of doing so, as far as I am aware, is to reduce the dimension. Often data has many discrete features that don't make sense to…
3 votes, 1 answer

What are state of the art methods for creating embeddings for sets?

I want to create embeddings in $\mathbb{R}^D$ for sets. So I want a function (probably a neural network) that takes in a set $S = \{ s_1, \dots, s_n \}$ (ideally of any size, so the number of elements may vary) and produces…
Charlie Parker
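The canonical baseline is Deep Sets (Zaheer et al., 2017): embed each element with a shared network $\phi$, pool with a permutation-invariant sum, then map with $\rho$; a minimal PyTorch sketch:

```python
import torch
import torch.nn as nn

class DeepSetEncoder(nn.Module):
    """f(S) = rho(sum_i phi(s_i)): order-invariant and size-agnostic."""
    def __init__(self, in_dim=3, hidden=128, out_dim=64):
        super().__init__()
        self.phi = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU(),
                                 nn.Linear(hidden, hidden))
        self.rho = nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(),
                                 nn.Linear(hidden, out_dim))

    def forward(self, S):                        # S: (n_elements, in_dim)
        return self.rho(self.phi(S).sum(dim=0))  # sum-pooling erases ordering

enc = DeepSetEncoder()
print(enc(torch.randn(5, 3)).shape)  # torch.Size([64]); any n_elements works
```

Attention-based pooling (Set Transformer, Lee et al., 2019) is the usual step up when simple sum/mean pooling loses too much information.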
3 votes, 1 answer

Is the Keras Embedding layer dependent on the target label?

I learned how to 'use' the Keras Embedding layer, but I am not able to find any more specific information about the actual behavior and training process of this layer. For now, I understand that the Keras Embedding layer maps distinct categorical…
Jan Musil
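The short answer is yes: the Embedding layer is an ordinary trainable weight matrix updated by backpropagation from the task loss, so the learned vectors do depend on the target labels; a minimal sketch with toy data:

```python
import numpy as np
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=1000, output_dim=8),  # 1000 x 8 table
    tf.keras.layers.GlobalAveragePooling1D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")

X = np.random.randint(0, 1000, size=(32, 10))  # toy categorical sequences
y = np.random.randint(0, 2, size=(32, 1))      # toy binary labels
model.fit(X, y, epochs=1, verbose=0)
# Gradients flow from the loss into the table, so different labels for the
# same inputs would produce different embeddings.
```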
3 votes, 0 answers

Can you use VAEs to produce deep word embeddings?

There are many articles about applications of VAEs, such as image reconstruction, denoising, and data compression/augmentation. However, I have not seen an example of embeddings for high-dimensional data such as words. Are there some papers about the…
3 votes, 1 answer

Facebook's InferSent intuition

When reviewing InferSent's architecture here, I noticed that, after encoding the premise and hypothesis to obtain two vectors u and v, they feed the set of fully connected layers with: (u, v), the concatenation of u and v; u * v, the…
ryuzakinho
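For reference, the InferSent paper (Conneau et al., 2017) feeds the classifier the combination $(u, v, |u - v|, u * v)$; a sketch of the feature construction:

```python
import torch

def infersent_features(u, v):
    """Concatenation, absolute difference, and elementwise product.
    |u - v| and u * v give the classifier symmetric, alignment-sensitive
    signals that the raw concatenation (u, v) alone does not expose."""
    return torch.cat([u, v, torch.abs(u - v), u * v], dim=-1)

u = torch.randn(4, 4096)  # premise encodings (4096: the paper's BiLSTM-max size)
v = torch.randn(4, 4096)  # hypothesis encodings
print(infersent_features(u, v).shape)  # torch.Size([4, 16384])
```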
2 votes, 0 answers

Is there an MDS/embedding algorithm that is more suitable for the goal of clustering a graph?

I am testing ideas for clustering a particular graph. After testing a set of graph clustering/community detection algorithms, I thought about mapping the graph to a vector space and using vector-space clustering algorithms, say GMM, in…
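One graph-aware option is spectral embedding of the (precomputed) affinity matrix, which tends to place nodes of the same community close together before a GMM is fit; a hedged scikit-learn sketch with a stand-in adjacency matrix:

```python
import numpy as np
from sklearn.manifold import SpectralEmbedding
from sklearn.mixture import GaussianMixture

# Stand-in graph: A is a symmetric 0/1 adjacency matrix.
rng = np.random.default_rng(0)
A = np.triu((rng.random((60, 60)) < 0.1).astype(float), k=1)
A = A + A.T

coords = SpectralEmbedding(n_components=8, affinity="precomputed").fit_transform(A)
labels = GaussianMixture(n_components=4, random_state=0).fit_predict(coords)
```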