Tags » Embeddings

Char2vec - Character embeddings for word similarity

Most of my applied data science work is in text-heavy domains where the objects are small, there isn’t a clear “vocabulary”, and most of the tasks focus on similarity.


Weekly Review: 12/03/2017

Missed a post last week due to the Thanksgiving long weekend :-). We had gone to San Francisco to see the city and try out a couple of hikes.

Machine Learning/AI

Embedding Layers, Autoencoders and High Dimensional Forecasting


This post will illustrate how to use neural networks to do dimensionality reduction and generate usable factors. I will first discuss methods typically used in machine learning for dimensionality reduction: PCA regression, LASSO, and ridge regression.


Code: Word2Vec in Spark

Here is a snippet that might be useful to you if you are looking to implement Word2Vec and save the embeddings of the trained model. I’ve added types to the variables as well as to some placeholder names to make it easier to understand what is expected as an input to various functions…