In the last post, we talked about neural language models and how they could be used to predict the next word in some context. Now, one might ask how we could use the idea for our feature learning model? 376 more words