Distributional Semantic Modeling System
A Distributional Semantic Modeling System is a data-driven semantic modeling system/data-driven word vectorizing system that can solve a distributional word vectorizing function training task by implementing a Distributional Semantic Modeling Algorithm.
- AKA: Distributional Word Vectorizing Function Training System.
- Context:
- It can range from being a Dense Distributional Semantic Modeling System to being a Sparse Distributional Semantic Modeling System.
- It can range from being a Continuous Distributional Semantic Modeling System to being a Discrete Distributional Semantic Modeling System.
- …
- Example(s):
- a word2vec System, which learns dense, low-dimensional word vector representations (see the 2014 reference below).
- Counter-Example(s):
- See: Supervised Word Classification System, Word Similarity Function Training System, Word-Space Model.
References
2014
- http://www.marekrei.com/blog/linguistic-regularities-word-representations/
- QUOTE: As the first step, we need to create feature vectors for each word in our vocabulary. The two main ways of doing this, which are also considered by this paper, are:
- BOW: The bag-of-words approach. We count all the words that a certain word appears with, treat the counts as features in a vector, and weight them using something like positive pointwise mutual information (PPMI); a minimal code sketch follows this reference block.
- word2vec: Vectors created using the word2vec toolkit. Low-dimensional dense vector representations are learned for each word using a neural network.
- If you want to learn more details about these models, take a look at an earlier post comparing these two models.
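- A minimal sketch of the BOW/PPMI recipe described above (not taken from the cited post; the toy corpus, window size, and variable names are illustrative assumptions): count the words that co-occur with each target word inside a fixed window, then reweight the raw counts with positive pointwise mutual information.

```python
# Minimal BOW/PPMI sketch (illustrative assumptions: toy corpus, window size of 2).
from collections import Counter, defaultdict
import math

corpus = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
]
window = 2  # assumed context-window size

# 1. Count co-occurrences of each target word with its context words.
cooc = defaultdict(Counter)
for sentence in corpus:
    for i, word in enumerate(sentence):
        for j in range(max(0, i - window), min(len(sentence), i + window + 1)):
            if j != i:
                cooc[word][sentence[j]] += 1

# 2. Reweight counts with positive PMI:
#    PPMI(w, c) = max(0, log(P(w, c) / (P(w) * P(c))))
total = sum(sum(ctx.values()) for ctx in cooc.values())
word_totals = {w: sum(ctx.values()) for w, ctx in cooc.items()}
ctx_totals = Counter()
for ctx in cooc.values():
    ctx_totals.update(ctx)

ppmi = {
    w: {
        c: max(0.0, math.log((n / total) /
                             ((word_totals[w] / total) * (ctx_totals[c] / total))))
        for c, n in ctx.items()
    }
    for w, ctx in cooc.items()
}

print(ppmi["cat"])  # sparse PPMI-weighted feature vector for "cat"
```

- Each resulting dictionary is a sparse, PPMI-weighted feature vector of the kind the BOW approach produces; the word2vec approach instead learns dense, low-dimensional vectors for each word with a neural network.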
2010
- Magnus Sahlgren. https://www.sics.se/~mange/research.html
- QUOTE: My research is focused on how semantic knowledge is acquired and represented in man and machine. In particular, I study the distributional approach to semantic knowledge acquisition, in which semantic information is extracted from cooccurrence statistics. The underlying idea is that meanings are correlated with the distributional patterns of linguistic entities.