word-word Co-occurrence
A word-word Co-occurrence is a co-occurrence of two word mentions.
- See: bigram.
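- Example: a minimal sketch of counting word-word co-occurrences, assuming that two word mentions co-occur when they lie within a fixed number of token positions of each other. The function name `cooccurrence_counts`, the `window` parameter, and the whitespace tokenization are illustrative assumptions, not part of any cited method.

```python
from collections import Counter


def cooccurrence_counts(tokens, window=2):
    """Count unordered word pairs whose mentions lie within `window` positions.

    `window` is a hypothetical parameter: the maximum distance, in tokens,
    between two mentions for them to be counted as co-occurring.
    """
    counts = Counter()
    for i, left in enumerate(tokens):
        # Pair this mention with the next `window` mentions to its right.
        for right in tokens[i + 1 : i + 1 + window]:
            # Sort the pair so that (a, b) and (b, a) share one key.
            counts[tuple(sorted((left, right)))] += 1
    return counts


tokens = "the cat sat on the mat".split()
counts = cooccurrence_counts(tokens, window=2)
```

With `window=1` only adjacent mentions are paired, so the counts reduce to order-insensitive bigram counts.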
References
2012
- John A. Bullinaria, and Joseph P. Levy. (2012). “Extracting Semantic Representations from Word Co-occurrence Statistics: stop-lists, stemming, and SVD.” Behavior Research Methods 44, no. 3.
- ABSTRACT: In a previous article, we presented a systematic computational study of the extraction of semantic representations from the word–word co-occurrence statistics of large text corpora. The conclusion was that semantic vectors of pointwise mutual information values from very small co-occurrence windows, together with a cosine distance measure, consistently resulted in the best representations across a range of psychologically relevant semantic tasks. This article extends that study by investigating the use of three further factors — namely, the application of stop-lists, word stemming, and dimensionality reduction using singular value decomposition (SVD) — that have been used to provide improved performance elsewhere. It also introduces an additional semantic task and explores the advantages of using a much larger corpus. This leads to the discovery and analysis of improved SVD-based methods for generating semantic representations (that provide new state-of-the-art performance on a standard TOEFL task) and the identification and discussion of problems and misleading results that can arise without a full systematic study.
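- Example: a rough sketch of the kind of representation the abstract describes, namely vectors of pointwise mutual information values computed from a small co-occurrence window and compared with a cosine measure. It is not the authors' implementation. The function names, the symmetric one-token window, and the choice to keep only positive PMI values are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def pmi_vectors(tokens, window=1):
    """Build sparse PMI vectors from symmetric window co-occurrence counts.

    Assumptions of this sketch: a symmetric window of `window` tokens on each
    side, and only positive PMI values are kept in the vectors.
    """
    pair = Counter()
    for i, word in enumerate(tokens):
        start = max(0, i - window)
        stop = min(len(tokens), i + window + 1)
        for j in range(start, stop):
            if j != i:
                pair[(word, tokens[j])] += 1

    total = sum(pair.values())
    # The window is symmetric, so word and context marginals coincide.
    marginal = Counter()
    for (word, _context), n in pair.items():
        marginal[word] += n

    vectors = defaultdict(dict)
    for (word, context), n in pair.items():
        # PMI = log( p(word, context) / (p(word) * p(context)) )
        pmi = math.log((n * total) / (marginal[word] * marginal[context]))
        if pmi > 0:
            vectors[word][context] = pmi
    return dict(vectors)


def cosine_similarity(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v.get(key, 0.0) for key, x in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


tokens = "the cat sat on the mat the dog sat on the rug".split()
vectors = pmi_vectors(tokens, window=1)
similarity = cosine_similarity(vectors["cat"], vectors["dog"])
```

In this toy text "cat" and "dog" have the same neighbours, so their vectors point in the same direction; a real corpus would need far more data before such similarities become meaningful.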
1969
- Michael E. Lesk. (1969). “Word-Word Associations in Document Retrieval Systems.” American Documentation 20, no. 1.
- QUOTE: Word-Word Associations in Document Retrieval Systems ... Recently automatic methods dependent on statistical co-occurrence of words have been proposed for the determination of word meanings and the selection of synonymous words, and it has been asserted that the use of ...