Cluster Hypothesis

From GM-RKB

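A Cluster Hypothesis is a model assumption about the clustering properties of the data: items that fall in the same cluster are expected to share a property of interest, such as their class label or their relevance to an information need.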
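
The classification form of the assumption can be illustrated with a small synthetic sketch. Everything in it is an illustrative assumption rather than part of any cited source: the blob centers, the spread, the class names "A" and "B", the helper names, and the use of a plain k-means with farthest-point initialisation as the clustering step. Two of the three blobs share class "A", reflecting that several clusters may form a single class.

```python
import random
from collections import Counter

def make_blobs(centers, n_per_blob, spread, rng):
    """Draw n_per_blob Gaussian points around each center; return points and blob ids."""
    points, blob_ids = [], []
    for blob_id, (cx, cy) in enumerate(centers):
        for _ in range(n_per_blob):
            points.append((rng.gauss(cx, spread), rng.gauss(cy, spread)))
            blob_ids.append(blob_id)
    return points, blob_ids

def sq_dist(p, q):
    """Squared Euclidean distance between two 2-D points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def kmeans(points, k, n_iter=20):
    """Plain Lloyd's k-means with deterministic farthest-point initialisation."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Pick the point that is farthest from its nearest current centroid.
        centroids.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centroids)))
    assign = []
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        assign = [min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, a in zip(points, assign) if a == j]
            if members:
                centroids[j] = (sum(x for x, _ in members) / len(members),
                                sum(y for _, y in members) / len(members))
    return assign

def cluster_purity(assign, labels):
    """Fraction of points whose label matches the majority label of their cluster."""
    by_cluster = {}
    for a, y in zip(assign, labels):
        by_cluster.setdefault(a, Counter())[y] += 1
    return sum(c.most_common(1)[0][1] for c in by_cluster.values()) / len(labels)

rng = random.Random(0)
centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]  # hypothetical, well separated
points, blob_ids = make_blobs(centers, n_per_blob=50, spread=0.5, rng=rng)

# Two blobs share class "A": several clusters may form one class.
blob_to_class = {0: "A", 1: "B", 2: "A"}
labels = [blob_to_class[b] for b in blob_ids]

assign = kmeans(points, k=3)
purity = cluster_purity(assign, labels)

# Semi-supervised use: reveal one label per cluster and propagate it to the whole cluster.
seed_label = {}
for a, y in zip(assign, labels):
    seed_label.setdefault(a, y)
predicted = [seed_label[a] for a in assign]
accuracy = sum(p == y for p, y in zip(predicted, labels)) / len(labels)

print("purity:", purity, "accuracy from one label per cluster:", accuracy)
```

When the hypothesis holds, as it does by construction here, cluster purity is high and a single revealed label per cluster is enough to label the rest. On data where classes overlap within clusters, both numbers would drop, which is one simple way to check whether the assumption is reasonable for a given dataset.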



References

2017

  • (Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/cluster_hypothesis Retrieved:2017-6-22.
    • In machine learning and information retrieval, the cluster hypothesis is an assumption about the nature of the data handled in those fields, which takes various forms. In information retrieval, it states that documents that are clustered together "behave similarly with respect to relevance to information needs".[1] In terms of classification, it states that if points are in the same cluster, they are likely to be of the same class.[2] There may be multiple clusters forming a single class.
  1. http://nlp.stanford.edu/IR-book/html/htmledition/clustering-in-information-retrieval-1.html
  2. O. Chapelle, B. Schölkopf, and A. Zien, Semi-Supervised Learning, MIT Press, 2006.