Latent Semantic Indexing Algorithm

From GM-RKB

A Latent Semantic Indexing Algorithm is an automatic indexing and information retrieval algorithm that is based on the analysis of the distributional semantics of a set of documents.



References

2019

2014a

2014b

2011a

  • (Wikipedia, 2011) ⇒ http://en.wikipedia.org/wiki/Latent_semantic_indexing
    • Latent Semantic Indexing (LSI) is an indexing and retrieval method that uses a mathematical technique called singular value decomposition (SVD) to identify patterns in the relationships between the terms and concepts contained in an unstructured collection of text. LSI is based on the principle that words that are used in the same contexts tend to have similar meanings. A key feature of LSI is its ability to extract the conceptual content of a body of text by establishing associations between those terms that occur in similar contexts.
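
The SVD-based procedure described in the quote above can be sketched roughly as follows. This is a minimal illustration using NumPy, not a reference implementation: the toy corpus, the raw-count weighting, the choice of k = 2, and the helper names (build_term_document_matrix, lsi_fit, fold_in_query, rank_documents) are all assumptions made for the example. Real systems typically apply a term weighting such as TF-IDF and use a much larger k.

```python
import numpy as np

# Hypothetical toy corpus, invented for this illustration.
DOCS = [
    "car engine repair",
    "automobile engine repair",
    "car automobile dealer",
    "banana fruit smoothie",
    "fruit banana bread",
]


def build_term_document_matrix(docs):
    """Return (vocab, A) where A[i, j] counts term i in document j."""
    vocab = sorted({word for doc in docs for word in doc.split()})
    row = {word: i for i, word in enumerate(vocab)}
    A = np.zeros((len(vocab), len(docs)))
    for j, doc in enumerate(docs):
        for word in doc.split():
            A[row[word], j] += 1.0
    return vocab, A


def lsi_fit(A, k):
    """Truncated SVD: keep the k largest singular triplets of A."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


def fold_in_query(query, vocab, U_k, s_k):
    """Project a bag-of-words query into the k-dimensional latent space."""
    row = {word: i for i, word in enumerate(vocab)}
    q = np.zeros(len(vocab))
    for word in query.split():
        if word in row:  # out-of-vocabulary words are ignored
            q[row[word]] += 1.0
    return (q @ U_k) / s_k  # q^T U_k S_k^{-1}


def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def rank_documents(query, docs, k=2):
    """Return the latent-space cosine similarity of the query to each document."""
    vocab, A = build_term_document_matrix(docs)
    U_k, s_k, Vt_k = lsi_fit(A, k)
    q_hat = fold_in_query(query, vocab, U_k, s_k)
    # Column j of Vt_k holds the latent coordinates of document j.
    return [cosine(q_hat, Vt_k[:, j]) for j in range(len(docs))]


similarities = rank_documents("car", DOCS, k=2)
for doc, sim in zip(DOCS, similarities):
    print(f"{sim:+.2f}  {doc}")
```

In this toy corpus the query "car" scores highly against "automobile engine repair" even though the two share no literal term, because "car" and "automobile" occur in similar contexts ("engine", "repair", "dealer"). The fruit documents score near zero. This is the kind of association between co-occurring terms that the quote describes, though the clean separation here is a product of the deliberately simple data.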

2011b

2007a

2007b

2003

1998

1997a

1997b

1997c

1990


  1. By “semantic structure” we mean here only the correlation structure in the way in which individual words appear in documents; “semantic” implies only the fact that terms in a document may be taken as referents to the document itself or to its topic.