Very Large Text Corpora
(Redirected from massive amounts of textual content)
Jump to navigation
Jump to search
A Very Large Text Corpora is a large corpora that is a very large dataset (based on a corpus size measure).
- Example(s):
- a Web Snapshot, by Common Crawl.
- a Library of Congress.
- …
- Counter-Example(s):
- a Relatively Large Corpora, such as a PubMed Snapshot, or a Wikipedia Snapshot, or an ACM DL Snapshot.
- a Small Corporate.
- See: Corpora, Digital Library.