ClueWeb12 Database

From GM-RKB

A ClueWeb12 Database is a database of ...



References

2016

  • http://lemurproject.org/clueweb12/
    • QUOTE: The ClueWeb12 dataset was created to support research on information retrieval and related human language technologies. The dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. ClueWeb12 is a companion or successor to the ClueWeb09 web dataset. Distribution of ClueWeb12 began in January 2013. … The ClueWeb12 datasets are distributed by Carnegie Mellon University for research purposes only.