TREC LA Times Dataset
Jump to navigation
Jump to search
The TREC LA Times Dataset was a Corpus that ..
- AKA: LA Times Dataset, LA Times Corpus.
- Context:
- It was a part of the TREC Question Answering Track.
References
2009
- (Hu et al., 1999) ⇒ Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, and Xiaohua Zhou. (2009). “Exploiting Wikipedia as External Knowledge for Document Clustering.” In: Proceedings of ACM SIGKDD Conference (KDD-2009). doi:10.1145/1557019.1557066
- We perform clustering experiments on three datasets: TDT2, LA Times (from TREC), and 20-newsgroups (20NG). We selected ... 18,547 documents from top ten sections of LA Times, ... The ten sections selected from LA Times are Entertainment, Financial, Foreign, Late Final, Letters, Metro, National, Sports, Calendar, and View.
2000
- AP newswire (Disks 1-3)
- Wall Street Journal (Disks 1-2)
- San Jose Mercury News (Disk 3)
- Financial Times (Disk 4)
- Los Angeles Times (Disk 5)
- Foreign Broadcast Information Service (FBIS) (Disk 5)