2015 RealTimeTopRTopicDetectiononTwi

From GM-RKB
Jump to navigation Jump to search

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Abstract

Twitter is a "what's-happening-right-now " tool that enables interested parties to follow thoughts and commentary of individual users in nearly real-time. While it is a valuable source of information for real-time topic detection and tracking, Twitter data are not clean because of noisy messages and users, which significantly diminish the reliability of obtained results.

In this paper, we integrate both the extraction of meaningful topics and the filtering of messages over the Twitter stream. We develop a streaming algorithm for a sequence of document-frequency tables; our algorithm enables real-time monitoring of the top-10 topics from approximately 25% of all Twitter messages, while automatically filtering noisy and meaningless topics. We apply our proposed streaming algorithm to the Japanese Twitter stream and successfully demonstrate that, compared with other online nonnegative matrix factorization methods, our framework both tracks real-world events with high accuracy in terms of the perplexity and simultaneously eliminates irrelevant topics.

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2015 RealTimeTopRTopicDetectiononTwiKen-ichi Kawarabayashi
Kohei Hayashi
Takanori Maehara
Masashi Toyoda
Real-Time Top-R Topic Detection on Twitter with Topic Hijack Filtering10.1145/2783258.27834022015