2011 UnbiasedOnlineActiveLearninginD
- (Chu et al., 2011) ⇒ Wei Chu, Martin Zinkevich, Lihong Li, Achint Thomas, and Belle Tseng. (2011). “Unbiased Online Active Learning in Data Streams.” In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2011). ISBN:978-1-4503-0813-7, doi:10.1145/2020408.2020444
Subject Headings:
Notes
Cited By
- http://scholar.google.com/scholar?q=%222011%22+Unbiased+Online+Active+Learning+in+Data+Streams
- http://dl.acm.org/citation.cfm?id=2020408.2020444&preflayout=flat#citedby
Quotes
Author Keywords
- Active learning; adaptive importance sampling; algorithms; bayesian online learning; classifier design and evaluation; data streaming; experimentation; performance; probabilistic algorithms; unbiasedness
Abstract
Unlabeled samples can be intelligently selected for labeling to minimize classification error. In many real-world applications, a large number of unlabeled samples arrive in a streaming manner, making it impossible to maintain all the data in a candidate pool. In this work, we focus on binary classification problems and study selective labeling in data streams where a decision is required on each sample sequentially. We consider the unbiasedness property in the sampling process, and design optimal instrumental distributions to minimize the variance in the stochastic process. Meanwhile, Bayesian linear classifiers with weighted maximum likelihood are optimized online to estimate parameters. In empirical evaluation, we collect a data stream of user-generated comments on a commercial news portal in 30 consecutive days, and carry out offline evaluation to compare various sampling strategies, including unbiased active learning, biased variants, and random sampling. Experimental results verify the usefulness of online active learning, especially in the non-stationary situation with concept drift.
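The selective-labeling loop described in the abstract can be illustrated with a minimal Python sketch. This is an assumption-laden illustration, not the authors' method: the `query_probability` rule, its `floor` and `sharpness` parameters, the plain logistic-regression update, and the function names are all hypothetical stand-ins for the paper's optimal instrumental distributions and Bayesian linear classifier. What the sketch does show is the general idea of importance weighting: each sample is queried with a probability `q` that depends only on the current model and the features, and a queried sample's gradient is scaled by `1/q`, so that the expected update matches the update that labeling every sample would give.

```python
import math
import random


def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def query_probability(margin, floor=0.1, sharpness=1.0):
    """Hypothetical instrumental distribution: uncertain samples are queried more often.

    This is NOT the optimal distribution derived in the paper. The floor keeps
    every sample selectable (q > 0), which importance weighting requires.
    """
    return max(floor, math.exp(-sharpness * abs(margin)))


def run_stream(stream, dim, lr=0.1, seed=0):
    """Single pass over (x, y) pairs with y in {0, 1}.

    Returns the learned weight vector and the number of labels requested.
    """
    rng = random.Random(seed)
    w = [0.0] * dim
    labels_used = 0
    for x, y in stream:
        margin = sum(wi * xi for wi, xi in zip(w, x))
        q = query_probability(margin)
        if rng.random() >= q:
            continue  # label not requested; the sample is discarded
        labels_used += 1
        importance = 1.0 / q  # corrects for the non-uniform selection
        err = sigmoid(margin) - y  # gradient of the log loss w.r.t. the margin
        w = [wi - lr * importance * err * xi for wi, xi in zip(w, x)]
    return w, labels_used
```

Calling `run_stream(stream, dim)` on any iterable of `(features, label)` pairs processes each sample once, in order, and never stores the stream, which mirrors the setting where no candidate pool can be maintained.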
References
Page | Author | title | doi | year |
---|---|---|---|---|
2011 UnbiasedOnlineActiveLearninginD | Wei Chu, Belle Tseng, Martin Zinkevich, Lihong Li, Achint Thomas | Unbiased Online Active Learning in Data Streams | 10.1145/2020408.2020444 | 2011 |