2015 MonitoringLeastSquaresModelsofD
- (Gabel et al., 2015) ⇒ Moshe Gabel, Daniel Keren, and Assaf Schuster. (2015). “Monitoring Least Squares Models of Distributed Streams.” In: Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2015). ISBN:978-1-4503-3664-2 doi:10.1145/2783258.2783349
Subject Headings:
Notes
Cited By
- http://scholar.google.com/scholar?q=%222015%22+Monitoring+Least+Squares+Models+of+Distributed+Streams
- http://dl.acm.org/citation.cfm?id=2783258.2783349&preflayout=flat#citedby
Quotes
Author Keywords
- Correlation and regression analysis; data mining; distributed streams; least squares; least squares methods; regression
Abstract
Least squares regression is widely used to understand and predict data behavior in many fields. As data evolves, regression models must be recomputed, and indeed much work has focused on quick, efficient and accurate computation of linear regression models. In distributed streaming settings, however, periodically recomputing the global model is wasteful: communicating new observations or model updates is required even when the model is, in practice, unchanged. This is prohibitive in many settings, such as in wireless sensor networks, or when the number of nodes is very large. The alternative, monitoring prediction accuracy, is not always sufficient: in some settings, for example, we are interested in the model's coefficients, rather than its predictions. We propose the first monitoring algorithm for multivariate regression models of distributed data streams that guarantees a bounded model error. It maintains an accurate estimate using a fraction of the communication by recomputing only when the precomputed model is sufficiently far from the (hypothetical) current global model. When the global model is stable, no communication is needed.
Experiments on real and synthetic datasets show that our approach reduces communication by up to two orders of magnitude while providing an accurate estimate of the current global model in all nodes.
References
;
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2015 MonitoringLeastSquaresModelsofD | Assaf Schuster Moshe Gabel Daniel Keren | Monitoring Least Squares Models of Distributed Streams | 10.1145/2783258.2783349 | 2015 |