2013 BigDataAnalyticswithSmallFootpr
- (Canny & Zhao, 2013) ⇒ John Canny, and Huasha Zhao. (2013). “Big Data Analytics with Small Footprint: Squaring the Cloud.” In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ISBN:978-1-4503-2174-7 doi:10.1145/2487575.2487677
Subject Headings:
Notes
Cited By
- http://scholar.google.com/scholar?q=%222013%22+Big+Data+Analytics+with+Small+Footprint%3A+Squaring+the+Cloud
- http://dl.acm.org/citation.cfm?id=2487575.2487677&preflayout=flat#citedby
Quotes
Author Keywords
Abstract
This paper describes the BID Data Suite, a collection of hardware, software and design patterns that enable fast, large-scale data mining at very low cost. By co-designing all of these elements we achieve single-machine performance levels that equal or exceed reported cluster implementations for common benchmark problems. A key design criterion is rapid exploration of models, hence the system is interactive and primarily single-user. The elements of the suite are: (i) the data engine, a hardware design pattern that balances storage, CPU and GPU acceleration for typical data mining workloads, (ii) BIDMat, an interactive matrix library that integrates CPU and GPU acceleration and novel computational kernels, (iii) BIDMach, a machine learning system that includes very efficient model optimizers, (iv) Butterfly mixing, a communication strategy that hides the latency of frequent model updates needed by fast optimizers, and (v) design patterns to improve performance of iterative update algorithms. We present several benchmark problems to show how the above elements combine to yield multiple orders-of-magnitude improvements for each problem.
References
| | Author | title | doi | year |
|---|---|---|---|---|
| 2013 BigDataAnalyticswithSmallFootpr | John Canny, Huasha Zhao | Big Data Analytics with Small Footprint: Squaring the Cloud | 10.1145/2487575.2487677 | 2013 |