Spark Shark Database Querying System
Jump to navigation
Jump to search
A Spark Shark Database Querying System is a Database Querying System that works with Spark System.
References
2014
- https://github.com/amplab/shark/wiki
- Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can execute Hive QL queries up to 100 times faster than Hive without any modification to the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions, providing seamless integration with existing Hive deployments and a familiar, more powerful option for new ones.