Spark-based System
Jump to navigation
Jump to search
A Spark-based System is a cluster-based data processing system that is based on a Spark framework.
- Context:
- It can (typically) be used to run Spark Programs.
- Example(s):
- one based on the instructions found at http://docs.google.com/document/d/14lF3GZoWHoe0CkdViCcWa6fE_92sIvgOarsQQJg7ckY
- Counter-Example(s):
- a Hadoop-based System, such as one using Apache Pig.
- an AWS Lambda-based System.
- See: DataBricks, Amazon EMR.