Distributed Dataset
A Distributed Dataset is a dataset whose records are physically distributed across multiple storage devices or computing nodes.
- Context:
- It can range from being a File-based Distributed Dataset to being an In-Memory Distributed Dataset (such as a Spark DataFrame).
- Example(s):
- a dataset in a Distributed DBMS.
- a Bitcoin Ledger.
- …
- Counter-Example(s):
- See: Big Data, Sharded Database.
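To make the idea of physical distribution concrete, the following is a minimal single-process sketch, not how any particular system implements it. It assumes a dataset of (key, value) records and simulates nodes as plain Python lists; the names `partition`, `local_sum`, and `merge` are hypothetical helpers introduced only for illustration. Records are assigned to partitions by hashing their keys, each partition is aggregated on its own, and the partial results are then combined.

```python
from collections import defaultdict
from zlib import crc32


def partition(records, num_nodes):
    """Assign each (key, value) record to one of num_nodes partitions by key hash."""
    partitions = [[] for _ in range(num_nodes)]
    for key, value in records:
        # crc32 is deterministic across runs, unlike Python's built-in hash() for str.
        node = crc32(key.encode("utf-8")) % num_nodes
        partitions[node].append((key, value))
    return partitions


def local_sum(partition_records):
    """Aggregate one partition in isolation, as a single node would."""
    totals = defaultdict(int)
    for key, value in partition_records:
        totals[key] += value
    return dict(totals)


def merge(partials):
    """Combine the per-partition results into one global result."""
    merged = defaultdict(int)
    for partial in partials:
        for key, total in partial.items():
            merged[key] += total
    return dict(merged)


# Hypothetical sample data; each list in `partitions` stands in for one node.
records = [("alice", 3), ("bob", 5), ("alice", 2), ("carol", 7), ("bob", 1)]
partitions = partition(records, num_nodes=3)
result = merge(local_sum(p) for p in partitions)
```

Because records with the same key always hash to the same partition, each per-partition sum is already complete for its keys. The `merge` step is still written to add overlapping keys, so it would also work under a partitioning scheme that does not group by key.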