Data Item Collection
(Redirected from dataset)
Jump to navigation
Jump to search
A Data Item Collection is a machine-processable artifact that represents a collection of one or more data items.
- AKA: Data Set Data Store.
- Context:
- It can range from being a Structured Dataset (with a data structure) to being an Unstructured Dataset.
- It can range from being a Small Dataset to being a Large Dataset.
- It can range from being an Unordered Dataset to being an Ordered Dataset.
- It can range from being a Categorical Dataset to being an Ordinal Dataset to being a Numeric Dataset to being a Mixed Dataset.
- It can range from being a Simple Dataset to being a Complex Dataset.
- It can range from being an In-Memory Data Object or a In-File Data Object.
- It can range from being a Public Dataset to being a Private Dataset.
- It can range from being a Persistent Data Object to being a Temporary Data Object.
- It can range from being a Sparse Data Object to being a Standard Data Object to being a Dense Data Object.
- It can range from being a Human-Processable Dataset to being a Human-Nonprocessable Dataset (such as a Binary File).
- It can range from being a Real-World Dataset, to being a Realistic Dataset, to being a Synthetic Dataset.
- It can be associated to one or more Dataset Itemset (subset of data items), such as a frequent itemset.
- It can be, depending on the Data Item Type, a Non-Vector Dataset, a Vector Dataset, a Graph Dataset.
- It can have Outlier Records.
- ...
- Example(s):
- General Data Item Collections:
- a Data Record Set used in database systems.
- a Data Item String representing textual data.
- a Data Item Sequence used in sequential analysis.
- Storage and Cloud Data Collections:
- an AWS S3 Object for cloud storage.
- a Log File Dataset for monitoring and analyzing system events.
- Application-Specific Data Collections:
- a Sensor Dataset for environmental monitoring.
- a Customer Transaction Dataset used in retail analytics.
- a Genomic Dataset for studying genetic information.
- a Social Media Dataset used for sentiment analysis.
- a Weather Dataset used for climate research and forecasting.
- ...
- General Data Item Collections:
- Counter-Example(s):
- an Abstract Set.
- a Data Structure, a set of Data Constraints.
- a Queue Data Structure.
- See: Data Stream; Corpus, Table (Database), Data Matrix (Multivariate Statistics), Column (Database), Row (Database), Value (Computer Science), Qualitative Data, Quantitative Data, Variable And Attribute (Research), Information, Measurement, Data Reporting, Data Analysis, Data, Table (Database), Data Matrix (Multivariate Statistics), Column (Database), Row (Database), Datum.
References
2015
- (Wikipedia, 2015) ⇒ http://en.wikipedia.org/wiki/data_store Retrieved:2015-5-31.
- A data store is a repository of a set of data objects. These objects are modelled using classes defined in a database schema. A data store is a general concept that includes not just repositories like databases, but also simpler store types such as flat files etc.
Some data stores represent data in only one schema, while other data stores use several schemas for this task. An example are RDBMS-based data stores like MySQL or PostgreSQL.
- A data store is a repository of a set of data objects. These objects are modelled using classes defined in a database schema. A data store is a general concept that includes not just repositories like databases, but also simpler store types such as flat files etc.
2009
- http://en.wikipedia.org/wiki/Data_(computing)
- In computer science, data is anything in a form suitable for use with a computer. Data is often distinguished from programs. A program is a set of instructions that detail a task for the computer to perform. In this sense, data is thus everything that is not program code.
In an alternate usage, binary files (which are not human-readable) are sometimes called "data" as distinguished from human-readable “text”. The total amount of digital data in 2007 was estimated to be 281 billion gigabytes.
- In computer science, data is anything in a form suitable for use with a computer. Data is often distinguished from programs. A program is a set of instructions that detail a task for the computer to perform. In this sense, data is thus everything that is not program code.