Real-World Dataset
A Real-World Dataset is a data set of real-world data records (that represent some sensed event).
- Context:
- It can range from being a Small Real-World Dataset to being a Large Real-World Dataset.
- It can be created by a Data Collection Task.
- It can be approximated by a Realistic Dataset.
- Example(s):
- a Real-World Tabular Dataset, such as: Fisher's Iris Dataset.
- a Real-World Clinical Dataset.
- a Real-World Corpus.
- a Real-World Time Series Dataset.
- a Real-World Evidence Dataset.
- …
- Counter-Example(s):
- any Synthetic Dataset.
- See: Data Source, Real-World Application, Sensor Data.
References
2021
- (Wikipedia, 2021) ⇒ https://en.wikipedia.org/wiki/real_world_data Retrieved:2021-12-10.
- Real world data (RWD) in medicine is data derived from a number of sources that are associated with outcomes in a heterogeneous patient population in real-world settings, such as patient surveys, clinical trials, and observational cohort studies.
2008
- (Wick et al., 2008) ⇒ Michael Wick, Khashayar Rohanimanesh, Karl Schultz, and Andrew McCallum. (2008). “A Unified Approach for Schema Matching, Coreference, and Canonicalization.” In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2008).
- QUOTE: Real World Data: For our real-world data we manually extracted faculty and alumni listings from 7 university websites,
1995
- (Kohavi, 1995) ⇒ Ron Kohavi. (1995). “A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection.” In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI 1995).
- QUOTE: We report on a large-scale experiment ... on these algorithms on real-world datasets. For cross-validation, we vary the number of folds and whether the folds are stratified or not; for bootstrap, we vary the number of bootstrap samples. Our results indicate that for real-world datasets similar to ours ...
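
The stratified cross-validation mentioned in the (Kohavi, 1995) quote can be illustrated with a minimal sketch. It assumes a label layout like that of Fisher's Iris Dataset (150 records, three classes of 50 each); the function name `stratified_k_fold`, the `seed` parameter, and the round-robin assignment are illustrative choices, not the procedure used in the cited paper.

```python
import random
from collections import Counter, defaultdict


def stratified_k_fold(labels, k, seed=0):
    """Split record indices into k test folds, spreading each class evenly.

    Hypothetical helper for illustration only; it is not taken from any
    of the cited sources.
    """
    rng = random.Random(seed)

    # Group record indices by their class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    offset = 0
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal each class's records round-robin across the folds.
        for i, idx in enumerate(indices):
            folds[(i + offset) % k].append(idx)
        # Carry the position over so fold sizes stay balanced overall.
        offset += len(indices)
    return folds


# Assumed label layout: three classes with 50 records each.
labels = ["setosa"] * 50 + ["versicolor"] * 50 + ["virginica"] * 50
folds = stratified_k_fold(labels, k=10)

for fold_number, test_indices in enumerate(folds):
    class_counts = Counter(labels[i] for i in test_indices)
    print(fold_number, len(test_indices), dict(class_counts))
```

Each fold serves once as the test set while the remaining folds form the training set. Changing `k` varies the number of folds, which is one of the parameters the quoted experiment reports varying.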