IID Random Variable Set
An iid random variable set is a random variable set where all random variables are in a statistical independence relation and in an identical distribution relation.
- AKA: Independent and Identically Distributed Random Variables.
- Context:
- It can represent an I.I.D. Dataset.
- It can represent an Independent Identical Random Experiments.
- It can range from a Random IID Sample to being a Non-Random IID Sample.
- It can range from a Small IID Random Variable Set to being a Large IID Random Variable Set.
- …
- Counter-Example(s):
- a Non-IID Dataset, such as a correlated variable set, such as a search query log.
- See: Random Experiment, Random Intervention Study, Law of Large Numbers, Time Dependency.
References
2013
- (Wikipedia, 2013) ⇒ http://en.wikipedia.org/wiki/Independent_and_identically_distributed_random_variables
- In probability theory and statistics, a sequence or other collection of random variables is independent and identically distributed (i.i.d.) if each random variable has the same probability distribution as the others and all are mutually independent.[1]
The abbreviation i.i.d. is particularly common in statistics (often as iid, sometimes written IID), where observations in a sample are often assumed to be effectively i.i.d. for the purposes of statistical inference. The assumption (or requirement) that observations be i.i.d. tends to simplify the underlying mathematics of many statistical methods (see mathematical statistics and statistical theory). However, in practical applications of statistical modeling the assumption may or may not be realistic. To test how realistic the assumption is on a given data set the autocorrelation can be computed, lag plots drawn or turning point test performed.[2] The generalization of exchangeable random variables is often sufficient and more easily met.
The assumption is important in the classical form of the central limit theorem, which states that the probability distribution of the sum (or average) of i.i.d. variables with finite variance approaches a normal distribution.
Note that IID refers to sequences of random variables. "Independent and identically distributed" implies an element in the sequence is independent of the random variables that came before it. In this way, an IID sequence is different from a Markov sequence, where the probability distribution for the nth random variable is a function of the previous random variable in the sequence (for a first order Markov sequence). An IID sequence does not imply the probabilities for all elements of the sample space or event space must be the same.[3] For example, repeated throws of loaded dice will produce a sequence that is IID, despite the outcomes being biased.
- In probability theory and statistics, a sequence or other collection of random variables is independent and identically distributed (i.i.d.) if each random variable has the same probability distribution as the others and all are mutually independent.[1]
- ↑ Aaron Clauset. "A brief primer on probability distributions". Santa Fe Institute. http://tuvalu.santafe.edu/~aaronc/courses/7000/csci7000-001_2011_L0.pdf.
- ↑ Le Boudec, Jean-Yves (2010). Performance Evaluation Of Computer And Communication Systems. EPFL Press. pp. 46-47. ISBN 978-2-940222-40-7. http://infoscience.epfl.ch/record/146812/files/perfPublisherVersion.pdf.
- ↑ Cover, Thomas (2006). Elements Of Information Theory. Wiley-Interscience. pp. 57–58. ISBN 978-0-471-24195-9.
1973
- (Akaike, 1973) ⇒ Hitotogu Akaike. (1973). “Information Theory and an Extension of the Maximum Likelihood Principle.” In: Proceedings of the Second International Symposium on Information Theory.
- QUOTE: Though the discussion in the present paper has been limited to the realization of independent and identically distributed random variables, by following the approach of Billingsley …