Google AudioSet Dataset
(Redirected from Google AudioSet Data)
Jump to navigation
Jump to search
A Google AudioSet Dataset is an audio dataset created by Google Research.
- Context:
- It can be composed of a AudioSet Clips and an AudioSet Ontology.
- …
- Example(s):
- Counter-Example(s):
- …
- See: Audio Classification.
References
2017a
- https://research.google.com/audioset/
- QUOTE: AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds. By releasing AudioSet, we hope to provide a common, realistic-scale evaluation task for audio event detection, as well as a starting point for a comprehensive vocabulary of sound events. …
… This dataset is brought to you from the Sound Understanding group in the Machine Perception Research organization at Google.
- QUOTE: AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds. By releasing AudioSet, we hope to provide a common, realistic-scale evaluation task for audio event detection, as well as a starting point for a comprehensive vocabulary of sound events. …
2017b
- https://research.google.com/audioset/download.html
- QUOTE: The dataset is divided in three disjoint sets: a balanced evaluation set, a balanced training set, and an unbalanced training set. In the balanced evaluation and training sets, we strived for each class to have the same number of examples. The unbalanced training set contains the remainder of annotated segments.
- Evaluation - eval_segments.csv
20,383 segments from distinct videos, providing at least 59 examples for each of the 527 sound classes that are used. Because of label co-occurrence, many classes have more examples. - Balanced train - balanced_train_segments.csv
22,176 segments from distinct videos chosen with the same criteria: providing at least 59 examples per class with the fewest number of total segments. - Unbalanced train - unbalanced_train_segments.csv
2,042,985 segments from distinct videos, representing the remainder of the dataset.