A Natural Language Audio Dataset is an Audio Dataset that contains natural language utterances that have been converted into machine-readable items (for example, digitized speech recordings).
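
To make the definition concrete, the following is a minimal sketch of what one machine-readable item might look like. The class name `UtteranceItem`, its fields, and the toy values are hypothetical and chosen only for illustration; they are not taken from any particular dataset or library.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class UtteranceItem:
    """One hypothetical dataset item: a digitized utterance plus optional metadata."""
    samples: List[int]      # PCM amplitude values (placeholder data, not real audio)
    sample_rate_hz: int     # number of samples per second
    transcript: str = ""    # optional text of the utterance (assumed field)
    language: str = ""      # optional language tag such as "en" (assumed field)

    def duration_seconds(self) -> float:
        # Duration follows directly from sample count and sample rate.
        return len(self.samples) / self.sample_rate_hz


# A toy dataset with two placeholder items (silence stands in for real speech).
dataset = [
    UtteranceItem(samples=[0] * 16000, sample_rate_hz=16000, transcript="hello", language="en"),
    UtteranceItem(samples=[0] * 8000, sample_rate_hz=16000, transcript="hi", language="en"),
]

# Total amount of audio in the dataset, in seconds.
total_seconds = sum(item.duration_seconds() for item in dataset)
```

Under these assumptions, each item pairs the digitized signal with descriptive metadata, so the collection as a whole can be iterated over and summarized like any other dataset.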