2010 WikiNetAVeryLrgScaleMultLingConcNet


Subject Headings:

Notes

Quotes

Topics

  • Ontologies, Knowledge Discovery/Representation, Multilinguality

Abstract

  • This paper describes a multi-lingual large-scale concept network obtained automatically by mining for concepts and relations and exploiting a variety of sources of knowledge from Wikipedia. Concepts and their lexicalizations are extracted from Wikipedia pages, in particular from article titles, hyperlinks, disambiguation pages and cross-language links. Relations are extracted from the category and page network, from the category names, from infoboxes and the body of the articles. The resulting network has two main components: (i) a central, language independent index of concepts, which serves to keep track of the concepts' lexicalizations both within a language and across languages, and to separate linguistic expressions of concepts from the relations in which they are involved (concepts themselves are represented as numeric IDs); (ii) a large network built on the basis of the relations extracted, represented as relations between concepts (more specifically, the numeric IDs). The various stages of obtaining the network were separately evaluated, and the results show a qualitative resource.
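
The two-component layout described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class name `ConceptNetwork`, its methods, the numeric IDs, the relation label `located_in`, and the sample lexicalizations are all hypothetical, chosen only to show how a language-independent index of numeric concept IDs can be kept separate from a relation network defined over those IDs.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class ConceptNetwork:
    """Illustrative two-part structure: a concept index plus a relation network."""

    # Component (i): concept ID -> language code -> set of lexicalizations.
    index: dict = field(
        default_factory=lambda: defaultdict(lambda: defaultdict(set))
    )
    # Component (ii): (source ID, relation name, target ID) triples over IDs only.
    relations: set = field(default_factory=set)

    def add_lexicalization(self, concept_id, lang, term):
        self.index[concept_id][lang].add(term)

    def add_relation(self, source_id, relation, target_id):
        self.relations.add((source_id, relation, target_id))

    def lookup(self, lang, term):
        """Return the IDs of all concepts that `term` can express in `lang`."""
        return {
            cid
            for cid, by_lang in self.index.items()
            if term in by_lang.get(lang, ())
        }

    def render(self, lang):
        """Express every relation in `lang`, falling back to the numeric ID."""

        def name(cid):
            terms = self.index.get(cid, {}).get(lang)
            # min() only makes the choice among synonyms deterministic.
            return min(terms) if terms else str(cid)

        return sorted((name(s), r, name(t)) for s, r, t in self.relations)


# Hypothetical sample data, not taken from WikiNet itself.
net = ConceptNetwork()
net.add_lexicalization(1, "en", "Heidelberg")
net.add_lexicalization(1, "de", "Heidelberg")
net.add_lexicalization(2, "en", "Germany")
net.add_lexicalization(2, "de", "Deutschland")
net.add_relation(1, "located_in", 2)

print(net.render("en"))
print(net.render("de"))
```

Because relations are stored only between numeric IDs, the same triple can be rendered in any language for which the index holds a lexicalization, which is the separation of linguistic expressions from relations that the abstract describes.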


2010 WikiNetAVeryLrgScaleMultLingConcNet

  • Author: Michael Strube, Vivi Nastase, Benjamin Boerschinger, Caecilia Zirn, Anas Elghafari
  • Title: WikiNet: A Very Large Scale Multi-Lingual Concept Network