2007 OntBasedAnnotOfWebDocSegments
Jump to navigation
Jump to search
- (El-Beltagy & al 2007) ⇒ Samhaa R. El-Beltagy, Maryam Hazma, Ahmed Rafea. (2007). “Ontology Based Annotation Of Web Document Segments.” In: Proceedings of the 22nd Annual ACM Symposium on Applied Computing. (2007)
Subject Headings:
Notes
Cited By
~ 5 http://scholar.google.com/scholar?cites=12574559105284671079
Quotes
Abstract
- This work exploits the logical structure of information rich texts to automatically annotate text segments contained within them using a domain ontology. The underlying assumption behind this work is that segments in such documents embody self contained informative units. Another assumption is that segment headings coupled with a document’s hierarchical structure offer informal representations of segment content; and that matching segment headings to concepts in an ontology/thesaurus can result in the creation of formal labels/meta-data for these segments. When an encountered heading can not be matched with any concepts in the ontology, the hierarchical structure of the document is used to infer where a new concept represented by this heading should be added in the ontology. So, in this work the bootstrap ontology is also enriched by new concepts encountered within input documents. This paper also presents issues/problems related to matching textual entities to concepts in an incomplete ontology. The approach presented in this paper was applied to a set of agricultural extension documents. The results of carrying out this experiment demonstrates that the proposed approach is capable of automatically annotating segments with concepts that describe a segment’s content with a high degree of accuracy.
References
,