SmartWeb Ontology-Based Annotation (SOBA) System
A SmartWeb Ontology-Based Annotation (SOBA) System is a Ontology-based Information Extraction System that consists of a web crawler with linguistic annotation components.
- AKA: Buitelaar's SOBA System.
- It is a sub-system of a SmartWeb System.
- It was developed by Buitelaar et al. (2006).
- …
- Counter-Example(s):
- See: Information Extraction System, Knowledge Base, Knowledge Representation, Ontology Representation, Web Crawling Task, SWIntO Ontology, GM-RKB Ontology, Automatic Ontology Population System, Sequence Ontology Bioinformatics Analysis (SOBA) System, Secrecy-preserving Observable Ballot-level Audit (SOBA) System.
References
2006a
- (Buitelaar et al., 2006) ⇒ Paul Buitelaar, Philipp Cimiano, Stefania Racioppa, and Melanie Siegel. (2006). “Ontology-based Information Extraction with SOBA.” In: Proceedings of the International Conference on Language Resources and Evaluation.
- QUOTE: The SOBA system consists of a web crawler, linguistic annotation components and a component for the transformation of linguistic annotations into a knowledge base, i.e. an ontology-based representation.
The web crawler acts as a monitor on relevant web domains (i.e. the FIFA[1] and UEFA [2] web sites), automatically downloads relevant documents from them and sends these to a linguistic annotation web service.
Linguistic annotation and information extraction is based on the Heart-of-Gold (HoG) architecture Callmeier et al. 2004, which provides a uniform and flexible infrastructure for building multilingual applications that use XML-based natural language processing components.
The linguistically annotated documents are further processed by the semantic transformation component, which generates a knowledge base of soccer-related entities (players, teams, etc.) and events (matches, goals, etc.) by mapping annotated entities or events to ontology classes and their properties (In: Section 2: System Overview)
- QUOTE: The SOBA system consists of a web crawler, linguistic annotation components and a component for the transformation of linguistic annotations into a knowledge base, i.e. an ontology-based representation.
2006b
- (Buitelaar et al., 2006b) ⇒ Paul Buitelaar, Philipp Cimiano, Anette Frank, and Stefania Racioppa (2006). "SOBA: Smartweb Ontology-based Annotation". In: Proceedings of the Demo Session at the International Semantic Web Conference (ISWC).
- QUOTE: We described SOBA, a system for ontology-based extraction, integration and display of information. SOBA as presented here is a domain-specific application. Porting SOBA to another domain can be based on the general purpose NLP components in HoG, but also involves the integration of a domain-specific ontology, extensions and/or modifications of the SProUT gazetteers and rule set and of the KB-related F-Logic rules.