Textpresso system
Jump to navigation
Jump to search
A Textpresso system is an ontology-based IE system.
- Context:
- It can identify concept mentions.
- It can identify properties that connect concepts.
- See: Ontology-based IE Task.
References
2008
- (Michelson & Knoblock, 2008) ⇒ Matthew Michelson, and Craig A. Knoblock. (2008). “Creating Relational Data from Unstructured and Ungrammatical Data Sources.” In: Journal of Artificial Intelligence Research, 31.
- QUOTE: Yet another interesting approach to information extraction using ontologies is the Textpresso system which extracts data from biological text (Müller & Sternberg, 2004). This system uses a regular expression based keyword look-up to label tokens in some text based on the ontology. Once all tokens are labeled, Textpresso can perform “fact extraction” by extracting sequences of labeled tokens that fit a particular pattern, such as gene-allele reference associations. Although this system again uses a reference set for extraction, it differs in that it does a keyword look-up into the lexicon.
2006
- http://www.textpresso.org/
- Textpresso is a text-mining system for scientific literature. Textpresso's two major elements are (1) access to full text, so that entire articles can be searched, and (2) introduction of categories of biological concepts and classes that relate two objects (e.g., association, regulation, etc.) or describe one (e.g., methods, etc). A search engine enables the user to search for one or a combination of these categories and/or keywords within an entire literature. Textpresso is useful as a search engine for researchers as well as a curation tool. It was developed as a part of WormBase and is used extensively by C. elegans curators. Textpresso has currently been implemented for 17 different literatures, and can readily be extended to other corpora of text.
2004
- (Müller, 2004) ⇒ Hans-Michael Müller, Eimear E. Kenny, and Paul W. Sternberg. (2004). “Textpresso: an ontology-based information retrieval and extraction system for biological literature.” In: PLoS Biol, 2(11):e309. doi:10.1371/journal.pbio.0020309 doi:10.1371/journal.pbio.0020309 doi:10.1371/journal.pbio.0020309.
- We have developed Textpresso, a new text-mining system for scientific literature whose capabilities go far beyond those of a simple keyword search engine.