2003 EvidenceCombInBiomedNLP
- (Skounakis & Craven, 2003) ⇒ Marios Skounakis, Mark Craven. (2003). “Evidence Combination in Biomedical Natural-Language Processing.” In: Proceedings of the 3nd ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD 2003).
Subject Headings: machine learning, information extraction, text mining
Notes
Cited By
2008
- (Downey, 2008) ⇒ Doug Downey. (2008). “Redundancy in Web-scale Information Extraction: Probabilistic Model and Experimental Results.” PhD Thesis, University of Washington.
- QUOTE: Skounakis and Craven [55] develop a probabilistic model for combining evidence from multiple extractions in a supervised setting. Their problem formulation differs from ours, as they classify each occurrence of an extraction, and then use a binomial model along with the false positive and true positive rates of the classifier to obtain the probability that at least one occurrence is a true positive. Similar to the above approaches, they do not explicitly account for sample size [math]\displaystyle{ n }[/math], nor do they model the distribution of target and error
Quotes
Abstract
In many natural language tasks, such as information extraction and [[semantic lexicon building]], individual entities and relations of interest may be found in multiple contexts within the corpus. In deciding which putative entities and relations should be extracted, a key problem is how to combine evidence across the multiple occurrences of these entities and relations. We present a novel statistical approach to address this issue, and evaluate it in the context of extracting protein names and protein-protein interactions from MEDLINE abstracts. We experimentally compare our method against a number of intuitive and simpler baselines. Our experimental results suggest that the issue of combining evidence is indeed important in these tasks. Furthermore, we show that our proposed method outperforms the baselines considered in a variety of settings.
References
,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2003 EvidenceCombInBiomedNLP | Mark Craven Marios Skounakis | Evidence Combination in Biomedical Natural-Language Processing | Proceedings of the 3nd ACM SIGKDD Workshop on Data Mining in Bioinformatics | http://www.cs.rpi.edu/~zaki/BIOKDD03/proceedings/5-skounakis.pdf | 2003 |