2009 QuestionAnsweringforBiologyandM

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Question Answering System; EAGLi System.

Notes

Cited By

Quotes

Author Keywords

Abstract

Biomedical professionals have at their disposal a huge amount of data, such as literature, i.e. textual contents, or databases, i.e. structured contents. But when they have a question, they often have to deal with too many documents in order to efficiently find the appropriate answer in a reasonable time. We have developed a Question Answering system which aims to analyze the user's question, to retrieve the most relevant documents from MEDLINE, and to extract from these retrieved documents a list of candidate answers, ranked by confidence. These candidate answers are concepts issued from biomedical controlled vocabularies, such as the Medical Subject Headings (MeSH) for a first step, and are extracted from the most relevant documents with pattern matching strategies. For evaluation purposes, we apply the system on two biological databases, UniProt and DrugBank. From these resources, we generated two large benchmarks of 200 questions dealing respectively with diseases and proteins, and with diseases and drugs. For these 2 sets, the first candidate answer proposed by our system is respectively correct in 57% and in 68%, while respectively 70% and 75% of all answers to find are contained in the ten first proposed candidate answers. Despite the use of simple Information Extraction strategies, our system exploits the redundancy of information in literature in order to provide a powerful Question Answering system.

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2009 QuestionAnsweringforBiologyandMPatrick Ruch
D. Theodoro
A. -l. Veuthey
C. Lovis
Julien Gobeill
Emilie Pasche
Question Answering for Biology and Medicine.10.1109/ITAB.2009.53943612009