2006 OntolPopFromTextualMentions


Subject Headings: Ontology Population from Text Algorithm, Weakly-Supervised Algorithm.





In this paper we propose and investigate Ontology Population from Textual Mentions (OPTM), a sub-task of Ontology Population from text where we assume that mentions for several kinds of entities (e.g. PERSON, ORGANIZATION, LOCATION, GEOPOLITICAL_ENTITY) are already extracted from a document collection. On the one hand, OPTM simplifies the general Ontology Population task, limiting the input textual material; on the other hand, it introduces challenging extensions to Ontology Population restricted to named entities, being open to a wider spectrum of linguistic phenomena. We describe a manually created benchmark for OPTM and discuss several factors which determine the difficulty of the task.


  • (Almuhareb and Poesio, 2004) ⇒ Abdulrahman Almuhareb, and Massimo Poesio. (2004). “Attribute-based and Value-based Clustering: An evaluation.” In: Proceedings of EMNLP 2004.
  • Avancini, H., Lavelli, A., Bernardo Magnini, Sebastiani, F., Zanoli, R. (2003). Expanding Domain-Specific Lexicons by Term Categorization. In: Proceedings of SAC 2003, 793-79.
  • Hamish Cunningham and Bontcheva, K. (2005). Knowledge Management and Human Language: Crossing the Chasm. Journal of Knowledge Management, 9(5).
  • Buitelaar, P., Cimiano, P. and Bernardo Magnini (Eds.) (2005). Ontology Learning from Text: Methods, Evaluation and Applications. IOS Press.
  • Ferro, L., Gerber, L., Mani, I., Sundheim, B. and Wilson, G. (2005). TIDES 2005 Standard for the Annotation of Temporal Expressions. Technical report, MITRE.
  • Lavelli, A., Bernardo Magnini, Negri, M., Pianta, E., Speranza, M. and Sprugnoli, R. (2005). Italian Content Annotation Bank (I-CAB): Temporal Expressions (V. 1.0.). Technical Report T-0505-12. ITC-irst, Trento.
  • Dekang Lin (1998). Automatic Retrieval and Clustering of Similar Words. In: Proceedings of COLING-ACL98, Montreal, Canada, 1998.
  • Linguistic Data Consortium (2004). ACE (Automatic Content Extraction) English Annotation Guidelines for Entities, version 5.6.1 2005.05.23. http://projects.ldc.upenn.edu/ace/docs/English-Entities-Guidelines_v5.6.1.pdf
  • Bernardo Magnini, Pianta, E., Girardi, C., Negri, M., Romano, L., Speranza, M., Bartalesi Lenzi, V. and Sprugnoli, R. (2006). I-CAB: the Italian Content Annotation Bank. Proceedings of LREC-2006, Genova, Italy, 22-28 May, 2006.
  • Tanev, H. and Bernardo Magnini (2006). Weakly Supervised Approaches for Ontology Population. In: Proceedings of EACL-2006, Trento, Italy, 3-7 April, 2006.
  • Paola Velardi, Roberto Navigli, Cuchiarelli, A., Neri, F. (2004). Evaluation of Ontolearn, a Methodology for Automatic Population of Domain Ontologies. In: Buitelaar, P., Cimiano, P., Bernardo Magnini (eds.): Ontology Learning from Text: Methods, Evaluation and Applications, IOS Press, Amsterdam, 2005.


2006 OntolPopFromTextualMentions
  • Author: Manuela Speranza, Bernardo Magnini, Emanuele Pianta, Octavian Popescu
  • Title: Ontology Population from Textual Mentions: Task Definition and Benchmark
  • Journal: Proceedings of the ACL 2006 Workshop on Ontology Population and Learning
  • Title URL: http://tcc.itc.it/people/pianta/publications/olp2-2006-final.pdf
  • Year: 2006