2011 ExtSructKnowFromROMTexDataInSoftRepos
- (Hasan, 2011) ⇒ Maryam Hasan. (2011). “Extracting Structured Knowledge From ROM Textual Data In Software Repositories.” Masters Thesis, University of Alberta (2011).
Subject Headings:
Notes
Cited By
Quotes
Abstract
Software team members, as they communicate and coordinate their work with others throughout the life-cycle of their projects, generate different kinds of textual artifacts. Despite the variety of works for mining software artifacts, relatively little research has focused on communication artifacts. Software communication artifacts contain useful semantic information that is not fully explored by existing approaches. This thesis, presents the development of a text analysis method and tool to extract and represent valuable information from a wide range of textual data sources from software projects. The developed system integrates Natural Language Processing techniques and statistical text analysis methods, with software domain knowledge. The extracted information is represented as RDF-style triples which constitute interesting relations between developers and software products. We applied our system to analyze five different textual sources, i.e., source code commits, bug reports, email messages, chat logs, and wiki pages. In the evaluation of the system, we found its precision to be 82%, its recall 58%, and its F-measure 68%.
References
,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2011 ExtSructKnowFromROMTexDataInSoftRepos | Maryam Hasan | Extracting Structured Knowledge From ROM Textual Data In Software Repositories | http://repository.library.ualberta.ca/dspace/bitstream/10048/1776/1/Hasan Maryam Spring+2011.pdf | 2011 |