2010 AStudyoftheUniquenessofSourceCo
- (Gabel & Su, 2010) ⇒ Mark Gabel, and Zhendong Su. (2010). “A Study of the Uniqueness of Source Code.” In: Proceedings of the eighteenth ACM SIGSOFT international symposium on Foundations of software engineering. ISBN:978-1-60558-791-2 doi:10.1145/1882291.1882315
Subject Headings: Software Code Unit, Software Code Corpus, Uniqueness Measure.
Notes
Cited By
- http://scholar.google.com/scholar?q=%222010%22+A+Study+of+the+Uniqueness+of+Source+Code
- http://dl.acm.org/citation.cfm?id=1882291.1882315&preflayout=flat#citedby
Quotes
Abstract
This paper presents the results of the first study of the uniqueness of source code. We define the uniqueness of a unit of source code with respect to the entire body of written software, which we approximate with a corpus of 420 million lines of source code. Our high-level methodology consists of examining a collection of 6,000 software projects and measuring the degree to which each project can be 'assembled' solely from portions of this corpus, thus providing a precise measure of `uniqueness' that we call syntactic redundancy. We parameterized our study over a variety of variables, the most important of which being the level of granularity at which we view source code. Our suite of experiments together consumed approximately four months of CPU time, providing quantitative answers to the following questions: at whatlevels of granularity is software unique, and at a given level of granularity, how unique is software? While we believe these questions to be of intrinsic interest, we discuss possible applications to genetic programming and developer productivity tools.
References
;
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2010 AStudyoftheUniquenessofSourceCo | Mark Gabel Zhendong Su | A Study of the Uniqueness of Source Code | 10.1145/1882291.1882315 | 2010 |