Software Code Corpus
(Redirected from software code corpus)
Jump to navigation
Jump to search
A Software Code Corpus is a corpus of software code.
- AKA: Code Repository.
- See: CRAN, CPAN, Software Library Repository, GitHub.
References
2010
- (Gabel & Su, 2010) ⇒ Mark Gabel, and Zhendong Su. (2010). “A Study of the Uniqueness of Source Code.” In: Proceedings of the eighteenth ACM SIGSOFT international symposium on Foundations of software engineering. ISBN:978-1-60558-791-2 doi:10.1145/1882291.1882315
- QUOTE: We define the uniqueness of a unit of source code with respect to the entire body of written software, which we approximate with a corpus of 420 million lines of source code.