Authors
Preetha Chatterjee, Manziba Akanda Nishi, Kostadin Damevski, Vinay Augustine, Lori Pollock, Nicholas A Kraft
Publication date
2017/2/20
Conference
2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER)
Pages
382-386
Publisher
IEEE
Description
A large corpora of software-related documents is available on the Web, and these documents offer the unique opportunity to learn from what developers are saying or asking about the code snippets that they are discussing. For example, the natural language in a bug report provides information about what is not functioning properly in a particular code snippet. Previous research has mined information about code snippets from bug reports, emails, and Q&A forums. This paper describes an exploratory study into the kinds of information that is embedded in different software-related documents. The goal of the study is to gain insight into the potential value and difficulty of mining the natural language text associated with the code snippets found in a variety of software-related documents, including blog posts, API documentation, code reviews, and public chats.
Total citations
20172018201920202021202220232024141181733
Scholar articles
P Chatterjee, MA Nishi, K Damevski, V Augustine… - 2017 IEEE 24th International Conference on Software …, 2017