Authors
Qingying Chen, Minghui Zhou
Publication date
2018/9/3
Book
Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering
Pages
826-831
Description
Code retrieval and summarization are two tasks often employed by software developers to reuse code that spreads over online repositories. In this paper, we present a neural framework that allows bidirectional mapping between source code and natural language to improve these two tasks. Our framework, BVAE, is designed to have two Variational AutoEncoders (VAEs) to model bimodal data: C-VAE for source code and L-VAE for natural language. Both VAEs are trained jointly to reconstruct their input as much as possible with regularization that captures the closeness between the latent variables of code and description. BVAE could learn semantic vector representations for both code and description and generate completely new descriptions for arbitrary code snippets. We design two instance models of BVAE for retrieval and summarization tasks respectively and evaluate their performance on a benchmark …
Total citations
2018201920202021202220232024151825252612
Scholar articles
Q Chen, M Zhou - Proceedings of the 33rd ACM/IEEE International …, 2018