Authors
Iman Keivanloo, Chanchal K Roy, Juergen Rilling
Publication date
2014/12/1
Journal
Science of Computer Programming
Volume
95
Pages
426-444
Publisher
Elsevier
Description
While source code clone detection is a well-established research area, finding similar code fragments in binary and other intermediate code representations has not yet been studied as widely. In this paper, we introduce SeByte, a bytecode clone detection and search model that applies semantic-enabled token matching. It is based on the idea of relaxing the code fingerprints. This approach separates the input content by token type into different dimensions, with each dimension representing the input content from a specific point of view. SeByte then compares each dimension separately and independently, which we refer to as multi-dimensional comparison in our research. As the similarity search function, we use the Jaccard similarity coefficient, a well-known measure that supports our multi-dimensional comparison heuristic. Our preliminary study shows that …
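The multi-dimensional comparison described above can be illustrated with a minimal sketch. This is not SeByte's implementation: the dimension names (`instructions`, `method_calls`), the helper names, the handling of empty sets, and the 0.5 threshold are all assumptions made for illustration. Only the per-dimension use of the Jaccard coefficient comes from the description.

```python
from typing import Dict, Set

# Hypothetical fingerprint: one token set per dimension (token type).
Fingerprint = Dict[str, Set[str]]


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity coefficient |A ∩ B| / |A ∪ B|."""
    union = a | b
    if not union:
        return 1.0  # assumption: two empty sets count as identical
    return len(a & b) / len(union)


def compare(fp_a: Fingerprint, fp_b: Fingerprint) -> Dict[str, float]:
    """Score each dimension separately and independently."""
    dims = fp_a.keys() | fp_b.keys()
    return {d: jaccard(fp_a.get(d, set()), fp_b.get(d, set()))
            for d in sorted(dims)}


def is_clone_candidate(scores: Dict[str, float], threshold: float = 0.5) -> bool:
    """Assumed decision rule: every dimension must reach the threshold."""
    return all(s >= threshold for s in scores.values())


# Two illustrative bytecode fragments, already split by token type.
frag_a: Fingerprint = {
    "instructions": {"aload", "invokevirtual", "ireturn"},
    "method_calls": {"java/lang/String.length"},
}
frag_b: Fingerprint = {
    "instructions": {"aload", "invokevirtual", "istore", "ireturn"},
    "method_calls": {"java/lang/String.length"},
}

scores = compare(frag_a, frag_b)
print(scores)                      # per-dimension Jaccard scores
print(is_clone_candidate(scores))  # decision under the assumed rule
```

Keeping one score per dimension, rather than merging all tokens into a single set, lets a search relax or tighten each token type on its own, which is the point of comparing dimensions independently.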
Total citations
[Chart: citations per year, 2014 to 2024]