Authors
Xifeng Yan, Jiawei Han
Publication date
2002/12/9
Conference
2002 IEEE International Conference on Data Mining, 2002. Proceedings.
Pages
721-724
Publisher
IEEE
Description
We investigate new approaches for frequent graph-based pattern mining in graph datasets and propose a novel algorithm called gSpan (graph-based substructure pattern mining), which discovers frequent substructures without candidate generation. gSpan builds a new lexicographic order among graphs, and maps each graph to a unique minimum DFS code as its canonical label. Based on this lexicographic order gSpan adopts the depth-first search strategy to mine frequent connected subgraphs efficiently. Our performance study shows that gSpan substantially outperforms previous algorithms, sometimes by an order of magnitude.
Total citations
200320042005200620072008200920102011201220132014201520162017201820192020202120222023202431709312313316217418817919819419519619115818113013111311210136
Scholar articles
X Yan, J Han - 2002 IEEE International Conference on Data Mining …, 2002