Authors
Andrew McCallum, Ben Wellner
Publication date
2003/8
Journal
KDD Workshop on Data Cleaning, Record Linkage and Object Consolidation
Pages
1-6
Description
Coreference analysis, also known as record linkage, object consolidation or identity uncertainty, is a difficult and important problem in natural language processing, databases, citation matching and many other tasks. This paper introduces several discriminative, conditional-probability models for coreference analysis, all examples of undirected graphical models. Unlike many historical approaches to coreference, the models presented here are relational—they do not assume that pairwise coreference decisions should be made independently from each other. Unlike other relational models of coreference that are generative, the conditional model here can incorporate a great variety of features of the input without having to be concerned about their dependencies—paralleling the advantages of conditional random fields over hidden Markov models. We present positive results on noun coreference in two standard text data sets.
Total citations
2004200520062007200820092010201120122013201420152016201720182019202020212022202320243526539117410711121111
Scholar articles
A McCallum, B Wellner - KDD Workshop on Data Cleaning, Record Linkage and …, 2003