Authors
Gerard De Melo, Gerhard Weikum
Publication date
2009/11/2
Book
Proceedings of the 18th ACM conference on Information and knowledge management
Pages
513-522
Description
Lexical databases are invaluable sources of knowledge about words and their meanings, with numerous applications in areas like NLP, IR, and AI. We propose a methodology for the automatic construction of a large-scale multilingual lexical database where words of many languages are hierarchically organized in terms of their meanings and their semantic relations to other words. This resource is bootstrapped from WordNet, a well-known English-language resource. Our approach extends WordNet with around 1.5 million meaning links for 800,000 words in over 200 languages, drawing on evidence extracted from a variety of resources including existing (monolingual) wordnets, (mostly bilingual) translation dictionaries, and parallel corpora. Graph-based scoring functions and statistical learning techniques are used to iteratively integrate this information and build an output graph. Experiments show that this …
Total citations
20102011201220132014201520162017201820192020202120222023202487182431182511121253232
Scholar articles
G De Melo, G Weikum - Proceedings of the 18th ACM conference on …, 2009