Authors
David Pinto, Jorge Civera, Alberto Barrón-Cedeno, Alfons Juan, Paolo Rosso
Publication date
2009/1/1
Journal
Journal of Algorithms
Volume
64
Issue
1
Pages
51-60
Publisher
Academic Press
Description
The existence of huge volumes of documents written in multiple languages on Internet leads to investigate novel algorithmic approaches to deal with information of this kind. However, most crosslingual natural language processing (NLP) tasks consider a decoupled approach in which monolingual NLP techniques are applied along with an independent translation process. This two-step approach is too sensitive to translation errors, and in general to the accumulative effect of errors. To solve this problem, we propose to use a direct probabilistic crosslingual NLP system which integrates both steps, translation and the specific NLP task, into a single one. In order to perform this integrated approach to crosslingual tasks, we propose to use the statistical IBM 1 word alignment model (M1). The M1 model may show a non-monotonic behaviour when aligning words from a sentence in a source language to words from …
Total citations
2008200920102011201220132014201520162017201820192020202120222449127235754536
Scholar articles
D Pinto, J Civera, A Barrón-Cedeno, A Juan, P Rosso - Journal of algorithms, 2009