Authors
Christina Niklaus, Matthias Cetto, André Freitas, Siegfried Handschuh
Publication date
2019/9/26
Journal
arXiv preprint arXiv:1909.12140
Description
We introduce DisSim, a discourse-aware sentence splitting framework for English and German. Its goal is to transform syntactically complex sentences into an intermediate representation with a simpler, more regular structure that is easier for downstream semantic applications to process. To this end, we turn input sentences into a two-layered semantic hierarchy of core facts and accompanying contexts, and identify the rhetorical relations that hold between them. In this way, we preserve the coherence structure of the input and, hence, its interpretability for downstream tasks.
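The two-layered hierarchy described above can be pictured with a small data structure. The following is only an illustrative sketch, not DisSim's actual API or output format: the class names `SimpleSentence` and `Link`, the `render` helper, the hand-built example sentence, and the relation labels "Contrast" and "Elaboration" are all assumptions made for illustration. It models a hypothetical split of "Although it rained, the match, which was sold out, continued." into one core fact and two contexts.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SimpleSentence:
    """One simplified sentence in the hierarchy (hypothetical type)."""
    text: str
    is_core: bool  # True = core fact, False = accompanying context
    links: List["Link"] = field(default_factory=list)


@dataclass
class Link:
    """A labeled edge from one sentence to another (hypothetical type)."""
    relation: str  # illustrative rhetorical relation label
    target: SimpleSentence


def render(sentence: SimpleSentence, depth: int = 0) -> List[str]:
    """Return an indented, line-by-line view of a sentence and its links."""
    layer = "core" if sentence.is_core else "context"
    lines = ["  " * depth + f"[{layer}] {sentence.text}"]
    for link in sentence.links:
        lines.append("  " * (depth + 1) + f"-- {link.relation} -->")
        lines.extend(render(link.target, depth + 2))
    return lines


def build_example() -> SimpleSentence:
    """Hand-built split of the example sentence (not produced by DisSim)."""
    core = SimpleSentence("The match continued.", is_core=True)
    core.links.append(
        Link("Contrast", SimpleSentence("It rained.", is_core=False))
    )
    core.links.append(
        Link("Elaboration", SimpleSentence("The match was sold out.", is_core=False))
    )
    return core


if __name__ == "__main__":
    print("\n".join(render(build_example())))
```

Keeping the relation on the edge rather than on the sentence lets the same context be attached to several core facts with different labels, which is one plausible way to keep the coherence structure explicit for downstream use.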
Total citations
Citations per year: 2021: 5, 2022: 4, 2023: 5, 2024: 2
Scholar articles
C Niklaus, M Cetto, A Freitas, S Handschuh - arXiv preprint arXiv:1909.12140, 2019