Exploiting Debate Portals for Semi-Supervised Argumentation Mining in User-Generated Web Discourse

Authors

Ivan Habernal, Iryna Gurevych

Publication date

2015

Conference

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing

Pages

2127–2137

Publisher

Association for Computational Linguistics

Description

Analyzing arguments in user-generated Web discourse has recently gained attention in argumentation mining, an evolving field of NLP. Current approaches, which employ fully supervised machine learning, are usually domain dependent and suffer from the lack of large and diverse annotated corpora. However, annotating arguments in discourse is costly, error-prone, and highly context-dependent. We asked whether leveraging unlabeled data in a semi-supervised manner can boost the performance of argument component identification, and to what extent the approach is independent of domain and register. We propose novel features that exploit clustering of unlabeled data from debate portals based on a word embeddings representation. Using these features, we significantly outperform several baselines in the cross-validation, cross-domain, and cross-register evaluation scenarios.
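
The idea of turning clusters of unlabeled text into features can be illustrated with a minimal sketch. This is not the authors' implementation: the toy embeddings, the two centroids, and the function names (`average_vector`, `cosine`, `cluster_features`) are all hypothetical, and the centroids are assumed to have been computed beforehand by clustering embedded unlabeled text (for example with k-means). The sketch only shows how a text span could be mapped to one similarity feature per cluster.

```python
import math

# Hypothetical toy word embeddings (real ones would have hundreds of dimensions).
TOY_EMBEDDINGS = {
    "because": [1.0, 0.0],
    "therefore": [0.8, 0.2],
    "lol": [0.0, 1.0],
}

# Hypothetical centroids, assumed to come from clustering unlabeled text.
TOY_CENTROIDS = [[1.0, 0.0], [0.0, 1.0]]


def average_vector(tokens, embeddings):
    """Average the embeddings of the known tokens; None if no token is known."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_features(tokens, embeddings, centroids):
    """Return one feature per centroid: similarity of the span to that cluster."""
    vec = average_vector(tokens, embeddings)
    if vec is None:
        return [0.0] * len(centroids)
    return [cosine(vec, c) for c in centroids]


if __name__ == "__main__":
    print(cluster_features(["because", "therefore"], TOY_EMBEDDINGS, TOY_CENTROIDS))
```

Under these assumptions, the resulting vector could be appended to the other features of a supervised classifier, so that information from unlabeled data reaches the model without any additional annotation.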

Total citations

Cited by 104

2016: 15, 2017: 24, 2018: 8, 2019: 12, 2020: 10, 2021: 9, 2022: 8, 2023: 11, 2024: 7
