Authors
Ivan Habernal, Tomáš Ptáček, Josef Steinberger
Publication date
2015/7/1
Journal
Information Processing & Management
Volume
51
Issue
4
Pages
532-546
Publisher
Pergamon
Description
This article describes in-depth research on machine learning methods for sentiment analysis of Czech social media. Whereas in English, Chinese, or Spanish this field has a long history and evaluation datasets for various domains are widely available, in the case of the Czech language no systematic research has yet been conducted. We tackle this issue and establish a common ground for further research by providing a large human-annotated Czech social media corpus. Furthermore, we evaluate state-of-the-art supervised machine learning methods for sentiment analysis. We explore different pre-processing techniques and employ various features and classifiers. We also experiment with five different feature selection algorithms and investigate the influence of named entity recognition and preprocessing on sentiment classification performance. Moreover, in addition to our newly created social media dataset …
Total citations
20142015201620172018201920202021202220232024214152524141391065
Scholar articles
I Habernal, T Ptáček, J Steinberger - Information Processing & Management, 2014