View article

[HTML] from jmir.org

“When ‘bad’is ‘good’”: identifying personal communication and sentiment in drug-related tweets

Authors

Raminta Daniulaityte, Lu Chen, Francois R Lamy, Robert G Carlson, Krishnaprasad Thirunarayan, Amit Sheth

Publication date

2016/10/24

Journal

JMIR public health and surveillance

Volume

Issue

Pages

e6327

Publisher

JMIR Publications Inc., Toronto, Canada

Description

Background: To harness the full potential of social media for epidemiological surveillance of drug abuse trends, the field needs a greater level of automation in processing and analyzing social media content.

Objectives: The objective of the study is to describe the development of supervised machine-learning techniques for the eDrugTrends platform to automatically classify tweets by type/source of communication (personal, official/media, retail) and sentiment (positive, negative, neutral) expressed in cannabis-and synthetic cannabinoid–related tweets.

Methods: Tweets were collected using Twitter streaming Application Programming Interface and filtered through the eDrugTrends platform using keywords related to cannabis, marijuana edibles, marijuana concentrates, and synthetic cannabinoids. After creating coding rules and assessing intercoder reliability, a manually labeled data set (N= 4000) was developed by coding several batches of randomly selected subsets of tweets extracted from the pool of 15,623,869 collected by eDrugTrends (May-November 2015). Out of 4000 tweets, 25%(1000/4000) were used to build source classifiers and 75%(3000/4000) were used for sentiment classifiers. Logistic Regression (LR), Naive Bayes (NB), and Support Vector Machines (SVM) were used to train the classifiers. Source classification (n= 1000) tested Approach 1 that used short URLs, and Approach 2 where URLs were expanded and included into the bag-of-words analysis. For sentiment classification, Approach 1 used all tweets, regardless of their source/type (n= 3000), while Approach 2 applied sentiment classification to personal communication …

Total citations

Cited by 87

2017201820192020202120222023202410 7 10 15 17 14 9 4

Scholar articles

“When ‘bad’is ‘good’”: identifying personal communication and sentiment in drug-related tweets

R Daniulaityte, L Chen, FR Lamy, RG Carlson… - JMIR public health and surveillance, 2016