Inventors
Ankur Tagra, Rajat Verma, Sudarshan Narayanan
Publication date
2019/8/27
Patent office
US
Patent number
10394959
Application number
15849946
Description
Machine training for determining sentiments in social network communications. A text document is extracted from a web site and tokenized into tokens. The tokens are input to a word to vector conversion model to generate word vectors. A term frequency inverse document frequency (TF-IDF) algorithm converts the word vectors to sentence vectors. A randomly selected subset the sentence vectors are tagged and used to train a classifier. The classifier takes a sentence vector and predicts a sentiment associated with the sentence vector. Predicted sentiment associated with each of the sentence vectors may be combined to generate a sentiment associated with the text document.
Total citations
20202021202220233153