A framework for mining on Twitter data

Huang, Yifan

Files

Primary HUANG-MASTERSTHESIS-2016.pdf (1.63 MB)

Date

2016-12-13

Authors

Huang, Yifan

Publisher

East Carolina University

Abstract

Motivated by the increasing need for information retrieval from social media, this thesis presents a lexicon-based Tweet Sentiment Classifier (TSC) that determines the sentiment of tweets, along with a software system for statistical analysis of Twitter data and topic extraction. The TSC uses an annotated dictionary of words (SentiWordNet) and includes a negation detector, while the LDA topic model uses Gibbs sampling. The entire system is unsupervised. Because it requires no training, it has a significant speed advantage over supervised methods. It is robust, giving consistently satisfactory results on Twitter data covering different topics. The TSC also outperforms one of the baseline sentiment analysis methods.

Keywords

Text Mining, Sentiment Analysis

URI

http://hdl.handle.net/10342/6026

Collections

Master's Theses
Computer Science
