A framework for mining on Twitter data

Huang, Yifan

A framework for mining on Twitter data

dc.access.option	Restricted Campus Access Only
dc.contributor.advisor	Ding, Qin
dc.contributor.author	Huang, Yifan
dc.contributor.department	Computer Science
dc.date.accessioned	2017-01-11T21:12:27Z
dc.date.available	2019-02-26T14:23:55Z
dc.date.created	2016-12
dc.date.issued	2016-12-13
dc.date.submitted	December 2016
dc.date.updated	2017-01-11T14:33:46Z
dc.degree.department	Computer Science
dc.degree.discipline	MS-Computer Science
dc.degree.grantor	East Carolina University
dc.degree.level	Masters
dc.degree.name	M.S.
dc.description.abstract	Motivated by the increasing need of information retrieval from social media, a lexicon-based approach Tweet Sentiment Classifier (TSC) is presented to determine sentiment from tweet along with a systematic software for twitter data statistics analysis and topic extraction. The TSC uses annotated dictionaries of words (SentiWordNet) and has a negation detector. While the LDA topic model uses Gibbs Sampling. The entire system is unsupervised. Without the need of training, it has significant advantage on speed comparing to supervised methods. It is robust to provide consistent satisfying results from different topics of twitter data. The performance of the TSC also outperforms one of the baseline sentiment analysis methods.
dc.embargo.lift	2019-01-11
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/10342/6026
dc.language.iso	en
dc.publisher	East Carolina University
dc.subject	Text Mining
dc.subject	Sentiment Analysis
dc.subject.lcsh	Data mining
dc.subject.lcsh	Information retrieval--Computer programs
dc.subject.lcsh	Social media
dc.title	A framework for mining on Twitter data
dc.type	Master's Thesis
dc.type.material	text

Files

Original bundle

Now showing 1 - 1 of 1

Name:: HUANG-MASTERSTHESIS-2016.pdf
Size:: 1.63 MB
Format:: Adobe Portable Document Format

Please login to access this content.

Download

Collections

Master's Theses
Computer Science