A framework for mining on Twitter data
dc.access.option | Restricted Campus Access Only | |
dc.contributor.advisor | Ding, Qin | |
dc.contributor.author | Huang, Yifan | |
dc.contributor.department | Computer Science | |
dc.date.accessioned | 2017-01-11T21:12:27Z | |
dc.date.available | 2019-02-26T14:23:55Z | |
dc.date.created | 2016-12 | |
dc.date.issued | 2016-12-13 | |
dc.date.submitted | December 2016 | |
dc.date.updated | 2017-01-11T14:33:46Z | |
dc.degree.department | Computer Science | |
dc.degree.discipline | MS-Computer Science | |
dc.degree.grantor | East Carolina University | |
dc.degree.level | Masters | |
dc.degree.name | M.S. | |
dc.description.abstract | Motivated by the increasing need of information retrieval from social media, a lexicon-based approach Tweet Sentiment Classifier (TSC) is presented to determine sentiment from tweet along with a systematic software for twitter data statistics analysis and topic extraction. The TSC uses annotated dictionaries of words (SentiWordNet) and has a negation detector. While the LDA topic model uses Gibbs Sampling. The entire system is unsupervised. Without the need of training, it has significant advantage on speed comparing to supervised methods. It is robust to provide consistent satisfying results from different topics of twitter data. The performance of the TSC also outperforms one of the baseline sentiment analysis methods. | |
dc.embargo.lift | 2019-01-11 | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/10342/6026 | |
dc.language.iso | en | |
dc.publisher | East Carolina University | |
dc.subject | Text Mining | |
dc.subject | Sentiment Analysis | |
dc.subject.lcsh | Data mining | |
dc.subject.lcsh | Information retrieval--Computer programs | |
dc.subject.lcsh | Social media | |
dc.title | A framework for mining on Twitter data | |
dc.type | Master's Thesis | |
dc.type.material | text |