COMPARISON OF TOPIC MODELING METHODS FOR ANALYZING TWEETS ON COVID-19 VACCINE
Khanjari Nezhad Jooneghani, Zeinab
Twitter is a microblogging site and a popular social media platform for sharing thoughts on current world events. The dynamic of Twitter discussions makes it a valuable data source for mining people's opinions and emotions towards world events. Tweets' dynamic nature can be used to analyze opinion shifting and sentiment shifting for specific targets. The COVID-19 outbreak is one of the recent worldwide events that affect people's lives worldwide in the last two years. Many people share their feelings and experiences through social media towards this pandemic. COVID-19-related tweets have recently been the subject of some research. This thesis also analyzes tweets related to the COVID-19 vaccine. The main objective of this thesis is to mine human concerns towards the COVID-19 vaccine using Twitter data. This thesis applies three topic modeling methods to discover the discussed subjects about the COVID-19 vaccine and analyze the topics' dynamic over a specific period. The models are Latent Dirichlet Allocation (LDA), LDA with Gibbs Sampling, Nonnegative Matrix Factorization (NMF), and Top2vec models. Furthermore, this thesis compares these three topic modeling methods based on human judgment, coherence value, and topics uniqueness. The results show both LDA outperformed NMF in terms of Jaccard score. In addition, LDA-Mallet outperformed LDA and NMF in terms of Coherence score. It is difficult to determine which one of NMF and LDA definitely provided the better score for some of the experiments. But, at all, it can be stated NMF performed better than LDA in terms of Coherence score. Top2Vec returned 255 topics for this case study, which is not desired for the purpose of this study. Three other methods outperform Top2vec in terms of Jaccard score and coherence value.
Khanjari Nezhad Jooneghani, Zeinab. (July 2021). COMPARISON OF TOPIC MODELING METHODS FOR ANALYZING TWEETS ON COVID-19 VACCINE (Master's Thesis, East Carolina University). Retrieved from the Scholarship. (http://hdl.handle.net/10342/9414.)
Khanjari Nezhad Jooneghani, Zeinab. COMPARISON OF TOPIC MODELING METHODS FOR ANALYZING TWEETS ON COVID-19 VACCINE. Master's Thesis. East Carolina University, July 2021. The Scholarship. http://hdl.handle.net/10342/9414. September 21, 2023.
Khanjari Nezhad Jooneghani, Zeinab, “COMPARISON OF TOPIC MODELING METHODS FOR ANALYZING TWEETS ON COVID-19 VACCINE” (Master's Thesis., East Carolina University, July 2021).
Khanjari Nezhad Jooneghani, Zeinab. COMPARISON OF TOPIC MODELING METHODS FOR ANALYZING TWEETS ON COVID-19 VACCINE [Master's Thesis]. Greenville, NC: East Carolina University; July 2021.
East Carolina University