Classifying Vaccine Misinformation in Online Social Media Videos using Natural Language Processing and Machine Learning
| dc.contributor.author | Schmidt, Sarah | |
| dc.contributor.author | Thoms, Brian | |
| dc.contributor.author | Eryilmaz, Evren | |
| dc.contributor.author | Isaacs, Jason | |
| dc.date.accessioned | 2023-12-26T18:42:17Z | |
| dc.date.available | 2023-12-26T18:42:17Z | |
| dc.date.issued | 2024-01-03 | |
| dc.identifier.doi | https://doi.org/10.24251/HICSS.2024.464 | |
| dc.identifier.isbn | 978-0-9981331-7-1 | |
| dc.identifier.other | 5d927a45-7b4a-4aae-8a4e-36f70c669fff | |
| dc.identifier.uri | https://hdl.handle.net/10125/106847 | |
| dc.language.iso | eng | |
| dc.relation.ispartof | Proceedings of the 57th Hawaii International Conference on System Sciences | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | Socia Media and Healthcare Technology | |
| dc.subject | machine learning | |
| dc.subject | misinformation detection | |
| dc.subject | natural language processing | |
| dc.subject | sentiment analysis | |
| dc.title | Classifying Vaccine Misinformation in Online Social Media Videos using Natural Language Processing and Machine Learning | |
| dc.type | Conference Paper | |
| dc.type.dcmi | Text | |
| dcterms.abstract | The spread of information through online social media videos is one of the most popular ways to share and obtain information, while at the same time the spread of misinformation across these same social spaces has become a significant concern affecting human well-being. Being able to detect this misinformation before it spreads is becoming more and more desirable for many social media platforms. This research focuses on exploring the accuracy of detecting misinformation across two social media platforms, YouTube and BitChute. This involves the classification of video data into two types: genuine information or misinformation. More specifically, this research generates additional metadata embedded within online videos related to the COVID-19 vaccination. Using natural language processing (NLP) we extract medical subject headings (MeSH) terms from video transcripts and classify videos using four machine learning techniques including naïve Bayes, random forest, support vector machine, and logistic regression. Implementation of each classifier is presented, and the accuracy of each technique is compared and discussed. | |
| dcterms.extent | 10 pages | |
| prism.startingpage | 3847 |
Files
Original bundle
1 - 1 of 1
