Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/63866

Topic Modeling and Transfer Learning for Automated Surveillance of Injury Reports in Consumer Product Reviews

File Size Format  
0101.pdf 467.85 kB Adobe PDF View/Open

Item Summary

Title:Topic Modeling and Transfer Learning for Automated Surveillance of Injury Reports in Consumer Product Reviews
Authors:Goldberg, David
Zaman, Nohel
Keywords:Data, Text, and Web Mining for Business Analytics
business intelligence
safety
text mining
topic modeling
show 1 moretransfer learning
show less
Date Issued:07 Jan 2020
Abstract:Many modern firms and interest groups are tasked with the challenge of monitoring the status and performance of a bevy of distinct products. As online user-generated content has increased in volume, new unstructured data sources are available for mining unique insights. Reports of injuries arising as a result of product usage are particularly concerning. In this paper, we utilize complimentary approaches to address this problem. We analyze two novel datasets; first, a government-maintained dataset of hazard and injury reports and second, a large dataset of cross-industry consumer product reviews manually coded for the presence of hazard and injury reports. We apply an unsupervised topic modeling approach to characterize the hazard and injury reports detected. Then, we implement a supervised transfer learning technique, using information obtained from the government-maintained dataset to detect hazard and injury reports in online reviews. Our results offer improved surveillance for monitoring hazards across multiple industries.
Pages/Duration:10 pages
URI:http://hdl.handle.net/10125/63866
ISBN:978-0-9981331-3-3
DOI:10.24251/HICSS.2020.127
Rights:Attribution-NonCommercial-NoDerivatives 4.0 International
https://creativecommons.org/licenses/by-nc-nd/4.0/
Appears in Collections: Data, Text, and Web Mining for Business Analytics


Please email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.

This item is licensed under a Creative Commons License Creative Commons