Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/50000

Understanding Topic Models in Context: A Mixed-Methods Approach to the Meaningful Analysis of Large Document Collections

File Size Format  
paper0113.pdf 1.17 MB Adobe PDF View/Open

Item Summary

Title:Understanding Topic Models in Context: A Mixed-Methods Approach to the Meaningful Analysis of Large Document Collections
Authors:Eickhoff, Matthias
Wieneke, Runhild
Keywords:Data, Text and Web Mining for Business Analytics
Topic Modelling, Mixed Methdos, LDA, Topic Coding, Textual Data
Date Issued:03 Jan 2018
Abstract:In recent years, we have witnessed an unprecedented proliferation of large document collections. This development has spawned the need for appropriate analytical means. In particular, to seize the thematic composition of large document collections, researchers increasingly draw on quantitative topic models. Among their most prominent representatives is the Latent Dirichlet Allocation (LDA). Yet, these models have significant drawbacks, e.g. the generated topics lack context and thus meaningfulness. Prior research has rarely addressed this limitation through the lens of mixed-methods research. We position our paper towards this gap by proposing a structured mixed-methods approach to the meaningful analysis of large document collections. Particularly, we draw on qualitative coding and quantitative hierarchical clustering to validate and enhance topic models through re-contextualization. To illustrate the proposed approach, we conduct a case study of the thematic composition of the AIS Senior Scholars' Basket of Journals.
Pages/Duration:10 pages
URI/DOI:http://hdl.handle.net/10125/50000
ISBN:978-0-9981331-1-9
DOI:10.24251/HICSS.2018.113
Rights:Attribution-NonCommercial-NoDerivatives 4.0 International
https://creativecommons.org/licenses/by-nc-nd/4.0/
Appears in Collections: Data, Text and Web Mining for Business Analytics


Please email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.

Items in ScholarSpace are protected by copyright, with all rights reserved, unless otherwise indicated.