Mestri, RohanBollam, PragnaOvergoor, GijsRand, William2021-12-242021-12-242022-01-04978-0-9981331-5-7http://hdl.handle.net/10125/79892Online user-generated reviews provide a unique view into consumer perceptions of a business. Extant research has demonstrated that text mining provides insight from textual reviews. More recently, we haven seen the adoption of image mining techniques to analyze visual content as well. With data comprising of user-generated imagery (UGI) and textual reviews, we propose to perform a combination of text- and image mining techniques to extract relevant attributes from both modalities. The analysis allows for a comparison between textual and visual content in online reviews. For the UGI analysis, we use a Deep Embedded Clustering model and for the User Generated Text Analysis we use a TF-IDF based mechanism to obtain attributes and polarities. The overall goal is to extract maximum information from text and images and compare the insights we gather from both. We analyze if any modality is self-sufficient or better than the other and also if both modalities combine to give similar or contrasting insights.10 pagesengAttribution-NonCommercial-NoDerivatives 4.0 InternationalElectronic Marketingmachine learningdeep learningonline reviewstext miningimage mininge-commercemarketingText vs. Image: An application of unsupervised multi-modal machine learning to online reviewstext10.24251/HICSS.2022.555