Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/41288

Improving Sentiment Analysis with Document-Level Semantic Relationships from Rhetoric Discourse Structures

File: paper0139.pdf (918.02 kB, Adobe PDF)

Item Summary

Title: Improving Sentiment Analysis with Document-Level Semantic Relationships from Rhetoric Discourse Structures
Authors: Märkle-Huß, Joscha; Feuerriegel, Stefan; Prendinger, Helmut
Keywords: Sentiment analysis; Semantic Relationships; Rhetoric structure theory; Machine learning
Date Issued: 04 Jan 2017
Abstract: Conventional sentiment analysis usually neglects semantic information between (sub-)clauses, as it merely implements so-called bag-of-words approaches, where the sentiment of individual words is aggregated independently of the document structure. Instead, we advance sentiment analysis by the use of rhetoric structure theory (RST), which provides a hierarchical representation of texts at document level. For this purpose, texts are split into elementary discourse units (EDUs). These EDUs span a hierarchical structure in the form of a binary tree, where the branches are labeled according to their semantic discourse. Accordingly, this paper proposes a novel combination of weighting and grid search to aggregate sentiment scores from the RST tree, as well as feature engineering for machine learning. We apply our algorithms to the especially hard task of predicting stock returns subsequent to financial disclosures. As a result, machine learning improves the balanced accuracy by 8.6 percent compared to the baseline.
Pages/Duration: 10 pages
URI: http://hdl.handle.net/10125/41288
ISBN: 978-0-9981331-0-2
DOI: 10.24251/HICSS.2017.135
Rights: Attribution-NonCommercial-NoDerivatives 4.0 International (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Appears in Collections: Data, Text, and Web Mining for Business Analytics Minitrack
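
Illustrative note on the abstract: the idea of aggregating sentiment scores over a binary RST tree with weights chosen by grid search might be sketched roughly as follows. This is not the authors' implementation. The `Leaf` and `Node` classes, the two weights `w_nucleus` and `w_satellite`, the sign-based decision rule, and the candidate grid are all assumptions made for illustration; the paper itself should be consulted for the actual weighting scheme and features.

```python
from dataclasses import dataclass
from itertools import product
from typing import Union


@dataclass
class Leaf:
    """An elementary discourse unit (EDU) with a word-level sentiment score."""
    score: float


@dataclass
class Node:
    """Inner node of a binary RST tree (hypothetical, simplified structure)."""
    left: "Tree"
    right: "Tree"
    nucleus: str  # "left" or "right": which child is assumed to be the nucleus


Tree = Union[Leaf, Node]


def aggregate(tree, w_nucleus, w_satellite):
    """Recursively combine child scores, weighting nucleus and satellite differently."""
    if isinstance(tree, Leaf):
        return tree.score
    left = aggregate(tree.left, w_nucleus, w_satellite)
    right = aggregate(tree.right, w_nucleus, w_satellite)
    if tree.nucleus == "left":
        return w_nucleus * left + w_satellite * right
    return w_satellite * left + w_nucleus * right


def balanced_accuracy(y_true, y_pred):
    """Mean of the per-class recalls over the classes present in y_true."""
    recalls = []
    for cls in set(y_true):
        idx = [i for i, y in enumerate(y_true) if y == cls]
        recalls.append(sum(y_pred[i] == cls for i in idx) / len(idx))
    return sum(recalls) / len(recalls)


def grid_search(trees, labels, grid):
    """Try every (w_nucleus, w_satellite) pair; keep the first pair with the best score."""
    best = None
    for w_n, w_s in product(grid, grid):
        # Assumed decision rule: predict class 1 if the aggregated score is positive.
        preds = [1 if aggregate(t, w_n, w_s) > 0 else 0 for t in trees]
        score = balanced_accuracy(labels, preds)
        if best is None or score > best[0]:
            best = (score, w_n, w_s)
    return best
```

In this sketch, a document whose positive EDU is the nucleus ends up with a positive aggregate score once the nucleus weight exceeds the satellite weight, which is the kind of structural signal a bag-of-words sum would miss.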


