Improving Sentiment Analysis with Document-Level Semantic Relationships from Rhetoric Discourse Structures

dc.contributor.author Märkle-Huß, Joscha
dc.contributor.author Feuerriegel, Stefan
dc.contributor.author Prendinger, Helmut
dc.date.accessioned 2016-12-29T00:30:14Z
dc.date.available 2016-12-29T00:30:14Z
dc.date.issued 2017-01-04
dc.description.abstract Conventional sentiment analysis usually neglects semantic information between (sub-)clauses, as it merely implements so-called bag-of-words approaches, where the sentiment of individual words is aggregated independently of the document structure. Instead, we advance sentiment analysis by the use of rhetoric structure theory (RST), which provides a hierarchical representation of texts at document level. For this purpose, texts are split into elementary discourse units (EDU). These EDUs span a hierarchical structure in the form of a binary tree, where the branches are labeled according to their semantic discourse. Accordingly, this paper proposes a novel combination of weighting and grid search to aggregate sentiment scores from the RST tree, as well as feature engineering for machine learning. We apply our algorithms to the especially hard task of predicting stock returns subsequent to financial disclosures. As a result, machine learning improves the balanced accuracy by 8.6 percent compared to the baseline.
dc.format.extent 10 pages
dc.identifier.doi 10.24251/HICSS.2017.135
dc.identifier.isbn 978-0-9981331-0-2
dc.identifier.uri http://hdl.handle.net/10125/41288
dc.language.iso eng
dc.relation.ispartof Proceedings of the 50th Hawaii International Conference on System Sciences
dc.rights Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject Sentiment analysis
dc.subject Semantic Relationships
dc.subject Rhetoric structure theory
dc.subject Machine learning
dc.title Improving Sentiment Analysis with Document-Level Semantic Relationships from Rhetoric Discourse Structures
dc.type Conference Paper
dc.type.dcmi Text
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
paper0139.pdf
Size:
918.02 KB
Format:
Adobe Portable Document Format
Description: