Sliding Reservoir Approach for Delayed Labeling in Streaming Data Classification

dc.contributor.authorHu, Hanqing
dc.contributor.authorKantardzic, Mehmed
dc.date.accessioned2016-12-29T00:42:23Z
dc.date.available2016-12-29T00:42:23Z
dc.date.issued2017-01-04
dc.description.abstractWhen concept drift occurs within streaming data, a streaming data classification framework needs to update the learning model to maintain its performance. Labeled samples required for training a new model are often unavailable immediately in real world applications. This delay of labels might negatively impact the performance of traditional streaming data classification frameworks. To solve this problem, we propose Sliding Reservoir Approach for Delayed Labeling (SRADL). By combining chunk based semi-supervised learning with a novel approach to manage labeled data, SRADL does not need to wait for the labeling process to finish before updating the learning model. Experiments with two delayed-label scenarios show that SRADL improves prediction performance over the naïve approach by as much as 7.5% in certain cases. The most gain comes from 18-chunk labeling delay time with continuous labeling delivery scenario in real world data experiments.
dc.format.extent10 pages
dc.identifier.doihttps://doi.org/10.24251/HICSS.2017.205
dc.identifier.isbn978-0-9981331-0-2
dc.identifier.urihttp://hdl.handle.net/10125/41358
dc.language.isoeng
dc.relation.ispartofProceedings of the 50th Hawaii International Conference on System Sciences
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectdelayed labeling
dc.subjectstreaming data mining
dc.subjectconcept drift
dc.subjectsemi-supervised learning
dc.subjectclassification
dc.titleSliding Reservoir Approach for Delayed Labeling in Streaming Data Classification
dc.typeConference Paper
dc.type.dcmiText

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
paper0209.pdf
Size:
1.5 MB
Format:
Adobe Portable Document Format