How Useful are Hand-crafted Data? Making Cases for Anomaly Detection Methods

Du, Len; Hutter, Marcus

How Useful are Hand-crafted Data? Making Cases for Anomaly Detection Methods

dc.contributor.author	Du, Len
dc.contributor.author	Hutter, Marcus
dc.date.accessioned	2020-12-24T19:08:58Z
dc.date.available	2020-12-24T19:08:58Z
dc.date.issued	2021-01-05
dc.description.abstract	While the importance of small data has been admitted in principle, they have not been widely adopted as a necessity in current machine learning or data mining research. Most predominantly, machine learning methods were typically evaluated under a “bigger is better” presumption. The more (and the more complex) data we could pour at a method, the better we thought we were at estimating its performance. We deem this mindset detrimental to interpretability, explainability, and the sustained development of the field. For example, despite that new outlier detection methods were often inspired by small, low dimensional samples, their performance has been exclusively evaluated by large, high-dimensional datasets resembling real-world use cases. With these “big data” we miss the chance to gain insights from close looks at how exactly the algorithms perform, as we mere humans cannot really comprehend the samples. In this work, we explore in the exactly opposite direction. We run several classical anomaly detection methods against small, mindfully crafted cases on which the results can be examined in detail. In addition to better understanding of these classical algorithms, our exploration has actually led to the discovery of some novel uses of classical anomaly detection methods to our surprise.
dc.format.extent	10 pages
dc.identifier.doi	10.24251/HICSS.2021.104
dc.identifier.isbn	978-0-9981331-4-0
dc.identifier.uri	http://hdl.handle.net/10125/70716
dc.language.iso	English
dc.relation.ispartof	Proceedings of the 54th Hawaii International Conference on System Sciences
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject	Accountability, Evaluation, and Obscurity of AI Algorithms
dc.subject	anomaly detection
dc.subject	evaluation
dc.subject	explainability
dc.subject	small data
dc.subject	testing ai
dc.title	How Useful are Hand-crafted Data? Making Cases for Anomaly Detection Methods
prism.startingpage	847

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 0084.pdf
Size:: 535.6 KB
Format:: Adobe Portable Document Format

Download

Collections

Accountability, Evaluation, and Obscurity of AI Algorithms