Discovering Malware with Time Series Shapelets

Patri, Om; Wojnowicz, Michael; Wolff, Matt

Discovering Malware with Time Series Shapelets

Files

paper0749.pdf (2.56 MB)

Date

2017-01-04

Authors

Patri, Om

Wojnowicz, Michael

Wolff, Matt

Abstract

Malicious software (‘malware’) detection systems are usually signature-based and cannot stop attacks by malicious files they have never encountered. To stop these attacks, we need statistical learning approaches to identify root patterns behind execution of malware. We propose a machine learning approach for detection of malware from portable executable (PE) files. We create an ‘entropy time series’ representation of the content of each file, and then apply a unique time series classification method (called ‘shapelets’) for identifying malware. The shapelet-based approach picks up local discriminative features from the entropy signals. Our approach is file format agnostic, can deal with varying lengths in input instances, and provides fast classification. We evaluate our method on an industrial dataset containing thousands of executable files, and comparison with state-of-the-art methods illustrates the performance of our approach. This work is the first to use time series shapelets for malware detection and information security applications.

Keywords

Antivirus, Entropy Analysis, File Content, Malware, Shapelets

URI

http://hdl.handle.net/10125/41898

Extent

10 pages

Related To

Proceedings of the 50th Hawaii International Conference on System Sciences

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International

Collections

Deception, Digital Forensics, and Malware Minitrack

Full item page

Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.

Discovering Malware with Time Series Shapelets

Files

Date

Authors

Contributor

Advisor

Department

Instructor

Depositor

Speaker

Researcher

Consultant

Interviewer

Narrator

Transcriber

Annotator

Journal Title

Journal ISSN

Volume Title

Publisher

Volume

Number/Issue

Starting Page

Ending Page

Alternative Title

Abstract

Description

Keywords

Citation

URI

Extent

Format

Geographic Location

Time Period

Related To

Related To (URI)

Table of Contents

Rights

Rights Holder

Local Contexts

Collections