A Model for Detecting Accounting Frauds by using Machine Learning

Date

2022-01-04

Abstract

This paper develops a machine learning model that predicts signs of financial statement fraud by combining domain knowledge from machine learning and accounting. The input to the model is a published dataset of financial statements, and the output is a conclusion as to whether each financial statement shows signs of fraud. XGBoost is currently recognized as one of the most popular classification methods, offering fast performance, flexibility, and scalability. However, its default settings are not well suited to fraud detection on imbalanced datasets. To overcome this drawback, this research introduces a new machine learning model based on the XGBoost technique, called f(raud)-XGBoost. The proposed model not only inherits the advantages of XGBoost but also enables it to detect financial statement fraud. We use the Area Under the Receiver Operating Characteristic Curve (AUC) and NDCG@k for evaluation. The experimental results show that the new model performs slightly better than three existing models: a logistic regression model based on financial ratios, a support vector machine model, and a RUSBoost model.
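
The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes binary labels (1 = fraudulent, 0 = not) and uses the common exponential-gain form of NDCG; the paper's exact relevance definition and choice of k are not stated in this record, and the function names, labels, and scores below are hypothetical.

```python
import math


def roc_auc(labels, scores):
    """AUC as the fraction of (fraud, non-fraud) pairs ranked correctly; ties count half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def ndcg_at_k(labels, scores, k):
    """NDCG@k: DCG of the k highest-scored items divided by the best achievable DCG."""
    # Rank labels by descending predicted score.
    ranked = [y for _, y in sorted(zip(scores, labels), key=lambda t: -t[0])]
    dcg = sum((2 ** y - 1) / math.log2(i + 2) for i, y in enumerate(ranked[:k]))
    # The ideal ranking places every fraudulent item first.
    ideal = sorted(labels, reverse=True)[:k]
    idcg = sum((2 ** y - 1) / math.log2(i + 2) for i, y in enumerate(ideal))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical, imbalanced toy data: 2 fraudulent statements out of 10.
labels = [0, 0, 1, 0, 0, 0, 1, 0, 0, 0]
scores = [0.1, 0.4, 0.9, 0.2, 0.3, 0.05, 0.35, 0.6, 0.15, 0.25]

print("AUC:", roc_auc(labels, scores))
print("NDCG@3:", ndcg_at_k(labels, scores, 3))
```

AUC summarizes ranking quality over the whole dataset, while NDCG@k looks only at the k highest-scored statements, which may be the more relevant view when only a few cases can be investigated.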

Keywords

Fraud Detection Using Machine Learning, ensemble learning, fraud detection, machine learning, xgboost

Extent

10 pages

Related To

Proceedings of the 55th Hawaii International Conference on System Sciences

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International
