Don't Buy the Pig in a Poke: Benchmarking DNNs Inference Performance before Development

Date

2024-01-03

Starting Page

7790

Abstract

As deep neural networks (DNNs) are increasingly used in practical settings, developers often adopt pre-existing DNN architectures from a vast collection of well-established models. In industrial environments, however, factors beyond simply achieving high accuracy are becoming important. Runtime performance is critical, as delays can lead to conveyor stops or equipment damage. Currently, determining the runtime performance of a DNN requires multiple iterations and testing of specific configurations, whereas existing methods for benchmarking DNNs mainly compare different hardware or runtime parameters. We present tritonPerf, an approach to obtain and compare the runtime (latency and throughput) of a broad range of existing DNN architectures and settings in the final application environment. It allows data scientists to evaluate and compare the performance of a wide range of models before time- and resource-intensive hyperparameter tuning is performed. We demonstrate the benefit of tritonPerf in an extensive field study in an industrial setting, where we benchmark and compare the runtime of 57 different models.
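
The abstract gives no implementation details. As a rough, hypothetical illustration of the kind of latency and throughput measurement that such benchmarking involves (not tritonPerf's actual implementation, which targets the final application environment), the following Python sketch assumes onnxruntime and a placeholder model file named model.onnx:

# Illustrative sketch only: a minimal latency/throughput measurement loop.
# "model.onnx" is a placeholder; the float32 input assumption may not hold for every model.
import time
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx")
inp = session.get_inputs()[0]
# Replace dynamic/symbolic dimensions with 1 to build a synthetic input batch.
shape = [d if isinstance(d, int) else 1 for d in inp.shape]
dummy = np.random.rand(*shape).astype(np.float32)

# Warm-up runs so one-time initialization does not skew the measurement.
for _ in range(10):
    session.run(None, {inp.name: dummy})

latencies = []
for _ in range(100):
    start = time.perf_counter()
    session.run(None, {inp.name: dummy})
    latencies.append(time.perf_counter() - start)

latencies.sort()
print(f"median latency: {1e3 * latencies[len(latencies) // 2]:.2f} ms")
print(f"p95 latency:    {1e3 * latencies[int(0.95 * len(latencies))]:.2f} ms")
print(f"throughput:     {len(latencies) / sum(latencies):.1f} inferences/s")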

Keywords

Software Technology and Software Development, benchmark framework, computer vision, intelligent edge computing

Extent

10 pages

Related To

Proceedings of the 57th Hawaii International Conference on System Sciences

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International
