AI Readiness: A Reusability Study of Popular AI Algorithms

dc.contributor.authorQuick, Rob
dc.contributor.authorKasula, Mohith
dc.date.accessioned2024-12-26T21:11:19Z
dc.date.available2024-12-26T21:11:19Z
dc.date.issued2025-01-07
dc.description.abstractThe FAIR Data Principles of findability, accessibility, interoperability, and reusability provide a roadmap to reusing data analysis findings and reproducibility of AI-based data analysis. However, the work done during this research project has identified an issue that impacts AI reproducibility before code and data interoperability can be considered. Namely, code reusability when attempting to recreate the hardware and system-level software or the “runtime environment.” While attempting to determine the metadata needed to FAIRly couple datasets with AI algorithms, the research team determined that the problem of recreating the runtime environment of published state-of-the-art algorithms from the website Papers with Code provided a hurdle that must be overcome before automated data-algorithm coupling can be considered. While containerization solutions such as Docker or Singularity are created to address the issue of inconsistent runtime environments, few AI algorithm developers have embraced publishing containers alongside their AI codes, opting for documenting software dependencies, which only tell part of the runtime story. Additionally, containers are software, and many issues affecting the recreation of runtime environments can also affect orchestrated container solutions. This work describes the process employed to survey 75 openly available AI algorithms, as recorded by Papers with Code, spanning the machine learning areas of computer vision, audio analysis, and natural language processing. It also makes a case that merely publishing the algorithm software repository and datasets used to benchmark the accuracy of the analysis is not enough to enable the reproducibility of results or reuse of AI algorithms. Finally, it identifies the gap in runtime environment reusability between code repositories like GitHub and commercial services like Hugging Face to focus future work. It proposes providing solutions like a container library, enhanced documentation, and other methods to allow reproducible and reusable research and a roadmap for continuing toward a review of enhancing AI-ready data.
dc.format.extent8
dc.identifier.doi10.24251/HICSS.2025.879
dc.identifier.isbn978-0-9981331-8-8
dc.identifier.other505dad9c-841b-4a80-9046-8d8a70e80b58
dc.identifier.urihttps://hdl.handle.net/10125/109731
dc.relation.ispartofProceedings of the 58th Hawaii International Conference on System Sciences
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectSoftware Sustainability: Research on Usability, Maintainability, and Reproducibility
dc.subjectartificial intelligence, fair ai, machine learning, reproducibility, reusability
dc.titleAI Readiness: A Reusability Study of Popular AI Algorithms
dc.typeConference Paper
dc.type.dcmiText
prism.startingpage7349

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0716.pdf
Size:
498.97 KB
Format:
Adobe Portable Document Format