Exploring User Evaluations of Machine Learning Models: A Qualitative Study on the Impact of Confidence Intervals
| dc.contributor.author | Meyers, Scott | |
| dc.contributor.author | Murry, Paige | |
| dc.contributor.author | Jessup, Sarah | |
| dc.contributor.author | Alarcon, Gene | |
| dc.contributor.author | Harris, Krista | |
| dc.date.accessioned | 2024-12-26T21:05:05Z | |
| dc.date.available | 2024-12-26T21:05:05Z | |
| dc.date.issued | 2025-01-07 | |
| dc.description.abstract | Research on artificial intelligence and machine learning models has burgeoned in the last decade. However, research has seldom utilized qualitative methods for assessing user-based experiences and system evaluations of AI/ML models. This study aims to provide an example of how thematic text analysis can be used to provide greater insight into user experiences with these systems and examine how varying levels of model transparency affects evaluations. Participants (N = 130) completed an image binning monitoring task with either an uncalibrated classification model (UCM), which displayed high confidence regardless of classification accuracy or a calibrated classification model (CCM), which had greater calibration between accuracy and confidence. Results revealed detailed information on user evaluations for both models including various performance perceptions, impressions, and strategy behaviors. Furthermore, we identified key differences in user evaluations between these models and our confidence manipulation, such as greater trust and confidence display use. Qualitative analysis has been shown to be an effective approach for detailed investigation of user experiences and model evaluation. | |
| dc.format.extent | 10 | |
| dc.identifier.doi | https://doi.org/10.24251/HICSS.2025.101 | |
| dc.identifier.isbn | 978-0-9981331-8-8 | |
| dc.identifier.other | 46fec49b-167e-4fc0-a514-886b7cb3fb3b | |
| dc.identifier.uri | https://hdl.handle.net/10125/108939 | |
| dc.relation.ispartof | Proceedings of the 58th Hawaii International Conference on System Sciences | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | AI Model Evaluation | |
| dc.subject | image classification, machine learning, qualitative, trust | |
| dc.title | Exploring User Evaluations of Machine Learning Models: A Qualitative Study on the Impact of Confidence Intervals | |
| dc.type | Conference Paper | |
| dc.type.dcmi | Text | |
| prism.startingpage | 841 |
Files
Original bundle
1 - 1 of 1
