Please use this identifier to cite or link to this item:

Dissecting Moneyball: Improving Classification Model Interpretability in Baseball Pitch Prediction

File Size Format  
0026.pdf 787.3 kB Adobe PDF View/Open

Item Summary

Title:Dissecting Moneyball: Improving Classification Model Interpretability in Baseball Pitch Prediction
Authors:Hickey, Kevin
Zhou, Lina
Tao, Jie
Keywords:Collaboration for Data Science
baseball analytics
data science
machine learning
model interpretability
show 1 morepredictive analysis
show less
Date Issued:07 Jan 2020
Abstract:Data science, where technical expertise meets do-main knowledge, is collaborative by nature. Complex machine learning models have achieved human-level performance in many areas, yet they face adoption challenges in practice due to limited interpretability of model outputs, particularly for users who lack specialized technical knowledge. One key question is how to unpack complex classification models by enhancing their interpretability to facilitate collaboration in data science research and application. In this study, we extend two state-of-the-art methods for drawing fine-grained explanations from the results of classification models. The main extensions include aggregating explanations from individual instances to a user-defined aggregation level, and providing explanations with the original features rather than engineered representations. We use the prediction of baseball pitch outcome as a case to evaluate our extended methods. The experiment results of the methods with real sensor data demonstrate their improved interpretability while pre-serving superior prediction performance.
Pages/Duration:10 pages
Rights:Attribution-NonCommercial-NoDerivatives 4.0 International
Appears in Collections: Collaboration for Data Science

Please email if you need this content in ADA-compliant format.

This item is licensed under a Creative Commons License Creative Commons