From PogChamps to Insights: Detecting Original Content in Twitch Chat

dc.contributor.author: Lindroos, Jari
dc.contributor.author: Peltonen, Jaakko
dc.contributor.author: Välisalo, Tanja
dc.contributor.author: Koskimaa, Raine
dc.contributor.author: Toivanen, Ida
dc.date.accessioned: 2024-12-26T21:06:28Z
dc.date.available: 2024-12-26T21:06:28Z
dc.date.issued: 2025-01-07
dc.description.abstract: The vast volume of chat messages generated during esports events on Twitch represents a valuable source of data for understanding audience behavior. However, the sheer quantity and dynamic nature of this data make manual analysis impractical. This study addresses this challenge by introducing FinTwitchBERT, a model fine-tuned to classify Twitch chat messages into four categories based on their uniqueness. Our model demonstrates the ability to distinguish between original content, repetitive messages such as emote spamming, formulaic messages, and interactive commands chat participants use to interact with channel bots. Pre-trained on over 18 million Finnish Twitch chat messages and utilizing a combination of semi-supervised learning and iterative pseudo-labeling with human-in-the-loop validation, FinTwitchBERT achieves 97.42% accuracy on a test set of unseen chat messages with a limited initial dataset of only 7,529 manually annotated messages.
dc.format.extent: 10
dc.identifier.doi: https://doi.org/10.24251/HICSS.2025.308
dc.identifier.isbn: 978-0-9981331-8-8
dc.identifier.other: 560f1406-0c9c-46ce-abec-e10de4c46bb1
dc.identifier.uri: https://hdl.handle.net/10125/109148
dc.relation.ispartof: Proceedings of the 58th Hawaii International Conference on System Sciences
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject: Data Analytics, Data Mining, and Machine Learning for Social Media
dc.subject: chat, machine learning, natural language processing, social media analysis, twitch
dc.title: From PogChamps to Insights: Detecting Original Content in Twitch Chat
dc.type: Conference Paper
dc.type.dcmi: Text
prism.startingpage: 2538

Files

Original bundle

Name: 0249.pdf
Size: 745.13 KB
Format: Adobe Portable Document Format