Is a Pretrained Model the Answer to Situational Awareness Detection on Social Media?

Lo, Siaw Ling; Lee, Kahhe; Zhang, Yuhao

Files

0206.pdf (575.91 KB)

Date

2023-01-03

Authors

Lo, Siaw Ling

Lee, Kahhe

Zhang, Yuhao

Starting Page

2110

Abstract

Social media can be a valuable source of information about an event or incident on the ground. However, the vast amount of content shared and the linguistic variation found on social media make it challenging to identify the situational awareness content that first responders need for decision-making. In this study, we assess whether pretrained models can address these challenges. We study various pretrained models in detail, including static word embeddings (such as Word2Vec and GloVe) and contextualized word embeddings (such as DistilBERT). According to our findings, a vanilla DistilBERT pretrained language model is insufficient to identify situational awareness information. Fine-tuning on datasets covering various event types, together with vocabulary extension, is essential to adapt a DistilBERT model for real-world situational awareness detection.

Keywords

Data Analytics, Data Mining, and Machine Learning for Social Media; BERT; fine-tuning; pretrained models; situational awareness; vocabulary extension

URI

https://hdl.handle.net/10125/102894

Extent

10

Related To

Proceedings of the 56th Hawaii International Conference on System Sciences

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International

Collections

Data Analytics, Data Mining, and Machine Learning for Social Media

