DiFiLE: A Knowledge-Distillation Longformer Model for Finance with Ensembling

dc.contributor.authorHristova, Diana
dc.contributor.authorSatani, Nirav
dc.date.accessioned2024-12-26T21:05:45Z
dc.date.available2024-12-26T21:05:45Z
dc.date.issued2025-01-07
dc.description.abstract10-K reports are a very important source of information in finance. Unfortunately, due to their text length they can hardly be analyzed by state-of-the-art transformer-based methods. In this paper, we aim to address this by combining the fields of efficient attention mechanisms, knowledge distillation (KD), and ensembling. Our five-step approach, DiFiLE, first pre-processes the data and splits it into data chunks based on the report items. Then, for each chunk, we estimate a teacher Longformer model. This is followed by KD and the generation of the corresponding student models. Finally, we aggregate the results from the chunks with ensembling and in particular stacking. We evaluate DiFiLE on the 10-K reports of the DJIA companies. The results show high performance of the teacher model, which is then well mimicked by its distilled version, requiring 30% fewer resources.
dc.format.extent10
dc.identifier.doihttps://doi.org/10.24251/HICSS.2025.191
dc.identifier.isbn978-0-9981331-8-8
dc.identifier.otherb5a56c9e-7347-4776-a6d1-654ee1417024
dc.identifier.urihttps://hdl.handle.net/10125/109031
dc.relation.ispartofProceedings of the 58th Hawaii International Conference on System Sciences
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectNatural Language Processing and Large Language Models Supporting Data Analytics for System Sciences
dc.subject10-k, efficient attention mechanisms, ensembling, knowledge distillation, transformer
dc.titleDiFiLE: A Knowledge-Distillation Longformer Model for Finance with Ensembling
dc.typeConference Paper
dc.type.dcmiText
prism.startingpage1581

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0155.pdf
Size:
1 MB
Format:
Adobe Portable Document Format