TEXT SUMMARIZATION IN QUANTUM COMPUTING

Loading...
Thumbnail Image

Date

Contributor

Instructor

Depositor

Speaker

Researcher

Consultant

Interviewer

Interviewee

Narrator

Transcriber

Annotator

Journal Title

Journal ISSN

Volume Title

Publisher

Volume

Number/Issue

Starting Page

Ending Page

Alternative Title

Abstract

This study explores the classification of paradigms in natural language processing (NLP) tasks, emphasizing the distinction between compositional and distributional approaches. While compositional methods prioritize structural understanding, distributional approaches focus on contextual behaviors. DisCo, a hybrid model integrating both paradigms, shows some hope in overcoming the limitations of traditional compositional and distributional models by incorporating grammatical structures as inputs and leveraging quantum computing principles. This hope is based on the analogy of composing words as entangled states and the meaning extraction as information flow between entangled states. In this dissertation, we show that this hope might not be realized on the current and near future quantum hardware. Our practical implementation reveals challenges from the model in handling longer texts and issues with handling semantic composition, especially in distinguishing whether a simple sentence is a summary of a simple paragraph. We offer insights on why such task is hard for near future quantum hardware in general, and if some solutions are realized they may not overcome classical NLP in terms of computational resources.

Description

Citation

DOI

Extent

109 pages

Format

Geographic Location

Time Period

Related To

Related To (URI)

Table of Contents

Rights

All UHM dissertations and theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission from the copyright owner.

Rights Holder

Catalog Record

Local Contexts

Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.