“Data is Nice:” Theoretical and pedagogical implications of an Eastern Cherokee corpus

Date

2018

Contributor

Advisor

Department

Instructor

Depositor

Speaker

Researcher

Consultant

Interviewer

Narrator

Transcriber

Annotator

Journal Title

Journal ISSN

Volume Title

Publisher

University of Hawai'i Press

Volume

Number/Issue

Starting Page

Ending Page

Alternative Title

Abstract

This paper serves as a proof of concept for the usefulness of corpus creation in Cherokee language revitalization. It details the initial collection of a digital corpus of Cherokee/English texts and enumerates how corpus material can augment contemporary language revitalization efforts rather than simply preserving language for future analysis. By collecting and analyzing corpus material, we can quickly create new classroom materials and media products, and answer deeper theoretical linguistic questions. With a large enough corpus, we can even implement machine translation systems to facilitate the production of new texts. Although the vast majority of print material in Cherokee is in the Western dialect, this corpus has focused on Eastern texts. Expanding the dataset to include both dialects, however, will allow for comparison and facilitate generalizations about the Cherokee language as a whole. A corpus of Cherokee data can answer second language learners’ questions about the structure of the language and provide patterns for more effective, targeted learning of Cherokee. It can also provide teachers with ready access to accurate representations of the language produced by native speakers. By combining documentation and technology, we can leverage the power of databases to expedite and facilitate language revitalization.

Description

Keywords

language documentation, Cherokee, language corpus, machine translation, language technology

Citation

Frey, Benjamin. 2020. “Data is Nice:” Theoretical and pedagogical implications of an Eastern Cherokee corpus. In Silva, Wilson de Lima and Katherine J. Riestenberg. (Eds.) Collaborative Approaches to the Challenges of Language Documentation and Conservation: Selected papers from the 2018 Symposium on American Indian Languages (SAIL). Language Documentation & Conservation Special Publication no. 20 [PP 38-53] Honolulu: University of Hawai'i Press.

Extent

Format

Geographic Location

Time Period

Related To

LD&C Special Publication

Related To (URI)

Table of Contents

Rights

Creative Commons Attribution Non-Commercial Share Alike License

Rights Holder

Local Contexts

Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.