“Data is Nice:” Theoretical and pedagogical implications of an Eastern Cherokee corpus

Date
2018
Authors
Frey, Benjamin
Contributor
Advisor
Department
Instructor
Depositor
Speaker
Researcher
Consultant
Interviewer
Annotator
Journal Title
Journal ISSN
Volume Title
Publisher
University of Hawai'i Press
Volume
Number/Issue
Starting Page
Ending Page
Alternative Title
Abstract
This paper serves as a proof of concept for the usefulness of corpus creation in Cherokee language revitalization. It details the initial collection of a digital corpus of Cherokee/English texts and enumerates how corpus material can augment contemporary language revitalization efforts rather than simply preserving language for future analysis. By collecting and analyzing corpus material, we can quickly create new classroom materials and media products, and answer deeper theoretical linguistic questions. With a large enough corpus, we can even implement machine translation systems to facilitate the production of new texts. Although the vast majority of print material in Cherokee is in the Western dialect, this corpus has focused on Eastern texts. Expanding the dataset to include both dialects, however, will allow for comparison and facilitate generalizations about the Cherokee language as a whole. A corpus of Cherokee data can answer second language learners’ questions about the structure of the language and provide patterns for more effective, targeted learning of Cherokee. It can also provide teachers with ready access to accurate representations of the language produced by native speakers. By combining documentation and technology, we can leverage the power of databases to expedite and facilitate language revitalization.
Description
Keywords
language documentation, Cherokee, language corpus, machine translation, language technology
Citation
Frey, Benjamin. 2020. “Data is Nice:” Theoretical and pedagogical implications of an Eastern Cherokee corpus. In Silva, Wilson de Lima and Katherine J. Riestenberg. (Eds.) Collaborative Approaches to the Challenges of Language Documentation and Conservation: Selected papers from the 2018 Symposium on American Indian Languages (SAIL). Language Documentation & Conservation Special Publication no. 20 [PP 38-53] Honolulu: University of Hawai'i Press.
Extent
Format
Geographic Location
Time Period
Related To
LD&C Special Publication
Table of Contents
Rights
Creative Commons Attribution Non-Commercial Share Alike License
Rights Holder
Local Contexts
Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.