Linking endangerment databases and descriptive linguistics: An assessment of the use of terms relating to language endangerment in grammars

dc.creatorRoberto Zariquiey
dc.creatorMónica Arakaki
dc.creatorJavier Vera
dc.creatorGuido Torres-Orihuela
dc.creatorClaret Cuba-Raime
dc.creatorCarlos Barrientos
dc.creatorAracelli García
dc.creatorAdriano Ingunza
dc.creatorHarald Hammarström
dc.date.accessioned2022-09-12T22:38:44Z
dc.date.available2022-09-12T22:38:44Z
dc.date.copyright2022
dc.date.issued2022-09
dc.description.abstractThe world harbours a diversity of some 6,500 mutually unintelligible languages. As has been increasingly observed by linguists, many minority languages are becoming endangered and will be lost forever if not documented. The increased urgency has led to the development of several global endangerment databases and a more fine-grained understanding of the language endangerment progression as well as its possible reversal. In the present paper, we explore the terminological correlates of this development as found in the descriptive linguistic literature, using a corpus of over 10,000 digitized grammatical descriptions. Comparing this with existing endangerment databases, we find that simply counting terms related to endangerment does signal endangerment, but the degree of endangerment is more difficult to assess from grammatical descriptions. The label endangered seems to be an umbrella term that covers different situations ranging from moribund languages with less than ten speakers to minority languages with several thousand speakers. For many languages considered endangered in existing databases, explicit terms to this effect cannot be found in their descriptions. The discrepancy is due to incompleteness of the searchterm set, gaps in the literature, and projected rather than observed information in the databases. Our explorations illustrate the potential for database curation assisted by computational searches both to maintain accuracy of the databases and to investigate assumed language endangerment. Future work includes a larger cloud of search terms, usage of term frequencies, and prescreening of descriptive literature for the existence of a relevant section. From the perspective of descriptive linguistics, this study calls for a more careful correlation between the language endangerment indexes, as developed in the global endangerment databases, and the treatment of the endangerment status of individual languages in descriptive grammars.
dc.description.sponsorshipNational Foreign Language Resource Center
dc.formatArticle
dc.format.extent29
dc.identifier.citationZariquiey, Roberto, Mónica Arakaki, Javier Vera, Guido Torres-Orihuela, Claret Cuba-Raime, Carlos Barrientos, Aracelli García, Adriano Ingunza, Harald Hammarström. 2022. Linking endangerment databases and descriptive linguistics: An assessment of the use of terms relating to language endangerment in grammars. Language Documentation & Conservation 16: 290-318.
dc.identifier.issn1934-5275
dc.identifier.urihttps://hdl.handle.net/10125/74681
dc.languageeng
dc.publisherUniversity of Hawaii Press
dc.titleLinking endangerment databases and descriptive linguistics: An assessment of the use of terms relating to language endangerment in grammars
dcterms.rightsCreative Commons Attribution-NonCommercial 4.0 International
dcterms.typeText
prism.endingpage318
prism.publicationnameLanguage Documentation & Conservation
prism.startingpage290
prism.volume16

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zariquiey_etal.pdf
Size:
1.69 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.73 KB
Format:
Item-specific license agreed upon to submission
Description: