Accessing, managing, and mobilizing an ELAN-based language documentation corpus: the Kwaras and Namuti tools

dc.contributor.author Caballero, Gabriela
dc.contributor.author Carroll, Lucien
dc.contributor.author Mach, Kevin
dc.date.accessioned 2019-02-14T11:31:16Z
dc.date.available 2019-02-14T11:31:16Z
dc.date.issued 2019-02
dc.description.abstract This paper introduces Kwaras and Namuti, two new tools for building, managing, accessing, and mobilizing ELAN-based language documentation corpora. Kwaras integrates WAV files, ELAN annotations, and document metadata into a web-based corpus, allowing immediate access to annotations and recordings. Namuti builds from Kwaras and enables different uses of language documentation products for different audiences and provides links from linguistic analyses to language documentation corpora. The main goal of these new tools is three-fold: (i) to facilitate the use of language documentation in linguistic analysis; (ii) to increase transparency of documentation-based analyses, providing interested users full access to the data on which generalizations are based and contextualization of the projects that generated the data; and (iii) to enable uses of language corpora that may serve the interests of multiple stakeholders, including academic researchers and community members interested in language maintenance and revitalization. We provide a basic overview of how Kwaras and Namuti work, lay out instructions on how to download and use Kwaras, and discuss what uses it currently supports. This article also issues a call for increased collaboration between linguists, community members, language activists, and software developers to further develop these and other similar resources.
dc.description.sponsorship National Foreign Language Resource Center
dc.format.extent 20 pages
dc.identifier.citation Caballero, Gabriela, Lucien Carroll & Kevin Mach. 2019. Accessing, managing, and mobilizing an ELAN-based language documentation corpus: the Kwaras and Namuti tools. Language Documentation & Conservation 13: 63-82.
dc.identifier.issn 1934-5275
dc.identifier.uri http://hdl.handle.net/10125/24799
dc.language.iso en-US
dc.publisher University of Hawaii Press
dc.rights Creative Commons Attribution-NonCommercial 4.0 International
dc.rights Attribution-NonCommercial 3.0 United States
dc.rights.uri http://creativecommons.org/licenses/by-nc/3.0/us/
dc.subject language documentation
dc.subject corpora
dc.subject technology
dc.subject ELAN
dc.title Accessing, managing, and mobilizing an ELAN-based language documentation corpus: the Kwaras and Namuti tools
dc.type Article
dc.type.dcmi Text
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
caballero_et_al_ldc.pdf
Size:
2.51 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.73 KB
Format:
Item-specific license agreed upon to submission
Description: