Documenting and Researching Endangered Languages: The Pangloss Collection

dc.contributor.authorMichailovsky, Boyd
dc.contributor.authorMazaudon, Martine
dc.contributor.authorMichaud, Alexis
dc.contributor.authorGuillaume, Séverine
dc.contributor.authorFrançois, Alexandre
dc.contributor.authorAdamou, Evangelia
dc.date.accessioned2014-06-03T21:48:49Z
dc.date.available2014-06-03T21:48:49Z
dc.date.issued2014-06
dc.description.abstractThe Pangloss Collection is a language archive developed since 1994 at the Langues et Civilisations à Tradition Orale (LACITO) research group of the French Centre National de la Recherche Scientifique (CNRS). It contributes to the documentation and study of the world’s languages by providing free access to documents of connected, spontaneous speech, mostly in endangered or under-resourced languages, recorded in their cultural context and transcribed in consultation with native speakers. The Collection is an Open Archive containing media files (recordings), text annotations, and metadata; it currently contains over 1,400 recordings in 70 languages, including more than 400 transcribed and annotated documents. The annotations consist of transcription, free translation in English, French and/or other languages, and, in many cases, word or morpheme glosses; they are time-aligned with the recordings, usually at the utterance level. A web interface makes these annotations accessible online in an interlinear display format, in synchrony with the sound, using any standard browser. The structure of the XML documents makes them accessible to searching and indexing, always preserving the links to the recordings. Long-term preservation is guaranteed through a partnership with a digital archive. A guiding principle of the Pangloss Collection is that a close association between documentation and research is highly profitable to both. This article presents the collections currently available; it also aims to convey a sense of the range of possibilities they offer to the scientific and speaker communities and to the general public.
dc.description.sponsorshipNational Foreign Language Resource Center
dc.format.extent17
dc.identifier.citationMichailovsky, Boyd, Martine Mazaudon, Alexis Michaud, Séverine Guillaume, Alexandre François, and Evangelia Adamou. 2014. Documenting and Researching Endangered Languages: The Pangloss Collection. Language Documentation & Conservation. 8: 119-135
dc.identifier.issn1934-5275
dc.identifier.urihttp://hdl.handle.net/10125/4621
dc.language.isoeng
dc.publisherUniversity of Hawaii Press
dc.rightsCreative Commons Attribution-NonCommercial 3.0 Unported
dc.rightsAttribution-NonCommercial 3.0 United States
dc.rights.urihttp://creativecommons.org/licenses/by-nc/3.0/us/
dc.subjectPangloss Collection
dc.subjectarchive
dc.subjectlanguage documentation
dc.subjectLangues et Civilisations à Tradition Orale
dc.subjectLACITO
dc.subjectendangered languages
dc.titleDocumenting and Researching Endangered Languages: The Pangloss Collection
dc.typeArticle
dc.type.dcmiText
prism.endingpage135
prism.publicationnameLanguage Documentation & Conservation
prism.startingpage119
prism.volume8

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
michailovsky.pdf
Size:
1.36 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: