Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/1803

Rescuing Legacy Data

File Description Size Format  
schmidt.pdf Better quality 3.19 MB Adobe PDF View/Open
schmidtsmall.pdf Faster download 1.08 MB Adobe PDF View/Open

Item Summary

dc.contributor.author Schmidt, Thomas
dc.contributor.author Bennöhr, Jasmine
dc.date.accessioned 2008-06-27T22:46:51Z
dc.date.available 2008-06-27T22:46:51Z
dc.date.issued 2008-06
dc.identifier.citation Schmidt, Thomas and Jasmine Bennöhr. 2008. Rescuing Legacy Data. Language Documentation & Conservation 2(1):109–129.
dc.identifier.issn 1934-5275
dc.identifier.uri http://hdl.handle.net/10125/1803
dc.description.abstract This paper discusses issues that arise in the transformation of electronic language data from outdated to modern, sustainable formats. We first describe the problem and then present four different cases in which corpora of spoken language were converted from legacy formats to an XML-based representation. For each of the four cases, we describe the conversion workflow and discuss the difficulties that we had to overcome. Based on this experience, we formulate some more general observations about transforming legacy data and conclude with a set of best practice recommendations for a more sustainable handling of language corpora.
dc.description.sponsorship National Foreign Language Resource Center
dc.language.iso eng
dc.publisher University of Hawai'i Press
dc.subject electronic language data
dc.subject XML
dc.subject legacy data
dc.subject corpus
dc.title Rescuing Legacy Data
dc.type Article
dc.type.dcmi Text
prism.publicationname Language Documentation & Conservation
prism.volume 2
prism.number 1
prism.startingpage 109
prism.endingpage 129
Appears in Collections: Volume 02 Issue 1 : Language Documentation & Conservation


Please email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.

This item is licensed under a Creative Commons License Creative Commons