Rescuing Legacy Data
dc.contributor.author | Schmidt, Thomas
dc.contributor.author | Bennöhr, Jasmine
dc.date.accessioned | 2008-06-27T22:46:51Z
dc.date.available | 2008-06-27T22:46:51Z
dc.date.issued | 2008-06
dc.description.abstract | This paper discusses issues that arise in the transformation of electronic language data from outdated to modern, sustainable formats. We first describe the problem and then present four different cases in which corpora of spoken language were converted from legacy formats to an XML-based representation. For each of the four cases, we describe the conversion workflow and discuss the difficulties that we had to overcome. Based on this experience, we formulate some more general observations about transforming legacy data and conclude with a set of best practice recommendations for a more sustainable handling of language corpora.
dc.description.sponsorship | National Foreign Language Resource Center
dc.identifier.citation | Schmidt, Thomas and Jasmine Bennöhr. 2008. Rescuing Legacy Data. Language Documentation & Conservation 2(1):109–129.
dc.identifier.issn | 1934-5275
dc.identifier.uri | http://hdl.handle.net/10125/1803
dc.language.iso | eng
dc.publisher | University of Hawai'i Press
dc.subject | electronic language data
dc.subject | XML
dc.subject | legacy data
dc.subject | corpus
dc.title | Rescuing Legacy Data
dc.type | Article
dc.type.dcmi | Text
prism.endingpage | 129
prism.number | 1
prism.publicationname | Language Documentation & Conservation
prism.startingpage | 109
prism.volume | 2