Rescuing Legacy Data

Date

2008-06

Contributor

Advisor

Department

Instructor

Depositor

Speaker

Researcher

Consultant

Interviewer

Narrator

Transcriber

Annotator

Journal Title

Journal ISSN

Volume Title

Publisher

University of Hawai'i Press

Volume

2

Number/Issue

1

Starting Page

109

Ending Page

129

Alternative Title

Abstract

This paper discusses issues that arise in the transformation of electronic language data from outdated to modern, sustainable formats. We first describe the problem and then present four different cases in which corpora of spoken language were converted from legacy formats to an XML-based representation. For each of the four cases, we describe the conversion workflow and discuss the difficulties that we had to overcome. Based on this experience, we formulate some more general observations about transforming legacy data and conclude with a set of best practice recommendations for a more sustainable handling of language corpora.

Description

Keywords

electronic language data, XML, legacy data, corpus

Citation

Schmidt, Thomas and Jasmine Bennöhr. 2008. Rescuing Legacy Data. Language Documentation & Conservation 2(1):109–129.

Extent

Format

Geographic Location

Time Period

Related To

Related To (URI)

Table of Contents

Rights

Rights Holder

Local Contexts

Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.