TEI and the Mixtepec-Mixtec corpus: data integration, annotation and normalization of heterogeneous data for an under-resourced language

Date
2019-03-03
Speaker
Bowers, Jack
Romary, Laurent
Description
This paper presents our approach to creating, editing, annotating and curating an extensible and reusable TEI corpus for Mixtepec-Mixtec. We cover issues particular to working with an under-resourced language and show how we integrate heterogeneous resources, normalize orthographic and phonetic data, and create searchable multi-layered annotations. (session 3.3.1)
Rights
Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported