The first Mirandese text-to-speech system 

Date

2016-02

Contributor

Advisor

Department

Instructor

Depositor

Speaker

Researcher

Consultant

Interviewer

Narrator

Transcriber

Annotator

Journal Title

Journal ISSN

Volume Title

Publisher

University of Hawai'i Press

Volume

Number/Issue

Starting Page

150

Ending Page

158

Alternative Title

Abstract

This paper describes the creation of base NLP resources and tools for an under-resourced minority language spoken in Portugal, Mirandese, in the context of the generation of a text-to-speech system, a collaborative citizenship project between Microsoft, ILTEC, and ALM – Associaçon de la Lhéngua Mirandesa. Development efforts encompassed the compilation of a large textual corpus, definition of a complete phone-set, development of a tokenizer, inflector, TN and GTP modules, and creation of a large phonetic lexicon with syllable segmentation, stress mark-up, and POS. The TTS system will provide an open access web interface freely available to the community, along with the other resources. We took advantage of mature tools, resources, and processes already available for phylogenetically-close languages, allowing us to cut development time and resources to a great extent, a solution that can be viable for other lesser-spoken languages which enjoy a similar situation.

Description

Keywords

Citation

Ferreira, José Pedro, Cristiano Chesi, Daan Baldewijns, Daniela Braga, Miguel Dias, and Margarita Correia. 2016. The first Mirandese text-to-speech system. In Vera Ferreira and Peter Bouda (eds.). Language Documentation and Conservation in Europe. 150-158. Honolulu: University of Hawai'i Press.

Extent

Format

Geographic Location

Time Period

Related To

Related To (URI)

Table of Contents

Rights

Creative Commons Attribution Non-Commercial Share Alike License

Rights Holder

Local Contexts

Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.