Forced Alignment for Understudied Language Varieties: Testing Prosodylab-Aligner with Tongan Data

Johnson, Lisa M.; Di Paolo, Marianna; Bell, Adrian

Forced Alignment for Understudied Language Varieties: Testing Prosodylab-Aligner with Tongan Data

Files

johnson_et_al.pdf (5.57 MB)

Date

2018-03

Authors

Johnson, Lisa M.

Di Paolo, Marianna

Bell, Adrian

Publisher

University of Hawaii Press

Starting Page

194

Ending Page

203

Abstract

Automated alignment of transcriptions to audio files expedites the process of preparing data for acoustic analysis. Unfortunately, the benefits of auto-alignment have generally been available only to researchers studying majority languages, for which large corpora exist and for which acoustic models have been created by large-scale research projects. Prosodylab-Aligner (PL-A), from McGill University, facilitates automated alignment and segmentation for understudied languages. It allows researchers to train acoustic models using the same audio files for which alignments will be created. Those models can then be used to create time-aligned Praat TextGrids with word and phone boundaries marked. For the benefit of others who wish to use PL-A for research projects, this paper reports on our use of PL-A on Tongan field recordings, reviewing the software, outlining required steps, and providing tips. Since field recordings often contain more background noise than the laboratory recordings for which PL-A was designed, the paper also discusses the relative benefits of removing background noise for both training and alignment purposes. Finally, it compares acoustic measures based on various alignments and compares boundary placements with those of human aligners, demonstrating that automated alignment is both feasible and less time-consuming than manual alignment.

Keywords

automated alignment, acoustic analysis, Tongan, understudied languages

Citation

Johnson, Lisa M., Marianna Di Paolo & Adrian Bell. 2018. Forced Alignment for Understudied Language Varieties: Testing Prosodylab-Aligner with Tongan Data. Language Documentation & Conservation 12. 80-123.

URI

http://hdl.handle.net/10125/24763

Extent

44 pages

Rights

Creative Commons Attribution-NonCommercial 4.0 International

Collections

Volume 12 : Language Documentation & Conservation

Full item page

Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.