Forced Alignment for Understudied Language Varieties: Testing Prosodylab-Aligner with Tongan Data

dc.contributor.author: Johnson, Lisa M.
dc.contributor.author: Di Paolo, Marianna
dc.contributor.author: Bell, Adrian
dc.date.accessioned: 2018-03-13T06:23:55Z
dc.date.available: 2018-03-13T06:23:55Z
dc.date.issued: 2018-03
dc.description.abstract: Automated alignment of transcriptions to audio files expedites the process of preparing data for acoustic analysis. Unfortunately, the benefits of auto-alignment have generally been available only to researchers studying majority languages, for which large corpora exist and for which acoustic models have been created by large-scale research projects. Prosodylab-Aligner (PL-A), from McGill University, facilitates automated alignment and segmentation for understudied languages. It allows researchers to train acoustic models using the same audio files for which alignments will be created. Those models can then be used to create time-aligned Praat TextGrids with word and phone boundaries marked. For the benefit of others who wish to use PL-A for research projects, this paper reports on our use of PL-A on Tongan field recordings, reviewing the software, outlining required steps, and providing tips. Since field recordings often contain more background noise than the laboratory recordings for which PL-A was designed, the paper also discusses the relative benefits of removing background noise for both training and alignment purposes. Finally, it compares acoustic measures based on various alignments and compares boundary placements with those of human aligners, demonstrating that automated alignment is both feasible and less time-consuming than manual alignment.
dc.description.sponsorship: National Foreign Language Resource Center
dc.format.extent: 44 pages
dc.identifier.citation: Johnson, Lisa M., Marianna Di Paolo & Adrian Bell. 2018. Forced Alignment for Understudied Language Varieties: Testing Prosodylab-Aligner with Tongan Data. Language Documentation & Conservation 12. 80-123.
dc.identifier.issn: 1934-5275
dc.identifier.uri: http://hdl.handle.net/10125/24763
dc.language.iso: en-US
dc.publisher: University of Hawaii Press
dc.rights: Creative Commons Attribution-NonCommercial 4.0 International
dc.subject: automated alignment
dc.subject: acoustic analysis
dc.subject: Tongan
dc.subject: understudied languages
dc.title: Forced Alignment for Understudied Language Varieties: Testing Prosodylab-Aligner with Tongan Data
dc.type: Article
dc.type.dcmi: Text
prism.endingpage: 123
prism.startingpage: 80

Files

Original bundle

Name: johnson_et_al.pdf
Size: 5.57 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.62 KB
Format: Item-specific license agreed upon to submission