Please use this identifier to cite or link to this item:

SayMore: Language documentation productivity

Video Preview


Not all videos support streaming previews. You will not be able to jump to portions of the video that have not been downloaded (progress shown as a yellow bar).

In cases where streaming is not supported, the full video will be loaded before playing. If your computer is capable of playing the video files, it may be advisable to download using the link below instead of trying to view it in your browser.

File SizeFormat 
26153.mp316.32 MBMP3View/Open
26153.mp430.1 MBMPEG-4View/Open

Item Summary

Title: SayMore: Language documentation productivity
Authors: Hatton, John
Issue Date: 28 Feb 2013
Description: Language Documenters quickly amass a large number of original recordings and artifacts based on them. We need to manage recordings, document informed consent, transcribe, translate, enter metadata, convert file formats, and, finally, submit to a digital archive. Along the way, we need to keep all these files well-organized and labeled. And we must keep track of the goals of the project in order to emerge with the desired coverage in areas such as genre, spontaneity, and the social roles of the speaker. We have powerful software for parts of this workflow, including Arbil, Elan, and EXMARaLDA. However, many of these tools appear best-suited for rather computer-savvy linguists, or those who can attend training courses. Recent linguistic software including WeSay (Albright and Author 2007) and FOLKER (Schmidt and Schütte 2010) have demonstrated that we can involve a wider spectrum of participants by using software with a task-focused interface that prioritizes clarity and efficiency over flexibility. In compensation, such software needs to emit data files that can be opened in more complex/powerful applications for further work. This paper presents SayMore (, a new software tool that streamlines the collection and annotations of recordings. Currently, SayMore eases the collecting of media files from a recording device, the addition of metadata, transcription, and, perhaps uniquely, oral annotation (respeaking and translation). All time-aligned data is stored in ELAN XML format, so that projects needing to go beyond SayMore’s built-in annotation capability can do so easily. For interlinearization, SayMore can export to FLEx, Toolbox, and other formats. For sharing with the community, SayMore produces subtitled videos. For archiving, it has built-in capability to convert file formats to those appropriate for long-term accessibility. Finally, the paper describes how SayMore helps researchers monitor progress towards project goals along several axes, including genre, spontaneity, and which workflow steps have been completed.
Rights: Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported
Appears in Collections:3rd International Conference on Language Documentation and Conservation (ICLDC)

Items in ScholarSpace are protected by copyright, with all rights reserved, unless otherwise indicated.