Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/43573

Developing methods for reproducible research in linguistics: a first step

File Size Format  
Poster McDonnell Hall.pdf 73.61 kB Adobe PDF View/Open

Item Summary

Title:Developing methods for reproducible research in linguistics: a first step
Authors:McDonnell, Bradley
Hall, Patrick
Keywords:data citation
attribution
Linguistics
Date Issued:06 Jan 2017
Abstract:Reproducible research in other fields has developed various software tools that facilitate the publishing of code and results in a single document that are linked directly to the data. In mainstream linguistics, however, such software does not exist. The workflows for including linguistic examples in published work typically involve manual methods of copying and pasting text from a database into a word processing document. These manual methods are error-prone and time-consuming--often involving tedious tasks of aligning glosses in tables or with tabs. Furthermore, the examples in these documents are in no way linked to the corpus. This poster presents a first-attempt at developing a family of scripts called glossbox that link data, code, and analysis. At present, glossbox works with the typesetting software LaTeX, allowing users to semi-automatically import examples directly from the corpus. These examples require little to no manual manipulation and automatically produce citations to the corpus.
Description:Poster: Reproducible research in other fields has developed various software tools that facilitate the publishing of code and results in a single document that are linked directly to the data. In mainstream linguistics, however, such software does not exist. The workflows for including linguistic examples in published work typically involve manual methods of copying and pasting text from a database into a word processing document. These manual methods are error-prone and time-consuming--often involving tedious tasks of aligning glosses in tables or with tabs. Furthermore, the examples in these documents are in no way linked to the corpus. This poster presents a first-attempt at developing a family of scripts called glossbox that link data, code, and analysis. At present, glossbox works with the typesetting software LaTeX, allowing users to semi-automatically import examples directly from the corpus. These examples require little to no manual manipulation and automatically produce citations to the corpus.
Pages/Duration:1
URI:http://hdl.handle.net/10125/43573
Rights:Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported
Attribution-NonCommercial-ShareAlike 3.0 United States
Appears in Collections: Presentations from the Linguistic Society of America symposium and poster session on Data Citation and Attribution in Linguistics, 5-9 January 2017, Austin TX


Please email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.

Items in ScholarSpace are protected by copyright, with all rights reserved, unless otherwise indicated.