Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/43573

Developing methods for reproducible research in linguistics: a first step

File SizeFormat 
Poster_McDonnell_Hall.pdf73.61 kBAdobe PDFView/Open

Item Summary

Title: Developing methods for reproducible research in linguistics: a first step
Authors: McDonnell, Bradley
Hall, Patrick
Keywords: data citation
attribution
Linguistics
Issue Date: 06 Jan 2017
Abstract: Reproducible research in other fields has developed various software tools that facilitate the publishing of code and results in a single document that are linked directly to the data. In mainstream linguistics, however, such software does not exist. The workflows for including linguistic examples in published work typically involve manual methods of copying and pasting text from a database into a word processing document. These manual methods are error-prone and time-consuming--often involving tedious tasks of aligning glosses in tables or with tabs. Furthermore, the examples in these documents are in no way linked to the corpus. This poster presents a first-attempt at developing a family of scripts called glossbox that link data, code, and analysis. At present, glossbox works with the typesetting software LaTeX, allowing users to semi-automatically import examples directly from the corpus. These examples require little to no manual manipulation and automatically produce citations to the corpus.
Description: Poster: Reproducible research in other fields has developed various software tools that facilitate the publishing of code and results in a single document that are linked directly to the data. In mainstream linguistics, however, such software does not exist. The workflows for including linguistic examples in published work typically involve manual methods of copying and pasting text from a database into a word processing document. These manual methods are error-prone and time-consuming--often involving tedious tasks of aligning glosses in tables or with tabs. Furthermore, the examples in these documents are in no way linked to the corpus. This poster presents a first-attempt at developing a family of scripts called glossbox that link data, code, and analysis. At present, glossbox works with the typesetting software LaTeX, allowing users to semi-automatically import examples directly from the corpus. These examples require little to no manual manipulation and automatically produce citations to the corpus.
Sponsor: This material is based upon work supported by the National Science Foundation under grant SMA-1447886.
Pages/Duration: 1
URI/DOI: http://hdl.handle.net/10125/43573
Rights: Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported Attribution-NonCommercial-ShareAlike 3.0 United States
Appears in Collections:Presentations from the Linguistic Society of America symposium and poster session on Data Citation and Attribution in Linguistics, 5-9 January 2017, Austin TX



Items in ScholarSpace are protected by copyright, with all rights reserved, unless otherwise indicated.