Developing methods for reproducible research in linguistics: a first step

dc.contributor.author McDonnell, Bradley
dc.contributor.author Hall, Patrick
dc.date.accessioned 2017-01-13T00:22:28Z
dc.date.available 2017-01-13T00:22:28Z
dc.date.issued 2017-01-06
dc.description Poster
dc.description.abstract Reproducible research in other fields has produced software tools that make it possible to publish code and results in a single document linked directly to the data. In mainstream linguistics, however, such software does not exist. The workflows for including linguistic examples in published work typically involve manually copying and pasting text from a database into a word processing document. These manual methods are error-prone and time-consuming, often involving tedious tasks such as aligning glosses in tables or with tabs. Furthermore, the examples in these documents are in no way linked to the corpus. This poster presents a first attempt at developing a family of scripts called glossbox that link data, code, and analysis. At present, glossbox works with the typesetting software LaTeX, allowing users to semi-automatically import examples directly from the corpus. These examples require little to no manual manipulation and automatically produce citations to the corpus.
dc.description.sponsorship This material is based upon work supported by the National Science Foundation under grant SMA-1447886.
dc.format.extent 1
dc.identifier.uri http://hdl.handle.net/10125/43573
dc.language.iso en-US
dc.rights Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported Attribution-NonCommercial-ShareAlike 3.0 United States
dc.subject data citation
dc.subject attribution
dc.subject Linguistics
dc.title Developing methods for reproducible research in linguistics: a first step
dc.type Conference Paper
dc.type Presentation
dc.type.dcmi StillImage
Files
Original bundle
Name: Poster_McDonnell_Hall.pdf
Size: 73.61 KB
Format: Adobe Portable Document Format