Developing methods for reproducible research in linguistics: a first step

dc.contributor.author McDonnell, Bradley
dc.contributor.author Hall, Patrick
dc.date.accessioned 2017-01-13T00:22:28Z
dc.date.available 2017-01-13T00:22:28Z
dc.date.issued 2017-01-06
dc.description Poster: Reproducible research in other fields has developed various software tools that facilitate the publishing of code and results in a single document that are linked directly to the data. In mainstream linguistics, however, such software does not exist. The workflows for including linguistic examples in published work typically involve manual methods of copying and pasting text from a database into a word processing document. These manual methods are error-prone and time-consuming--often involving tedious tasks of aligning glosses in tables or with tabs. Furthermore, the examples in these documents are in no way linked to the corpus. This poster presents a first-attempt at developing a family of scripts called glossbox that link data, code, and analysis. At present, glossbox works with the typesetting software LaTeX, allowing users to semi-automatically import examples directly from the corpus. These examples require little to no manual manipulation and automatically produce citations to the corpus. en_US
dc.description.abstract Reproducible research in other fields has developed various software tools that facilitate the publishing of code and results in a single document that are linked directly to the data. In mainstream linguistics, however, such software does not exist. The workflows for including linguistic examples in published work typically involve manual methods of copying and pasting text from a database into a word processing document. These manual methods are error-prone and time-consuming--often involving tedious tasks of aligning glosses in tables or with tabs. Furthermore, the examples in these documents are in no way linked to the corpus. This poster presents a first-attempt at developing a family of scripts called glossbox that link data, code, and analysis. At present, glossbox works with the typesetting software LaTeX, allowing users to semi-automatically import examples directly from the corpus. These examples require little to no manual manipulation and automatically produce citations to the corpus. en_US
dc.description.sponsorship This material is based upon work supported by the National Science Foundation under grant SMA-1447886. en_US
dc.format.extent 1 en_US
dc.identifier.uri http://hdl.handle.net/10125/43573
dc.language.iso en-US en_US
dc.rights Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported Attribution-NonCommercial-ShareAlike 3.0 United States en_US
dc.subject data citation en_US
dc.subject attribution en_US
dc.subject Linguistics en_US
dc.title Developing methods for reproducible research in linguistics: a first step en_US
dc.type Conference Paper en_US
dc.type Presentation en_US
dc.type.dcmi StillImage en_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Poster_McDonnell_Hall.pdf
Size:
73.61 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: