ScholarSpace
 


Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/1725


Title: Managing Fieldwork Data with Toolbox and the Natural Language Toolkit
Author(s): Robinson, Stuart; Aumann, Greg; Bird, Steven
Keywords: Toolbox; Natural Language Toolkit; NLTK; data management
Issue Date: 27-Jun-2007
Publisher: University of Hawai'i Press
Citation: Robinson, Stuart, Greg Aumann, and Steven Bird. 2007. Managing fieldwork data with Toolbox and the Natural Language Toolkit. Language Documentation & Conservation 1(1):44–57.
Abstract: This paper shows how fieldwork data can be managed using the program Toolbox together with the Natural Language Toolkit (NLTK) for the Python programming language. It provides background information about Toolbox and describes how it can be downloaded and installed. The basic functionality of the program for lexicons and texts is described, and its strengths and weaknesses are reviewed. Its underlying data format is briefly discussed, and Toolbox processing capabilities of NLTK are introduced, showing ways in which it can be used to extend the functionality of Toolbox. This is illustrated with a few simple scripts that demonstrate basic data management tasks relevant to language documentation, such as printing out the contents of a lexicon as HTML.
Sponsor(s): National Foreign Language Resource Center
URI: http://hdl.handle.net/10125/1725
ISSN: 1934-5275
Appears in Collections: Volume 01 Issue 1 : Language Documentation & Conservation

Files in This Item:

File: robinson.html
Description: Open this file for a link to the PDF version
Size: 38.26 kB
Format: HTML


This item is protected by original copyright


This item is licensed under a Creative Commons License

Items in ScholarSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 
