Item Description

Show full item record

Title: Managing Fieldwork Data with Toolbox and the Natural Language Toolkit 
Author: Robinson, Stuart; Aumann, Greg; Bird, Steven
Date: 2007-06-27
Publisher: University of Hawai'i Press
Citation: Robinson, Stuart, Greg Aumann, and Steven Bird. 2007. Managing fieldwork data with Toolbox and the Natural Language Toolkit. Language Documentation & Conservation 1(1):44–57.
Abstract: This paper shows how fieldwork data can be managed using the program Toolbox together with the Natural Language Toolkit (NLTK) for the Python programming language. It provides background information about Toolbox and describes how it can be downloaded and installed. The basic functionality of the program for lexicons and texts is described, and its strengths and weaknesses are reviewed. Its underlying data format is briefly discussed, and Toolbox processing capabilities of NLTK are introduced, showing ways in which it can be used to extend the functionality of Toolbox. This is illustrated with a few simple scripts that demonstrate basic data management tasks relevant to language documentation, such as printing out the contents of a lexicon as HTML.
Sponsorship: National Foreign Language Resource Center
ISSN: 1934-5275
URI: http://hdl.handle.net/10125/1725
Keywords: Toolbox, Natural Language Toolkit, NLTK, data management

Item File(s)

Description Files Size Format View
Open this file for a link to the PDF version robinson.html 38.25Kb HTML View/Open

This item appears in the following Collection(s)

Search


Advanced Search

Browse

My Account

Statistics

About