Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/4533

From corpus to grammar: how DOBES corpora can be exploited for descriptive linguistics

File SizeFormat 
bouda.pdf908.57 kBAdobe PDFView/Open

Item Summary

Title: From corpus to grammar: how DOBES corpora can be exploited for descriptive linguistics
Authors: Bouda, Peter
Helmbrecht, Johannes
Issue Date: Oct 2012
Publisher: University of Hawai'i Press
Citation: Bouda, Peter and Johannes Helmbrecht. 2012. From corpus to grammar: how DOBES corpora can be exploited for descriptive linguistics. In Sebastian Nordoff (ed). 2012. Electronic Grammaticography. 129-159. Honolulu: University of Hawaii Press.
Series/Report no.: LD&C Special Publication 04
Abstract: The principles and techniques of language documentation developed during the last one and half decades and the sheer amount of corpora which have been compiled for endangered languages up to now will have an impact on grammar writing in particular with respect to the data base of grammars. On the other hand, advances in computer technology allow a closer link between corpus data which are the basis for generalizations and the grammatical description itself. The future the grammatical description of a language will not only present selected illustrative examples, but will also be linked to the entire set of corpus data that are the empirical basis for it. This makes generalizations transparent to the reader and open to falsification by the scientific community.

The article critically examines the relations between the DOBES corpus, the analysis and the grammatical description itself. Special attention will be laid on the particular the two fundamental perspectives of a semasiological and an onomasiological grammar, can be translated into the various kinds of search and concordancing routines to be executed in the corpus analysis. We present a typology of searches descriptive linguists need to apply. This typology defines requirements with regard to the functionality of specific software to be developed.

In the second part, the article presents a technical solution, a preliminary version of a database/concordancing software specifically designed to fulfill the functions and principles outlined in the preceding sections.
Sponsor: National Foreign Language Resource Center
URI/DOI: http://hdl.handle.net/10125/4533
ISBN: 978-0-9856211-1-7
Appears in Collections:LD&C Special Publication No. 4: Electronic Grammaticography



Items in ScholarSpace are protected by copyright, with all rights reserved, unless otherwise indicated.