A corpus linguistics perspective on language documentation, data, and challenge of small corpora

dc.contributor.author Lüdeling, Anke
dc.date.accessioned 2012-07-05T23:32:37Z
dc.date.available 2012-07-05T23:32:37Z
dc.date.issued 2012-08
dc.description.abstract This paper deals with issues of corpus design that might prove problematic for the study of under-resourced languages, e.g. in language documentation. It argues that it is not yet well understood which linguistic and extra-linguistic (predictor) variables cause linguistic variation (i.e. the response variable), which means that the scope of a linguistic finding cannot always be assessed. In order to deal with this problem, it is argued that we need a flexible corpus architecture with the option of adding meta-data to corpora/sub-corpora at any point in time.
dc.description.sponsorship National Foreign Language Resource Center
dc.identifier.citation Lüdeling, Anke. 2012. A corpus linguistics perspective on language documentation, data, and challenge of small corpora. In Frank Seifart, Geoffrey Haig, Nikolaus P. Himmelmann, Dagmar Jung, Anna Margetts, and Paul Trilsbeek (eds). 2012. Potentials of Language Documentation: Methods, Analyses, and Utilization. 39-45. Honolulu: University of Hawai'i Press.
dc.identifier.isbn 978-0-9856211-0-0
dc.identifier.uri http://hdl.handle.net/10125/4514
dc.publisher University of Hawai'i Press
dc.relation.ispartofseries LD&C Special Publication
dc.rights Creative Commons Attribution Non-Commercial Share Alike License
dc.title A corpus linguistics perspective on language documentation, data, and challenge of small corpora
prism.endingpage 45
prism.startingpage 39
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
05luedeling.pdf
Size:
74.46 KB
Format:
Adobe Portable Document Format
Description: