A corpus linguistics perspective on language documentation, data, and challenge of small corpora

dc.contributor.authorLüdeling, Anke
dc.date.accessioned2012-07-05T23:32:37Z
dc.date.available2012-07-05T23:32:37Z
dc.date.issued2012-08
dc.description.abstractThis paper deals with issues of corpus design that might prove problematic for the study of under-resourced languages, e.g. in language documentation. It argues that it is not yet well understood which linguistic and extra-linguistic (predictor) variables cause linguistic variation (i.e. the response variable), which means that the scope of a linguistic finding cannot always be assessed. In order to deal with this problem, it is argued that we need a flexible corpus architecture with the option of adding meta-data to corpora/sub-corpora at any point in time.
dc.description.sponsorshipNational Foreign Language Resource Center
dc.identifier.citationLüdeling, Anke. 2012. A corpus linguistics perspective on language documentation, data, and challenge of small corpora. In Frank Seifart, Geoffrey Haig, Nikolaus P. Himmelmann, Dagmar Jung, Anna Margetts, and Paul Trilsbeek (eds). 2012. Potentials of Language Documentation: Methods, Analyses, and Utilization. 39-45. Honolulu: University of Hawai'i Press.
dc.identifier.isbn978-0-9856211-0-0
dc.identifier.urihttp://hdl.handle.net/10125/4514
dc.publisherUniversity of Hawai'i Press
dc.relation.ispartofseriesLD&C Special Publication
dc.rightsCreative Commons Attribution Non-Commercial Share Alike License
dc.titleA corpus linguistics perspective on language documentation, data, and challenge of small corpora
prism.endingpage45
prism.startingpage39

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
05luedeling.pdf
Size:
74.46 KB
Format:
Adobe Portable Document Format