Linguistic Annotation in/for Corpus Linguistics

Date
2017
Authors
Gries, Stefan Th.
Berez, Andrea L.
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Science
Volume
Number/Issue
Starting Page
Ending Page
Alternative Title
Abstract
This article surveys linguistic annotation in corpora and corpus linguistics. We first define the concept of ‘corpus’ as a radial category and then, in Sect.2, discuss a variety of kinds of information for which corpora are annotated and that are exploited in contemporary corpus linguistics. Section3 then exemplifies many current formats of annotation with an eye to highlighting both the diversity of formats currently available and the emergence of XML annotation as, for now, the most widespread form of annotation. Section4 summarizes and concludes with desiderata for future developments.
Description
This article surveys linguistic annotation in corpora and corpus linguistics. We first define the concept of ‘corpus’ as a radial category and then, in Sect.2, discuss a variety of kinds of information for which corpora are annotated and that are exploited in contemporary corpus linguistics. Section3 then exemplifies many current formats of annotation with an eye to highlighting both the diversity of formats currently available and the emergence of XML annotation as, for now, the most widespread form of annotation. Section4 summarizes and concludes with desiderata for future developments.
Keywords
Linguistics
Citation
Gries, Stefan Th. & Andrea L. Berez. 2017. Linguistic annotation in/for corpus linguistics. In Nancy Ide & James Pustejovsky (eds.), Handbook of Linguistic Annotation, 379-408. Dordrecht: Springer Science.
Extent
31 pages
Format
Geographic Location
Time Period
Related To
Rights
Rights Holder
Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.