Corpus-Based Research into Language
In honour of Jan Aarts
Rodopi (Publisher)
Published on 1. January 1994
Book
Paperback/Softback
286 pages
978-90-5183-588-5 (ISBN)
Description
For over two decades Jan Aarts has been actively involved in corpus linguistic research. He was the instigator of a large number of projects, and he was responsible for what has become known as the Nijmegen approach to corpus linguistics. It is thanks to him that words like TOSCA and LDB have become household names in the corpus linguistic community.
The present volume has been collected in his honour. The contributions in it cover a wide range of topics in the field of corpus linguistic research, especially those in which Jan Aarts takes a keen interest: corpus encoding and tagging, parsing and databases, and the linguistic exploration of corpus data. The contributions in this volume discuss work done in this field outside Nijmegen, for the obvious reason that we do not wish to present him with a report on work in which he is himself involved.
The present volume has been collected in his honour. The contributions in it cover a wide range of topics in the field of corpus linguistic research, especially those in which Jan Aarts takes a keen interest: corpus encoding and tagging, parsing and databases, and the linguistic exploration of corpus data. The contributions in this volume discuss work done in this field outside Nijmegen, for the obvious reason that we do not wish to present him with a report on work in which he is himself involved.
More details
Series
Language
English
Place of publication
Leiden
Netherlands
Publishing group
Brill
Target group
Professional and scholarly
Dimensions
Height: 220 mm
Width: 150 mm
ISBN-13
978-90-5183-588-5 (9789051835885)
Copyright in bibliographic data is held by Nielsen Book Services Limited or its licensors: all rights reserved.
Schweitzer Classification
Content
Flor AARTS: A tribute to Jan Aarts. Nelleke OOSTDIJK and Pieter de HAAN: Introduction. PART I: THE ENCODING AND TAGGING OF CORPORA. Stig JOHANSSON: Continuity and change in the encoding of computer corpora. Sidney GREENBAUM and Ni YIBIN: Tagging the British ICE Corpus: English word classes. Geoffrey LEECH, Roger GARSIDE, and Michael BRYANT: The large-scale grammatical tagging of text: Experience with the British National Corpus. Willem MEIJS: Computerized lexicons and theoretical models. Louise GUTHRIE, Joe GUTHRIE, and Jim COWIE: Resolving lexical ambiguity. PART II: PARSING AND DATABASES. Ted BRISCOE: Prospects for practical parsing of unrestricted text: Robust statistical parsing techniques. Fred KARLSSON: Robust parsing of unconstrained text. Clive SOUTER and Eric ATWELL: Using parsed corpora: A review of current practice. Ezra BLACK: An experiment in customizing the Lancaster Treebank. Geoffrey SAMPSON: SUSANNE: A Domesday Book of English grammar. William GALE and Kenneth CHURCH: What is wrong with adding one? PART III: LINGUISTIC EXPLORATION OF THE DATA. Douglas BIBER and Edward FINEGAN: Intra-textual variation within medical research articles. Bengt ALTENBERG: On the functions of such in spoken and written English. Anna-Brita STENSTROEM and Jan SVARTVIK: Imparsable speech: Repeats and other nonfluencies in spoken English. References. List of contributors.