
Developing Linguistic Corpora
A Guide to Good Practice
Martin Wynne(Editor)
Oxbow Books (Publisher)
Published on 16. September 2005
Book
Paperback/Softback
96 pages
978-1-84217-205-6 (ISBN)
Description
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of
corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
More details
Series
Language
English
Place of publication
Oxford
United Kingdom
Target group
College/higher education
Professional and scholarly
Dimensions
Height: 242 mm
Width: 170 mm
ISBN-13
978-1-84217-205-6 (9781842172056)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Person
edited by Martin Wynne
Content
Preface; 1. Corpus and Text: basic principles (John Sinclair); 2. Adding Linguistic Annotation (Geoffrey Leech); 3. Metadata for Corpus Work (Lou Burnard); 4. Character encoding in corpus construction (Anthony McEnery and ZhonghuaXiao); 5. Spoken Language Corpora (Paul Thompson); 6. Multilingual Corpora (Pernilla Danielsson and Wolfgang Teubert); 7. Archiving and Preservation issues (Martin Wynne); Bibliography;Index.