
Designing and Evaluating Language Corpora
A Practical Framework for Corpus Representativeness
Cambridge University Press
Published on 14. April 2022
Book
Hardback
300 pages
978-1-107-15138-3 (ISBN)
Description
Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' - highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation.
Reviews / Votes
'A valuable guide for corpora users and designers, a must-read before beginning the process of corpora selection and design.' Ana Abigahil Flores Hernandez and Pauline Moore, Tertium Linguistic JournalMore details
Language
English
Place of publication
Cambridge
United Kingdom
Dimensions
Height: 235 mm
Width: 157 mm
Thickness: 21 mm
Weight
588 gr
ISBN-13
978-1-107-15138-3 (9781107151383)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

Jesse Egbert
Designing and Evaluating Language Corpora
A Practical Framework for Corpus Representativeness
E-Book
04/2022
Cambridge University Press
€28.99
Available for download

Jesse Egbert | Douglas Biber | Bethany Gray
Designing and Evaluating Language Corpora
A Practical Framework for Corpus Representativeness
Book
04/2022
Cambridge University Press
€49.10
Shipment within 15-20 days

Jesse Egbert | Douglas Biber | Bethany Gray
Designing and Evaluating Language Corpora
A Practical Framework for Corpus Representativeness
E-Book
03/2022
Cambridge University Press
€28.99
Available for download
Persons
Jesse Egbert is Associate Professor of Applied Linguistics at Northern Arizona University. He is a co-founding General Editor of Register Studies, and his recent books focus on online register variation (2018), methodogical triangulation (2016, 2020), and corpus linguistics methods (2020).
Author
Northern Arizona University
Northern Arizona University
Iowa State University
Content
1. Introduction; 2. Approaches to representativeness in previous corpus linguistic research; 3. Corpus representativeness: a conceptual and methodological framework; 4. Domain considerations; 5. Distribution considerations; 6. The influence of domain and distribution considerations on corpus representativeness - bringing it all together; 7. Corpus design and representativeness in practice; Glossary; Appendix A. Example articles documenting existing corpora; Appendix B. Survey of corpus design and compilation practices.