Linguistic research increasingly relies on large electronic corpora for its primary data. While off-the-shelf programs can perform a set of standard searches, specialized questions usually require a custom-written program to find their answers. "Essential Python for Corpus Linguistics" uses the programming language Python to explain how to write simple programs that extract linguistically useful information, such as the frequency of a given utterance in a particular context within a corpus, or instances of certain phrasal structures in a Treebank. Assuming no prior programming background, the book provides numerous example programs that search for phonological, morphological and syntactic constructions in corpora, and the associated web site provides sample data and programs, which make it easy to start working independently. This book is a valuable resource for linguists who use corpus methods but have no programming training.
Sprache
Verlagsort
Verlagsgruppe
Zielgruppe
Für höhere Schule und Studium
Für Beruf und Forschung
ISBN-13
978-1-4051-4564-0 (9781405145640)
Copyright in bibliographic data is held by Nielsen Book Services Limited or its licensors: all rights reserved.
Schweitzer Klassifikation
I. Introduction II. Getting Started Downloading and Installing Python Checking that it Works Using Python as a Calculator String Lists Variables and Assignment Statements Counting Lines and Words Exercises III. More on Sequences Tuples Operations on Sequences List Operations String Operations IV. Tests and Control Equality comparison and membership tests The IF statement The FOR statement Example programs Exercises V. Dictionaries and Functions Dictionaries Defining Functions Example programs Exercises VI. Additional Python Topics Modules Custom Sorting List Comprehensions String formatting or interpolation File input and output Unicode Example Programs Exercises VII. Regular Expressions Applications of Regular Expressions Using regular expressions in Python Using regular expression objects Example: Reading CHAT files Homework VIII. Reading XML files Using the DOM to search a treebank Using the CHILDES XML data