Linguistic Databases explores the increasing use of databases in linguistics. The enormous potential in linguistic data - billions of utterances and messages daily - has been difficult to exploit. Many linguists have had to concentrate on introspective data with its inevitable blinders toward frequency, variation, and naturalness. Applications of linguistics have been handicapped. This volume explores the potential advantages of database applications to linguistics. Included in this volume are reports on database activities in phonetics, phonology, lexicography and syntax, comparative grammar, second-language acquisition, linguistic fieldwork, and language pathology. The book presents the specialized problems of multi-media (especially audio) and multi-lingual texts, including those in exotic writing systems. Implemented solutions are also discussed. The opportunities to use existing, minimally structured text repositories are presented.
1. Introduction John Nerbonne; 2. Test suites for natural language processing Stephen Oepen, Klaus Netter and Judith Klein; 3. From annotated corpora to databases: the SgmlQL Jacques Le Maitre, Elisabeth Murisasco, and Monique Rolbert; 4. Markup of a test suite with SGML Martin Volk; 5. An open systems approach for an acoustic-phonetic continuous speech database: the S. tools database-management systems (STDBMS) Werner A. Deutsch, Ralf Vollman, Anton Noll, and Sylvia Moosmüller; 6. The reading database of syllable structure Erik Fudge and Linda Shockey; 7. A database application for the generation of phonetic atlas maps Edgar Haimerl; 8. Swiss-French polyphone and polyvar: telephone speech databases to model inter- and intra-speaker variability Gerard Chollet, Jean-Luc Cochard, Andrei Constantinescu, Cedric Jaboulet, and Philippe Langlais; 9. Investigating argument structure: the Russian nominalization database Andrew Bredenkamp, Louisa Sadler, and Andrew Spencer; 10. The use of a psycholinguistic database in the simplification of text for aphasic readers Siobhan Devlin and John Tait; 11. The computer learner Corpus: a testbed for electronic EFL Tools Sylviane Granger; 12. Linking wordnet to a Corpus query system Oliver Christ; 13. Multilingual data processing in the Cellar environment Gary F. Simons and John V. Thomson.
Dewey Decimal Classfication (DDC)