Researchers in a number of disciplines deal with large text sets requiring both text management and text analysis. Faced with a large amount of textual data collected in marketing surveys, literary investigations, historical archives and documentary data bases, these researchers require assistance with organizing, describing and comparing texts.
Exploring Textual Data demonstrates how exploratory multivariate statistical methods such as correspondence analysis and cluster analysis can be used to help investigate, assimilate and evaluate textual data. The main text does not contain any strictly mathematical demonstrations, making it accessible to a large audience. This book is very user-friendly with proofs abstracted in the appendices. Full definitions of concepts, implementations of procedures and rules for reading and interpreting results are fully explored. A succession of examples is intended to allow the reader to appreciate the variety of actual and potential applications and the complementary processing methods. A glossary of terms is provided.
Rezensionen / Stimmen
`... this book should becomne an essential reference for any researcher interested in the quantitative analysis of textual data. I know of no other book that covers the same range of topics with this amount of detail. ... researchers from any field who use quantitative techniques to process texts should find much here that will be of use.'
Computational Linguistics, 25:1 (1999)
Reihe
Auflage
Softcover reprint of the original 1st ed. 1998
Sprache
Verlagsort
Zielgruppe
Für Beruf und Forschung
Research
Illustrationen
Maße
Höhe: 297 mm
Breite: 210 mm
Dicke: 15 mm
Gewicht
ISBN-13
978-90-481-4942-1 (9789048149421)
DOI
10.1007/978-94-017-1525-6
Schweitzer Klassifikation
1 Textual Statistics: Scope and Applications.- 2 The Units of Textual Statistics.- 3 Correspondence Analysis of Lexical Tables.- 4 Cluster Analysis of Words and Texts.- 5 Visualization of Textual Data.- 6 Characteristic Textual Units, Modal Responses and Modal Texts.- 7 Longitudinal Partitions, Textual Time Series.- 8 Textual Discriminant Analysis.- Appendix 1: Singular value decomposition and correspondence analysis.- Appendix 2: Clustering techniques.- Appendix 3: More details about the nonparametric estimation model.- Appendix 4: Search for repeated segments in a corpus.- References.- Author Index.- Symbols.