
Statistical Methods for Imbalanced Data in Ecological and Biological Studies
Springer (Publisher)
Published on 15. July 2019
Book
Paperback/Softback
VIII, 59 pages
978-4-431-55569-8 (ISBN)
Description
This book presents a fresh, new approach in that it provides a comprehensive recent review of challenging problems caused by imbalanced data in prediction and classification, and also in that it introduces several of the latest statistical methods of dealing with these problems. The book discusses the property of the imbalance of data from two points of view. The first is quantitative imbalance, meaning that the sample size in one population highly outnumbers that in another population. It includes presence-only data as an extreme case, where the presence of a species is confirmed, whereas the information on its absence is uncertain, which is especially common in ecology in predicting habitat distribution. The second is qualitative imbalance, meaning that the data distribution of one population can be well specified whereas that of the other one shows a highly heterogeneous property. A typical case is the existence of outliers commonly observed in gene expression data, and another is heterogeneous characteristics often observed in a case group in case-control studies. The extension of the logistic regression model, maxent, and AdaBoost for imbalanced data is discussed, providing a new framework for improvement of prediction, classification, and performance of variable selection. Weights functions introduced in the methods play an important role in alleviating the imbalance of data. This book also furnishes a new perspective on these problem and shows some applications of the recently developed statistical methods to real data sets.
More details
Series
Edition
2019 ed.
Language
English
Place of publication
Tokyo
Japan
Target group
Professional and scholarly
Research
Illustrations
15 s/w Abbildungen, 7 farbige Abbildungen
VIII, 59 p. 22 illus., 7 illus. in color.
Dimensions
Height: 235 mm
Width: 155 mm
Thickness: 5 mm
Weight
119 gr
ISBN-13
978-4-431-55569-8 (9784431555698)
DOI
10.1007/978-4-431-55570-4
Schweitzer Classification
Other editions
Additional editions

Osamu Komori | Shinto Eguchi
Statistical Methods for Imbalanced Data in Ecological and Biological Studies
E-Book
07/2019
1st Edition
Springer
€53.49
Available for download
Persons
Osamu Komori, The Institute of Statistical Mathematics,
Shinto Eguchi, The Institute of Statistical Mathematics
Shinto Eguchi, The Institute of Statistical Mathematics
Content
1. Imbalance Data.- 2. Weighted Logistic Regression.- 3. Beta-Maxent.- 4. Generalized-t Statistic.- 5. Machine Learning Methods for Imbalance Data.