M-statistics

Name: M-statistics | Optimal Statistical Inference for a Small Sample
Brand: Wiley
Price: 100.99 EUR
Availability: OnlineOnly

Optimal Statistical Inference for a Small Sample

Eugene Demidenko(Autor*in)

Wiley (Verlag)

1. Auflage

Erschienen am 1. August 2023

240 Seiten

E-Book

ePUB mit Adobe-DRM

Systemvoraussetzungen

978-1-119-89181-9 (ISBN)

100,99 €inkl. 7% MwSt.

Systemvoraussetzungen

für ePUB mit Adobe-DRM

E-Book Einzellizenz

Als Download verfügbar

Beschreibung

M-STATISTICS

A comprehensive resource providing new statistical methodologies and demonstrating how new approaches work for applications

M-statistics introduces a new approach to statistical inference, redesigning the fundamentals of statistics, and improving on the classical methods we already use. This book targets exact optimal statistical inference for a small sample under one methodological umbrella. Two competing approaches are offered: maximum concentration (MC) and mode (MO) statistics combined under one methodological umbrella, which is why the symbolic equation M=MC+MO. M-statistics defines an estimator as the limit point of the MC or MO exact optimal confidence interval when the confidence level approaches zero, the MC and MO estimator, respectively. Neither mean nor variance plays a role in M-statistics theory.

Novel statistical methodologies in the form of double-sided unbiased and short confidence intervals and tests apply to major statistical parameters:

* Exact statistical inference for small sample sizes is illustrated with effect size and coefficient of variation, the rate parameter of the Pareto distribution, two-sample statistical inference for normal variance, and the rate of exponential distributions.

* M-statistics is illustrated with discrete, binomial, and Poisson distributions. Novel estimators eliminate paradoxes with the classic unbiased estimators when the outcome is zero.

* Exact optimal statistical inference applies to correlation analysis including Pearson correlation, squared correlation coefficient, and coefficient of determination. New MC and MO estimators along with optimal statistical tests, accompanied by respective power functions, are developed.

* M-statistics is extended to the multidimensional parameter and illustrated with the simultaneous statistical inference for the mean and standard deviation, shape parameters of the beta distribution, the two-sample binomial distribution, and finally, nonlinear regression.

Our new developments are accompanied by respective algorithms and R codes, available at GitHub, and as such readily available for applications.

M-statistics is suitable for professionals and students alike. It is highly useful for theoretical statisticians and teachers, researchers, and data science analysts as an alternative to classical and approximate statistical inference.

Weitere Details

Weitere Ausgaben

Person

Inhalt

Cover
Title Page
Copyright
Contents
Preface
Chapter 1 Limitations of classic statistics and motivation
1.1 Limitations of classic statistics
1.1.1 Mean
1.1.2 Unbiasedness
1.1.3 Limitations of equal-tail statistical inference
1.2 The rationale for a new statistical theory
1.3 Motivating example: normal variance
1.3.1 Confidence interval for the normal variance
1.3.2 Hypothesis testing for the variance
1.3.3 MC and MO estimators of the variance
1.3.4 Sample size determination for variance
1.4 Neyman-Pearson lemma and its extensions
1.4.1 Introduction
1.4.2 Two lemmas
References
Chapter 2 Maximum concentration statistics
2.1 Assumptions
2.2 Short confidence interval and MC estimator
2.3 Density level test
2.4 Efficiency and the sufficient statistic
2.5 Parameter is positive or belongs to a finite interval
2.5.1 Parameter is positive
2.5.2 Parameter belongs to a finite interval
References
Chapter 3 Mode statistics
3.1 Unbiased test
3.2 Unbiased CI and MO estimator
3.3 Cumulative information and the sufficient statistic
References
Chapter 4 P-value and duality
4.1 P-value for the double-sided hypothesis
4.1.1 General definition
4.1.2 P-value for normal variance
4.2 The overall powerful test
4.3 Duality: converting the CI into a hypothesis test
4.4 Bypassing assumptions
4.5 Overview
References
Chapter 5 M-statistics for major statistical parameters
5.1 Exact statistical inference for standard deviation
5.1.1 MC-statistics
5.1.2 MC-statistics on the log scale
5.1.3 MO-statistics
5.1.4 Computation of the p-value
5.2 Pareto distribution
5.2.1 Confidence intervals
5.2.2 Hypothesis testing
5.3 Coefficient of variation for lognormal distribution
5.4 Statistical testing for two variances
5.4.1 Computation of the p-value
5.4.2 Optimal sample size
5.5 Inference for two-sample exponential distribution
5.5.1 Unbiased statistical test
5.5.2 Confidence intervals
5.5.3 The MC estimator of ?
5.6 Effect size and coefficient of variation
5.6.1 Effect size
5.6.2 Coefficient of variation
5.6.3 Double-sided hypothesis tests
5.6.4 Multivariate ES
5.7 Binomial probability
5.7.1 The MCL estimator
5.7.2 The MCL2 estimator
5.7.3 The MCL2 estimator of pn
5.7.4 Confidence interval on the double-log scale
5.7.5 Equal-tail and unbiased tests
5.8 Poisson rate
5.8.1 Two-sided short CI on the log scale
5.8.2 Two-sided tests and p-value
5.8.3 The MCL estimator of the rate parameter
5.9 Meta-analysis model
5.9.1 CI and MCL estimator
5.10 M-statistics for the correlation coefficient
5.10.1 MC and MO estimators
5.10.2 Equal-tail and unbiased tests
5.10.3 Power function and p-value
5.10.4 Confidence intervals
5.11 The square multiple correlation coefficient
5.11.1 Unbiased statistical test
5.11.2 Computation of p-value
5.11.3 Confidence intervals
5.11.4 The two-sided CI on the log scale
5.11.5 The MCL estimator
5.12 Coefficient of determination for linear model
5.12.1 CoD and multiple correlation coefficient
5.12.2 Unbiased test
5.12.3 The MCL estimator for CoD
References
Chapter 6 Multidimensional parameter
6.1 Density level test
6.2 Unbiased test
6.3 Confidence region dual to the DL test
6.4 Unbiased confidence region
6.5 Simultaneous inference for normal mean and standard deviation
6.5.1 Statistical test
6.5.2 Confidence region
6.6 Exact confidence inference for parameters of the beta distribution
6.6.1 Statistical tests
6.6.2 Confidence regions
6.7 Two-sample binomial probability
6.7.1 Hypothesis testing
6.7.2 Confidence region
6.8 Exact and profile statistical inference for nonlinear regression
6.8.1 Statistical inference for the whole parameter
6.8.2 Statistical inference for an individual parameter of interest via profiling
References
Index
EULA

Chapter 1
Limitations of classic statistics and motivation

In this chapter, we discuss the limitations of classic statistics that build on the concepts of the mean and variance. We argue that the mean and variance are appropriate measures of the center and the scatter of symmetric distributions. Many distributions we deal with are asymmetric, including distributions of positive data. The mean not only has a weak practical appeal but also may create theoretical trouble in the form of unbiased estimation - the existence of an unbiased estimator is more an exception than the rule.

Optimal statistical inference for normal variance in the form of minimum length or unbiased CI was developed more than 50 years ago and has been forgotten. This example serves as a motivation for our theory. Many central concepts, such as unbiased tests, mode, and maximum concentration estimators for normal variance serve as prototypes for the general theory to be deployed in subsequent chapters.

The Neyman-Pearson lemma is a fundamental statistical result that proves maximum power among all tests with fixed type I error. In this chapter, we prove two results, as an extension of this lemma, to be later used for demonstrating some optimal properties of M-statistics such as the superiority of the sufficient statistic and minimum volume of the density level test.

1.1 Limitations of classic statistics

1.1.1 Mean

A long time ago, several prominent statisticians pointed out to limitations of the mean as a measure of central tendency or short center (Deming 1964; Tukey 1977). Starting from introductory statistics textbooks the mean is often criticized because it is not robust to outliers. We argue that the mean's limitations are conceptually serious compared to other centers, the median and the mode.

For example, when characterizing the distribution of English letters the mean is not applicable, but the mode is "e.". Occasionally, statistics textbooks discuss the difference between mean, mode, and median from the application standpoint. For example, consider the distribution of house prices on the real estate market in a particular town. For a town clerk, the most appropriate measure of the center is the mean because the total property taxes received by the town are proportional to the sum of house values and therefore the mean. For a potential buyer, who compares prices between small nearby towns, the most appropriate center is the mode as the typical house price. A person who can afford a house at the median price knows that they can afford 50% of the houses on the market.

Remarkably, modern statistical packages, like R, compute the mean and median as mean(Y) and median(Y), but not the mode, although it requires just two lines of code

where Y is the array of data. The centerpiece of the mode computation is the density function, which by default assumes the Gaussian kernel and the bandwidth computed by Silverman's "rule of thumb" (1986).

Consider another example of reporting the summary statistics for U.S. hourly wages (the data are obtained from the Bureau of Labor Statistics at https://www.bls.gov/mwe). Figure 1.1 depicts the Gaussian kernel density of hourly wages for 234,986 employees. The mean is almost twice as large as the mode because of the heavy right tail. What center should be used when reporting the average wage? The answer depends on how the center is used. The mean may be informative to the U.S. government because the sum of wages is proportional to consumer buying power and collected income taxes. The median has a clear interpretation: 50% of workers earn less than $17.10 per hour. The mode offers a better interpretation of the individual level as the typical wage - the point of maximum concentration of wages. In parentheses, we report the proportion of workers who earn $1 within each center. The mode has maximum data concentration probability - that is why we call the mode typical value. The mean ($20.40) may be happily reported by the government, but $11.50 is what people typically earn.

Figure 1.1: The Gaussian kernel density for a sample of 234,986 hourly wages in the country. The percent value in the parentheses estimates the probability that the wage is within $1 of the respective center.

Mean is a convenient quantity for computers, but humans never count and sum - they judge and compare samples based on the typical value.

Figure 1.2 illustrates this statement. It depicts a NASA comet image downloaded from https://solarviews.com/cap/comet/cometneat.htm. The bull's-eye of the comet is the mode where the concentration of masses is maximum. Mean does not have a particular interpretation.

Mean is for computers, and mode is for people. People immediately identify the mode as the maximum concentration of the distribution, but we never sum the data in our head and divide it by the number of points - this is what computers do. This picture points out the heart of this book: the mean is easy to compute because it requires arithmetic operations suitable to computers. The mode requires more sophisticated techniques such as density estimation - unavailable at the time when statistics was born. Estimation of the mode is absent even in comprehensive modern statistics books. The time has come to reconsider and rewrite statistical theory.

Figure 1.2: Image of comet C/2001 Q4 (NEAT) taken at the WIYN 0.9-meter telescope at Kitt Peak National Observatory near Tucson, Arizona, on May 7, 2004. Source: NASA.

1.1.2 Unbiasedness

The mean dominates not only statistical applications but also statistical theory in the form of an unbiased estimator. Finding a new unbiased estimator is regarded as one of the most rewarding works of a statistician. However, unbiasedness has serious limitations:

The existence of unbiased estimator is an exception, not the rule. Chen (2003) writes ". the condition on unbiasedness is generally a strong one." Unbiased estimators mostly exist in the framework of linear statistical models and yet classic statistical inference like the Cramér-Rao lower bound or Lehmann-Sheffe theorem relies on unbiasedness (Rao 1973; Zacks 1971; Casella and Berger 1990; Cox and Hinkley 1996; Lehmann and Casella 1998; Bickel and Docksom 2001; Lehmann and Romano 2005). Unbiased estimators do not exist for simple nonlinear quantities such as the coefficient of variation or the ratio of regression coefficients (Fieller 1932).
Unbiasedness is not invariant to nonlinear transformation. For example, the square root of the sample variance is a positively biased estimator of the standard deviation. If an unbiased estimator exists for a parameter, it rarely exists for its reciprocal.
An unbiased estimator of a positive parameter may take a negative value, especially with small degrees of freedom. The most notorious examples are the unbiased estimation of variance components and the squared correlation coefficient. For example, as shown by Ghosh (1996), the unbiased nonnegative estimator of the variance component does not exist.
Variance and mean square error (MSE), as established criteria of statistical efficiency, suffer from the same illness: they are appropriate measures of scattering for symmetric distributions and may not exist even in simple statistical problems.

Note that while we criticize the unbiased estimators, there is nothing wrong with unbiased statistical tests and CIs - although the same term unbiasedness is used, these concepts are not related. Our theory embraces unbiased tests and CIs and derives the mode (MO) and maximum concentration (MC) estimator as the limit point of the unbiased and minimum length CI, respectively, when the coverage probability approaches zero.

1.1.3 Limitations of equal-tail statistical inference

Classic statistics uses the equal-tail approach for statistical hypothesis testing and CIs. This approach works for symmetric distributions or large sample sizes. It was convenient in the pre-computer era when tables at the end of statistics textbooks were used. The unequal approach, embraced in the present work, requires computer algorithms and implies optimal statistical inference for any sample size. Why use a suboptimal equal-tail approach when a better one exists? True, for a fairly large sample size, the difference is negligible but when the number of observations is small say, from 5 to 10 we may gain up to 20% improvement measured as the length of the CI or the power of the test.

1.2 The rationale for a new statistical theory

The classic statistical inference was developed almost 100 years ago. It tends to offer simple procedures, often relying on the precomputed table of distributions printed at the end of books. This explains why until now equal-tail tests and CIs have been widely used even though for asymmetric distributions the respective inference is suboptimal. Certainly, for a moderate sample size, the difference is usually negligible but when the sample size is small the difference can be considerable. Classic equal-tail statistical inference is outdated. Yes, unequal-tail inferences do not have a closed-form solution, but this should not serve as an excuse for using suboptimal inference....

Systemvoraussetzungen

Als PDF speichern Als Link merken

M-statistics

Beschreibung

Weitere Details

Weitere Ausgaben

Person

Inhalt

Chapter 1 Limitations of classic statistics and motivation

1.1 Limitations of classic statistics

1.1.1 Mean

1.1.2 Unbiasedness

1.1.3 Limitations of equal-tail statistical inference

1.2 The rationale for a new statistical theory

Systemvoraussetzungen

Chapter 1
Limitations of classic statistics and motivation