Fundamental Statistical Inference

Name: Fundamental Statistical Inference | A Computational Approach
Brand: Wiley
Price: 109.99 EUR
Availability: OnlineOnly

A Computational Approach

Marc S. Paolella(Author)

Wiley (Publisher)

1st Edition

Published on 19. June 2018

584 pages

E-Book

ePUB with Adobe-DRM

System requirements

978-1-119-41788-0 (ISBN)

€109.99incl. 7% vat

System requirements

for ePUB with Adobe-DRM

E-Book Single Licence

Available for download

Description

Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.

Alles über E-Books, Kopierschutz & Dateiformate finden Sie in unserem Info- & Hilfebereich.

A hands-on approach to statistical inference that addresses the latest developments in this ever-growing field This clear and accessible book for beginning graduate students offers a practical and detailed approach to the field of statistical inference, providing complete derivations of results, discussions, and MATLAB programs for computation. It emphasizes details of the relevance of the material, intuition, and discussions with a view towards very modern statistical inference. In addition to classic subjects associated with mathematical statistics, topics include an intuitive presentation of the (single and double) bootstrap for confidence interval calculations, shrinkage estimation, tail (maximal moment) estimation, and a variety of methods of point estimation besides maximum likelihood, including use of characteristic functions, and indirect inference. Practical examples of all methods are given. Estimation issues associated with the discrete mixtures of normal distribution, and their solutions, are developed in detail. Much emphasis throughout is on non-Gaussian distributions, including details on working with the stable Paretian distribution and fast calculation of the noncentral Student's t. An entire chapter is dedicated to optimization, including development of Hessian-based methods, as well as heuristic/genetic algorithms that do not require continuity, with MATLAB codes provided. The book includes both theory and nontechnical discussions, along with a substantial reference to the literature, with an emphasis on alternative, more modern approaches. The recent literature on the misuse of hypothesis testing and p-values for model selection is discussed, and emphasis is given to alternative model selection methods, though hypothesis testing of distributional assumptions is covered in detail, notably for the normal distribution. Presented in three parts Essential Concepts in Statistics; Further Fundamental Concepts in Statistics; and Additional Topics Fundamental Statistical Inference: A Computational Approach offers comprehensive chapters on: Introducing Point and Interval Estimation; Goodness of Fit and Hypothesis Testing; Likelihood; Numerical Optimization; Methods of Point Estimation; Q-Q Plots and Distribution Testing; Unbiased Point Estimation and Bias Reduction; Analytic Interval Estimation; Inference in a Heavy-Tailed Context; The Method of Indirect Inference; and, as an appendix, A Review of Fundamental Concepts in Probability Theory, the latter to keep the book self-contained, and giving material on some advanced subjects such as saddlepoint approximations, expected shortfall in finance, calculation with the stable Paretian distribution, and convergence theorems and proofs.

More details

Other editions

Person

Content

Preface xi

PART I ESSENTIAL CONCEPTS IN STATISTICS

1 Introducing Point and Interval Estimation 3

1.1 Point Estimation / 4

1.1.1 Bernoulli Model / 4

1.1.2 Geometric Model / 6

1.1.3 Some Remarks on Bias and Consistency / 11

1.2 Interval Estimation via Simulation / 12

1.3 Interval Estimation via the Bootstrap / 18

1.3.1 Computation and Comparison with Parametric Bootstrap / 18

1.3.2 Application to Bernoulli Model and Modification / 20

1.3.3 Double Bootstrap / 24

1.3.4 Double Bootstrap with Analytic Inner Loop / 26

1.4 Bootstrap Confidence Intervals in the Geometric Model / 31

1.5 Problems / 35

2 Goodness of Fit and Hypothesis Testing 37

2.1 Empirical Cumulative Distribution Function / 38

2.1.1 The Glivenko-Cantelli Theorem / 38

2.1.2 Proofs of the Glivenko-Cantelli Theorem / 41

2.1.3 Example with Continuous Data and Approximate Confidence Intervals / 45

2.1.4 Example with Discrete Data and Approximate Confidence Intervals / 49

2.2 Comparing Parametric and Nonparametric Methods / 52

2.3 Kolmogorov-Smirnov Distance and Hypothesis Testing / 57

2.3.1 The Kolmogorov-Smirnov and Anderson-Darling Statistics / 57

2.3.2 Significance and Hypothesis Testing / 59

2.3.3 Small-Sample Correction / 63

2.4 Testing Normality with KD and AD / 65

2.5 Testing Normality with W² and U² / 68

2.6 Testing the Stable Paretian Distributional Assumption: First Attempt / 69

2.7 Two-Sample Kolmogorov Test / 73

2.8 More on (Moron?) Hypothesis Testing / 74

2.8.1 Explanation / 75

2.8.2 Misuse of Hypothesis Testing / 77

2.8.3 Use and Misuse of p-Values / 79

2.9 Problems / 82

3 Likelihood 85

3.1 Introduction / 85

3.1.1 Scalar Parameter Case / 87

3.1.2 Vector Parameter Case / 92

3.1.3 Robustness and the MCD Estimator / 100

3.1.4 Asymptotic Properties of the Maximum Likelihood Estimator / 102

3.2 Cramér-Rao Lower Bound / 107

3.2.1 Univariate Case / 108

3.2.2 Multivariate Case / 111

3.3 Model Selection / 114

3.3.1 Model Misspecification / 114

3.3.2 The Likelihood Ratio Statistic / 117

3.3.3 Use of Information Criteria / 119

3.4 Problems / 120

4 Numerical Optimization 123

4.1 Root Finding / 123

4.1.1 One Parameter / 124

4.1.2 Several Parameters / 131

4.2 Approximating the Distribution of the Maximum Likelihood Estimator / 135

4.3 General Numerical Likelihood Maximization / 136

4.3.1 Newton-Raphson and Quasi-Newton Methods / 137

4.3.2 Imposing Parameter Restrictions / 140

4.4 Evolutionary Algorithms / 145

4.4.1 Differential Evolution / 146

4.4.2 Covariance Matrix Adaption Evolutionary Strategy / 149

4.5 Problems / 155

5 Methods of Point Estimation 157

5.1 Univariate Mixed Normal Distribution / 157

5.1.1 Introduction / 157

5.1.2 Simulation of Univariate Mixtures / 160

5.1.3 Direct Likelihood Maximization / 161

5.1.4 Use of the EM Algorithm / 169

5.1.5 Shrinkage-Type Estimation / 174

5.1.6 Quasi-Bayesian Estimation / 176

5.1.7 Confidence Intervals / 178

5.2 Alternative Point Estimation Methodologies / 184

5.2.1 Method of Moments Estimator / 185

5.2.2 Use of Goodness-of-Fit Measures / 190

5.2.3 Quantile Least Squares / 191

5.2.4 Pearson Minimum Chi-Square / 193

5.2.5 Empirical Moment Generating Function Estimator / 195

5.2.6 Empirical Characteristic Function Estimator / 198

5.3 Comparison of Methods / 199

5.4 A Primer on Shrinkage Estimation / 200

5.5 Problems / 202

PART II FURTHER FUNDAMENTAL CONCEPTS IN STATISTICS

6 Q-Q Plots and Distribution Testing 209

6.1 P-P Plots and Q-Q Plots / 209

6.2 Null Bands / 211

6.2.1 Definition and Motivation / 211

6.2.2 Pointwise Null Bands via Simulation / 212

6.2.3 Asymptotic Approximation of Pointwise Null Bands / 213

6.2.4 Mapping Pointwise and Simultaneous Significance Levels / 215

6.3 Q-Q Test / 217

6.4 Further P-P and Q-Q Type Plots / 219

6.4.1 (Horizontal) Stabilized P-P Plots / 219

6.4.2 Modified S-P Plots / 220

6.4.3 MSP Test for Normality / 224

6.4.4 Modified Percentile (Fowlkes-MP) Plots / 228

6.5 Further Tests for Composite Normality / 231

6.5.1 Motivation / 232

6.5.2 Jarque-Bera Test / 234

6.5.3 Three Powerful (and More Recent) Normality Tests / 237

6.5.4 Testing Goodness of Fit via Binning: Pearson's X_P² Test / 240

6.6 Combining Tests and Power Envelopes / 247

6.6.1 Combining Tests / 248

6.6.2 Power Comparisons for Testing Composite Normality / 252

6.6.3 Most Powerful Tests and Power Envelopes / 252

6.7 Details of a Failed Attempt / 255

6.8 Problems / 260

7 Unbiased Point Estimation and Bias Reduction 269

7.1 Sufficiency / 269

7.1.1 Introduction / 269

7.1.2 Factorization / 272

7.1.3 Minimal Sufficiency / 276

7.1.4 The Rao-Blackwell Theorem / 283

7.2 Completeness and the Uniformly Minimum Variance Unbiased Estimator / 286

7.3 An Example with i.i.d. Geometric Data / 289

7.4 Methods of Bias Reduction / 293

7.4.1 The Bias-Function Approach / 293

7.4.2 Median-Unbiased Estimation / 296

7.4.3 Mode-Adjusted Estimator / 297

7.4.4 The Jackknife / 302

7.5 Problems / 305

8 Analytic Interval Estimation 313

8.1 Definitions / 313

8.2 Pivotal Method / 315

8.2.1 Exact Pivots / 315

8.2.2 Asymptotic Pivots / 318

8.3 Intervals Associated with Normal Samples / 319

8.3.1 Single Sample / 319

8.3.2 Paired Sample / 320

8.3.3 Two Independent Samples / 322

8.3.4 Welch's Method for ¿¿¿¿₁ - ¿¿¿¿₂ when ¿¿¿¿₁² ¿ ¿¿¿¿₂²/ 323

8.3.5 Satterthwaite's Approximation / 324

8.4 Cumulative Distribution Function Inversion / 326

8.4.1 Continuous Case / 326

8.4.2 Discrete Case / 330

8.5 Application of the Nonparametric Bootstrap / 334

8.6 Problems / 337

PART III ADDITIONAL TOPICS

9 Inference in a Heavy-Tailed Context 341

9.1 Estimating the Maximally Existing Moment / 342

9.2 A Primer on Tail Estimation / 346

9.2.1 Introduction / 346

9.2.2 The Hill Estimator / 346

9.2.3 Use with Stable Paretian Data / 349

9.3 Noncentral Student's t Estimation / 351

9.3.1 Introduction / 351

9.3.2 Direct Density Approximation / 352

9.3.3 Quantile-Based Table Lookup Estimation / 353

9.3.4 Comparison of NCT Estimators / 354

9.4 Asymmetric Stable Paretian Estimation / 358

9.4.1 Introduction / 358

9.4.2 The Hint Estimator / 359

9.4.3 Maximum Likelihood Estimation / 360

9.4.4 The McCulloch Estimator / 361

9.4.5 The Empirical Characteristic Function Estimator / 364

9.4.6 Testing for Symmetry in the Stable Model / 366

9.5 Testing the Stable Paretian Distribution / 368

9.5.1 Test Based on the Empirical Characteristic Function / 368

9.5.2 Summability Test and Modification / 371

9.5.3 ALHADI: The ¿¿¿¿-Hat Discrepancy Test / 375

9.5.4 Joint Test Procedure / 383

9.5.5 Likelihood Ratio Tests / 384

9.5.6 Size and Power of the Symmetric Stable Tests / 385

9.5.7 Extension to Testing the Asymmetric Stable Paretian Case / 395

10 The Method of Indirect Inference 401

10.1 Introduction / 401

10.2 Application to the Laplace Distribution / 403

10.3 Application to Randomized Response / 403

10.3.1 Introduction / 403

10.3.2 Estimation via Indirect Inference / 406

10.4 Application to the Stable Paretian Distribution / 409

10.5 Problems / 416

A Review of Fundamental Concepts in Probability Theory 419

A.1 Combinatorics and Special Functions / 420

A.2 Basic Probability and Conditioning / 423

A.3 Univariate Random Variables / 424

A.4 Multivariate Random Variables / 427

A.5 Continuous Univariate Random Variables / 430

A.6 Conditional Random Variables / 432

A.7 Generating Functions and Inversion Formulas / 434

A.8 Value at Risk and Expected Shortfall / 437

A.9 Jacobian Transformations / 451

A.10 Sums and Other Functions / 453

A.11 Saddlepoint Approximations / 456

A.12 Order Statistics / 460

A.13 The Multivariate Normal Distribution / 462

A.14 Noncentral Distributions / 465

A.15 Inequalities and Convergence / 467

A.15.1 Inequalities for Random Variables / 467

A.15.2 Convergence of Sequences of Sets / 469

A.15.3 Convergence of Sequences of Random Variables / 473

A.16 The Stable Paretian Distribution / 483

A.17 Problems / 492

A.18 Solutions / 509

References 537

Index 561

Preface

Young people today love luxury. They have bad manners, despise authority, have no respect for older people, and chatter when they should be working.

(Socrates, 470-399 BC)

This book on statistical inference can be viewed as a continuation of the author's previous two books on probability theory (Paolella, 2006, 2007), hereafter referred to as Books I and II. Of those two, Book I (or any book at a comparable level) is more relevant, in establishing the basics of random variables and distributions as required to understand statistical methodology. Occasional use of material from Book II is made, though most of that required material is reviewed in the appendix herein in order to keep this volume as self-contained as possible. References to those books will be abbreviated as I and II, respectively. For example, Figure 5.1 in (Chapter 5 of) Paolella (2006) is referred to as Figure I.5.1; and similarly for equation references, where (I.5.22) and (II.4.3) refer to equations (5.22) and (4.3) in Paolella (2006) and Paolella (2007) respectively (and both are the Cauchy-Schwarz inequality).

Further prerequisites are the same as those for Book I, namely a solid command of basic undergraduate calculus and matrix algebra, and occasionally very rudimentary concepts from complex analysis, as required for working with characteristic functions. As with Books I and II, a solutions manual to the exercises is available.

Certainly, no measure theory is required, nor any previous exposure to statistical inference, though it would be useful to have had an introductory course in statistics or data analysis. The book is aimed at beginning master's students in statistics, though it is written to be fully accessible to master's students in the social sciences. In particular, I have in mind students in economics and finance, as I provide introductory coverage of some nonstandard topics, notably Chapter 9 on heavy-tailed distributions and tail estimation, and detailed coverage of the mixed normal distribution in Chapter 5.

Naturally, the book can be also used for undergraduates in a mathematics program. For the intended audience of master's students in statistics or the social sciences, the instructor is welcome to skip material that uses concepts from convergence and limit theorems if the target audience is not ready for such mathematics. This is one of the points of this book: such material is included so that, for example, accessible, detailed proofs of the Glivenko-Cantelli theorem and the limiting distribution of the maximum likelihood estimator can be demonstrated at a reasonably rigorous level. The vast majority of the book only requires simple algebra and basic calculus.

In this book, I stick to the independent, identically distributed (i.i.d.) setting, using it as a platform for introducing the major concepts arising in statistics without the additional overhead and complexities associated with, say, (generalized) linear models, survival analysis, copula methods, and time series. This also allows for more in-depth coverage of important topics such as bootstrap techniques, nonparametric inference via the empirical c.d.f., numerical optimization, discrete mixture models, bias-adjusted estimators, tail estimation (as a nice segue into the study of extreme value theory), and the method of indirect inference. A future project, referred to as Book IV, builds on the framework in the present volume and is dedicated to the linear model (regression and ANOVA) and, primarily, time series analysis (univariate ARMAX models), GARCH, and multivariate distributions for modeling and predicting financial asset returns.

Before discussing the contents of this volume, it is important to mention that, similar to Books I and II, the overriding goals are:

to emphasize the practical side of matters by addressing computation issues;
to motivate students to actively engage in the material by replicating and extending reported results, and to read the literature on topics of their interest;
to go beyond the standard topics and examples traditionally taught at this level, albeit still within the i.i.d. framework; and
to set the stage for students intending to pursue further courses in statistical/econometric inference (and quantitative risk management), as well as those embarking on careers as modern data analysts and applied quantitative researchers.

Regarding point (i), I explain to students that computer programming skills are necessary, but far from sufficient, to be successful in applied research. In an occasional lecture dedicated to programming issues, I emphasize (not sarcastically - I do not test computer skills) that it is fully optional, and those students who are truly mathematically talented can skip it, explaining that they will always have programmers in their team (in industry) or PhD students and co-authors (in academics) as resources to do the computer grunt work implementing their theoretical constructs. Oddly, nobody leaves the room.

With respect to point (ii), the reader will notice that some chapters have few (or no) exercises (some have many). This is because I believe the nature of the material presented is such that it offers the student a judicious platform for self experimentation, particularly with respect to numerical implementation. Some of the material could have been packaged as exercises (and much is), though I prefer to illustrate important concepts, distributions, and methods in a detailed way, along with code and graphics, instead of banishing it to the exercises (or, far worse, littering the exercises with trite, useless algebraic manipulations devoid of genuine application) and instead encourage the student to replicate, complement, and extend the presented material. The reader will no doubt tire at my occasional explicit suggestions for this ("The reader is encouraged."). One of my role model authors is Hamilton (1994), whose book has no exercises, is twice the size of this book, and has been praised as an outstanding presentation of time series. Hamilton clearly intended to teach the material in a straightforward, clear way, with highly detailed and accessible derivations. I aspire to a similar approach, as well as adding numeric illustrations and Matlab code.1

Regarding point (iii), besides the obvious benefit of giving students a more modern viewpoint on methods and applications in statistics, having a large variety of such is useful for students (and instructors) looking for interesting, relevant topics for master's theses. An example of a nonstandard topic of interest is in Chapter 5, giving a detailed discussion on the problems associated with, and solutions to, estimating the (univariate) discrete mixed normal, via a variety of non-m.l.e. methods (empirical m.g.f., c.f., quantile-based methods, etc.), and the use of the EM algorithm with shrinkage, with its immediate extension to the multivariate case. For the latter, I refer to recent work of mine using the minimum covariance determinant (MCD) for parameter estimation, this also serving as an example of (i) what can be done when, here, the multivariate normal mixture is surely misspecified, and (ii) use of a most likely inconsistent estimator (which outperforms the m.l.e. in terms of density forecasting and portfolio allocation for financial returns data).

Particularly with the less common topics developed in Part III of this book, the result is, like Books I and II, a substantially larger project than some similarly positioned books. It is thus essential to understand that not everything in the text is supposed to be (or could be) covered in the classroom, at least not in one semester. In my opinion, students (even in mathematics departments, but particularly those in the social sciences) benefit from having clearly laid out explanations, detailed proofs, illustrative examples, a variety of approaches, introductions to modern techniques, and discussions of important, possibly controversial topics (e.g., the irrelevance of consistent estimators in light of the notion that, in realistic settings, the model is wrong anyway, and changing through time or space; and the arguable superfluousness, if not danger, of the typical hypothesis testing framework), as well as topics that could initially be skipped in a first course, but returned to later or assigned as outside reading, depending on the interests and abilities of the students.

I wish to emphasize that this book is for teaching, as (obviously) opposed to being a research monograph, or (less obviously) a dry regurgitation of traditional concepts and examples. An anonymous reviewer of Book I, when I initially submitted it to the publisher Wiley, remarked "it's too much material: It seems the author has written a brain dump." While I like to think I have much more in my head than what was written in that book, he (his gender was indeed disclosed to me) apparently believes that students (let alone instructors) are incapable of assessing what material is core, and what can be deemed "extra," or suitable for reading after the main concepts are mastered. It is trivial to just skip some material, whereas not having it at all results in an admittedly shorter book (who cares, besides arguably the publisher?) that accomplishes far less, and might even give the student a false sense of understanding and competence (which will be painfully revealed in a quant job interview). Fortunately, not everyone agrees with him: Besides heart-warming student feedback over the years on Book I (from...

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

Fundamental Statistical Inference

Description

More details

Other editions

Additional editions

Person

Content

Preface

System requirements