
Fundamental Statistical Inference
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
More details
Other editions
Additional editions


Person
Marc S. Paolella, PhD, is a Professor at the Department of Banking and Finance, University of Zurich. He is also the Editor of Econometrics and an Associate Editor of the Royal Statistical Society Journal Series A.
Content
Preface xi
PART I ESSENTIAL CONCEPTS IN STATISTICS
1 Introducing Point and Interval Estimation 3
1.1 Point Estimation / 4
1.1.1 Bernoulli Model / 4
1.1.2 Geometric Model / 6
1.1.3 Some Remarks on Bias and Consistency / 11
1.2 Interval Estimation via Simulation / 12
1.3 Interval Estimation via the Bootstrap / 18
1.3.1 Computation and Comparison with Parametric Bootstrap / 18
1.3.2 Application to Bernoulli Model and Modification / 20
1.3.3 Double Bootstrap / 24
1.3.4 Double Bootstrap with Analytic Inner Loop / 26
1.4 Bootstrap Confidence Intervals in the Geometric Model / 31
1.5 Problems / 35
2 Goodness of Fit and Hypothesis Testing 37
2.1 Empirical Cumulative Distribution Function / 38
2.1.1 The Glivenko-Cantelli Theorem / 38
2.1.2 Proofs of the Glivenko-Cantelli Theorem / 41
2.1.3 Example with Continuous Data and Approximate Confidence Intervals / 45
2.1.4 Example with Discrete Data and Approximate Confidence Intervals / 49
2.2 Comparing Parametric and Nonparametric Methods / 52
2.3 Kolmogorov-Smirnov Distance and Hypothesis Testing / 57
2.3.1 The Kolmogorov-Smirnov and Anderson-Darling Statistics / 57
2.3.2 Significance and Hypothesis Testing / 59
2.3.3 Small-Sample Correction / 63
2.4 Testing Normality with KD and AD / 65
2.5 Testing Normality with W2 and U2 / 68
2.6 Testing the Stable Paretian Distributional Assumption: First Attempt / 69
2.7 Two-Sample Kolmogorov Test / 73
2.8 More on (Moron?) Hypothesis Testing / 74
2.8.1 Explanation / 75
2.8.2 Misuse of Hypothesis Testing / 77
2.8.3 Use and Misuse of p-Values / 79
2.9 Problems / 82
3 Likelihood 85
3.1 Introduction / 85
3.1.1 Scalar Parameter Case / 87
3.1.2 Vector Parameter Case / 92
3.1.3 Robustness and the MCD Estimator / 100
3.1.4 Asymptotic Properties of the Maximum Likelihood Estimator / 102
3.2 Cramér-Rao Lower Bound / 107
3.2.1 Univariate Case / 108
3.2.2 Multivariate Case / 111
3.3 Model Selection / 114
3.3.1 Model Misspecification / 114
3.3.2 The Likelihood Ratio Statistic / 117
3.3.3 Use of Information Criteria / 119
3.4 Problems / 120
4 Numerical Optimization 123
4.1 Root Finding / 123
4.1.1 One Parameter / 124
4.1.2 Several Parameters / 131
4.2 Approximating the Distribution of the Maximum Likelihood Estimator / 135
4.3 General Numerical Likelihood Maximization / 136
4.3.1 Newton-Raphson and Quasi-Newton Methods / 137
4.3.2 Imposing Parameter Restrictions / 140
4.4 Evolutionary Algorithms / 145
4.4.1 Differential Evolution / 146
4.4.2 Covariance Matrix Adaption Evolutionary Strategy / 149
4.5 Problems / 155
5 Methods of Point Estimation 157
5.1 Univariate Mixed Normal Distribution / 157
5.1.1 Introduction / 157
5.1.2 Simulation of Univariate Mixtures / 160
5.1.3 Direct Likelihood Maximization / 161
5.1.4 Use of the EM Algorithm / 169
5.1.5 Shrinkage-Type Estimation / 174
5.1.6 Quasi-Bayesian Estimation / 176
5.1.7 Confidence Intervals / 178
5.2 Alternative Point Estimation Methodologies / 184
5.2.1 Method of Moments Estimator / 185
5.2.2 Use of Goodness-of-Fit Measures / 190
5.2.3 Quantile Least Squares / 191
5.2.4 Pearson Minimum Chi-Square / 193
5.2.5 Empirical Moment Generating Function Estimator / 195
5.2.6 Empirical Characteristic Function Estimator / 198
5.3 Comparison of Methods / 199
5.4 A Primer on Shrinkage Estimation / 200
5.5 Problems / 202
PART II FURTHER FUNDAMENTAL CONCEPTS IN STATISTICS
6 Q-Q Plots and Distribution Testing 209
6.1 P-P Plots and Q-Q Plots / 209
6.2 Null Bands / 211
6.2.1 Definition and Motivation / 211
6.2.2 Pointwise Null Bands via Simulation / 212
6.2.3 Asymptotic Approximation of Pointwise Null Bands / 213
6.2.4 Mapping Pointwise and Simultaneous Significance Levels / 215
6.3 Q-Q Test / 217
6.4 Further P-P and Q-Q Type Plots / 219
6.4.1 (Horizontal) Stabilized P-P Plots / 219
6.4.2 Modified S-P Plots / 220
6.4.3 MSP Test for Normality / 224
6.4.4 Modified Percentile (Fowlkes-MP) Plots / 228
6.5 Further Tests for Composite Normality / 231
6.5.1 Motivation / 232
6.5.2 Jarque-Bera Test / 234
6.5.3 Three Powerful (and More Recent) Normality Tests / 237
6.5.4 Testing Goodness of Fit via Binning: Pearson's X P2 Test / 240
6.6 Combining Tests and Power Envelopes / 247
6.6.1 Combining Tests / 248
6.6.2 Power Comparisons for Testing Composite Normality / 252
6.6.3 Most Powerful Tests and Power Envelopes / 252
6.7 Details of a Failed Attempt / 255
6.8 Problems / 260
7 Unbiased Point Estimation and Bias Reduction 269
7.1 Sufficiency / 269
7.1.1 Introduction / 269
7.1.2 Factorization / 272
7.1.3 Minimal Sufficiency / 276
7.1.4 The Rao-Blackwell Theorem / 283
7.2 Completeness and the Uniformly Minimum Variance Unbiased Estimator / 286
7.3 An Example with i.i.d. Geometric Data / 289
7.4 Methods of Bias Reduction / 293
7.4.1 The Bias-Function Approach / 293
7.4.2 Median-Unbiased Estimation / 296
7.4.3 Mode-Adjusted Estimator / 297
7.4.4 The Jackknife / 302
7.5 Problems / 305
8 Analytic Interval Estimation 313
8.1 Definitions / 313
8.2 Pivotal Method / 315
8.2.1 Exact Pivots / 315
8.2.2 Asymptotic Pivots / 318
8.3 Intervals Associated with Normal Samples / 319
8.3.1 Single Sample / 319
8.3.2 Paired Sample / 320
8.3.3 Two Independent Samples / 322
8.3.4 Welch's Method for ¿¿¿¿1 - ¿¿¿¿2 when ¿¿¿¿12 ¿ ¿¿¿¿22 / 323
8.3.5 Satterthwaite's Approximation / 324
8.4 Cumulative Distribution Function Inversion / 326
8.4.1 Continuous Case / 326
8.4.2 Discrete Case / 330
8.5 Application of the Nonparametric Bootstrap / 334
8.6 Problems / 337
PART III ADDITIONAL TOPICS
9 Inference in a Heavy-Tailed Context 341
9.1 Estimating the Maximally Existing Moment / 342
9.2 A Primer on Tail Estimation / 346
9.2.1 Introduction / 346
9.2.2 The Hill Estimator / 346
9.2.3 Use with Stable Paretian Data / 349
9.3 Noncentral Student's t Estimation / 351
9.3.1 Introduction / 351
9.3.2 Direct Density Approximation / 352
9.3.3 Quantile-Based Table Lookup Estimation / 353
9.3.4 Comparison of NCT Estimators / 354
9.4 Asymmetric Stable Paretian Estimation / 358
9.4.1 Introduction / 358
9.4.2 The Hint Estimator / 359
9.4.3 Maximum Likelihood Estimation / 360
9.4.4 The McCulloch Estimator / 361
9.4.5 The Empirical Characteristic Function Estimator / 364
9.4.6 Testing for Symmetry in the Stable Model / 366
9.5 Testing the Stable Paretian Distribution / 368
9.5.1 Test Based on the Empirical Characteristic Function / 368
9.5.2 Summability Test and Modification / 371
9.5.3 ALHADI: The ¿¿¿¿-Hat Discrepancy Test / 375
9.5.4 Joint Test Procedure / 383
9.5.5 Likelihood Ratio Tests / 384
9.5.6 Size and Power of the Symmetric Stable Tests / 385
9.5.7 Extension to Testing the Asymmetric Stable Paretian Case / 395
10 The Method of Indirect Inference 401
10.1 Introduction / 401
10.2 Application to the Laplace Distribution / 403
10.3 Application to Randomized Response / 403
10.3.1 Introduction / 403
10.3.2 Estimation via Indirect Inference / 406
10.4 Application to the Stable Paretian Distribution / 409
10.5 Problems / 416
A Review of Fundamental Concepts in Probability Theory 419
A.1 Combinatorics and Special Functions / 420
A.2 Basic Probability and Conditioning / 423
A.3 Univariate Random Variables / 424
A.4 Multivariate Random Variables / 427
A.5 Continuous Univariate Random Variables / 430
A.6 Conditional Random Variables / 432
A.7 Generating Functions and Inversion Formulas / 434
A.8 Value at Risk and Expected Shortfall / 437
A.9 Jacobian Transformations / 451
A.10 Sums and Other Functions / 453
A.11 Saddlepoint Approximations / 456
A.12 Order Statistics / 460
A.13 The Multivariate Normal Distribution / 462
A.14 Noncentral Distributions / 465
A.15 Inequalities and Convergence / 467
A.15.1 Inequalities for Random Variables / 467
A.15.2 Convergence of Sequences of Sets / 469
A.15.3 Convergence of Sequences of Random Variables / 473
A.16 The Stable Paretian Distribution / 483
A.17 Problems / 492
A.18 Solutions / 509
References 537
Index 561
Preface
Young people today love luxury. They have bad manners, despise authority, have no respect for older people, and chatter when they should be working.
(Socrates, 470-399 BC)
This book on statistical inference can be viewed as a continuation of the author's previous two books on probability theory (Paolella, 2006, 2007), hereafter referred to as Books I and II. Of those two, Book I (or any book at a comparable level) is more relevant, in establishing the basics of random variables and distributions as required to understand statistical methodology. Occasional use of material from Book II is made, though most of that required material is reviewed in the appendix herein in order to keep this volume as self-contained as possible. References to those books will be abbreviated as I and II, respectively. For example, Figure 5.1 in (Chapter 5 of) Paolella (2006) is referred to as Figure I.5.1; and similarly for equation references, where (I.5.22) and (II.4.3) refer to equations (5.22) and (4.3) in Paolella (2006) and Paolella (2007) respectively (and both are the Cauchy-Schwarz inequality).
Further prerequisites are the same as those for Book I, namely a solid command of basic undergraduate calculus and matrix algebra, and occasionally very rudimentary concepts from complex analysis, as required for working with characteristic functions. As with Books I and II, a solutions manual to the exercises is available.
Certainly, no measure theory is required, nor any previous exposure to statistical inference, though it would be useful to have had an introductory course in statistics or data analysis. The book is aimed at beginning master's students in statistics, though it is written to be fully accessible to master's students in the social sciences. In particular, I have in mind students in economics and finance, as I provide introductory coverage of some nonstandard topics, notably Chapter 9 on heavy-tailed distributions and tail estimation, and detailed coverage of the mixed normal distribution in Chapter 5.
Naturally, the book can be also used for undergraduates in a mathematics program. For the intended audience of master's students in statistics or the social sciences, the instructor is welcome to skip material that uses concepts from convergence and limit theorems if the target audience is not ready for such mathematics. This is one of the points of this book: such material is included so that, for example, accessible, detailed proofs of the Glivenko-Cantelli theorem and the limiting distribution of the maximum likelihood estimator can be demonstrated at a reasonably rigorous level. The vast majority of the book only requires simple algebra and basic calculus.
In this book, I stick to the independent, identically distributed (i.i.d.) setting, using it as a platform for introducing the major concepts arising in statistics without the additional overhead and complexities associated with, say, (generalized) linear models, survival analysis, copula methods, and time series. This also allows for more in-depth coverage of important topics such as bootstrap techniques, nonparametric inference via the empirical c.d.f., numerical optimization, discrete mixture models, bias-adjusted estimators, tail estimation (as a nice segue into the study of extreme value theory), and the method of indirect inference. A future project, referred to as Book IV, builds on the framework in the present volume and is dedicated to the linear model (regression and ANOVA) and, primarily, time series analysis (univariate ARMAX models), GARCH, and multivariate distributions for modeling and predicting financial asset returns.
Before discussing the contents of this volume, it is important to mention that, similar to Books I and II, the overriding goals are:
- to emphasize the practical side of matters by addressing computation issues;
- to motivate students to actively engage in the material by replicating and extending reported results, and to read the literature on topics of their interest;
- to go beyond the standard topics and examples traditionally taught at this level, albeit still within the i.i.d. framework; and
- to set the stage for students intending to pursue further courses in statistical/econometric inference (and quantitative risk management), as well as those embarking on careers as modern data analysts and applied quantitative researchers.
Regarding point (i), I explain to students that computer programming skills are necessary, but far from sufficient, to be successful in applied research. In an occasional lecture dedicated to programming issues, I emphasize (not sarcastically - I do not test computer skills) that it is fully optional, and those students who are truly mathematically talented can skip it, explaining that they will always have programmers in their team (in industry) or PhD students and co-authors (in academics) as resources to do the computer grunt work implementing their theoretical constructs. Oddly, nobody leaves the room.
With respect to point (ii), the reader will notice that some chapters have few (or no) exercises (some have many). This is because I believe the nature of the material presented is such that it offers the student a judicious platform for self experimentation, particularly with respect to numerical implementation. Some of the material could have been packaged as exercises (and much is), though I prefer to illustrate important concepts, distributions, and methods in a detailed way, along with code and graphics, instead of banishing it to the exercises (or, far worse, littering the exercises with trite, useless algebraic manipulations devoid of genuine application) and instead encourage the student to replicate, complement, and extend the presented material. The reader will no doubt tire at my occasional explicit suggestions for this ("The reader is encouraged."). One of my role model authors is Hamilton (1994), whose book has no exercises, is twice the size of this book, and has been praised as an outstanding presentation of time series. Hamilton clearly intended to teach the material in a straightforward, clear way, with highly detailed and accessible derivations. I aspire to a similar approach, as well as adding numeric illustrations and Matlab code.1
Regarding point (iii), besides the obvious benefit of giving students a more modern viewpoint on methods and applications in statistics, having a large variety of such is useful for students (and instructors) looking for interesting, relevant topics for master's theses. An example of a nonstandard topic of interest is in Chapter 5, giving a detailed discussion on the problems associated with, and solutions to, estimating the (univariate) discrete mixed normal, via a variety of non-m.l.e. methods (empirical m.g.f., c.f., quantile-based methods, etc.), and the use of the EM algorithm with shrinkage, with its immediate extension to the multivariate case. For the latter, I refer to recent work of mine using the minimum covariance determinant (MCD) for parameter estimation, this also serving as an example of (i) what can be done when, here, the multivariate normal mixture is surely misspecified, and (ii) use of a most likely inconsistent estimator (which outperforms the m.l.e. in terms of density forecasting and portfolio allocation for financial returns data).
Particularly with the less common topics developed in Part III of this book, the result is, like Books I and II, a substantially larger project than some similarly positioned books. It is thus essential to understand that not everything in the text is supposed to be (or could be) covered in the classroom, at least not in one semester. In my opinion, students (even in mathematics departments, but particularly those in the social sciences) benefit from having clearly laid out explanations, detailed proofs, illustrative examples, a variety of approaches, introductions to modern techniques, and discussions of important, possibly controversial topics (e.g., the irrelevance of consistent estimators in light of the notion that, in realistic settings, the model is wrong anyway, and changing through time or space; and the arguable superfluousness, if not danger, of the typical hypothesis testing framework), as well as topics that could initially be skipped in a first course, but returned to later or assigned as outside reading, depending on the interests and abilities of the students.
I wish to emphasize that this book is for teaching, as (obviously) opposed to being a research monograph, or (less obviously) a dry regurgitation of traditional concepts and examples. An anonymous reviewer of Book I, when I initially submitted it to the publisher Wiley, remarked "it's too much material: It seems the author has written a brain dump." While I like to think I have much more in my head than what was written in that book, he (his gender was indeed disclosed to me) apparently believes that students (let alone instructors) are incapable of assessing what material is core, and what can be deemed "extra," or suitable for reading after the main concepts are mastered. It is trivial to just skip some material, whereas not having it at all results in an admittedly shorter book (who cares, besides arguably the publisher?) that accomplishes far less, and might even give the student a false sense of understanding and competence (which will be painfully revealed in a quant job interview). Fortunately, not everyone agrees with him: Besides heart-warming student feedback over the years on Book I (from...
System requirements
File format: ePUB
Copy protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (not Kindle).
The file format ePub works well for novels and non-fiction books – i.e., „flowing” text without complex layout. On an e-reader or smartphone, line and page breaks automatically adjust to fit the small displays.
This eBook uses Adobe-DRM, a „hard” copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise using your personal Adobe ID after installation of any reading software.
For more information, see our ebook Help page.