Small Area Estimation


J. N. K. Rao, Isabel Molina (Authors)

Wiley (Publisher)

2nd edition

Published on 24 August 2015

480 pages

E-Book

ePUB with Adobe DRM

978-1-118-73572-5 (ISBN)

€98.99 incl. 7% VAT

E-book single-user license

Available as a download

Description

Praise for the First Edition

"This pioneering work, in which Rao provides a comprehensive and up-to-date treatment of small area estimation, will become a classic...I believe that it has the potential to turn small area estimation...into a larger area of importance to both researchers and practitioners." --Journal of the American Statistical Association

Written by two experts in the field, Small Area Estimation, Second Edition provides a comprehensive and up-to-date account of the methods and theory of small area estimation (SAE), particularly indirect estimation based on explicit small area linking models. The model-based approach to small area estimation offers several advantages including increased precision, the derivation of "optimal" estimates and associated measures of variability under an assumed model, and the validation of models from the sample data.

Emphasizing real data throughout, the Second Edition maintains a self-contained account of crucial theoretical and methodological developments in the field of SAE. The new edition provides extensive accounts of new and updated research, which often involves complex theory to handle model misspecifications and other complexities. Including information on survey design issues and traditional methods employing indirect estimates based on implicit linking models, Small Area Estimation, Second Edition also features:

* Additional sections describing the use of R code data sets for readers to use when replicating applications
* Numerous examples of SAE applications throughout each chapter, including recent applications in U.S. Federal programs
* New topical coverage on extended design issues, synthetic estimation, further refinements and solutions to the Fay-Herriot area level model, basic unit level models, and spatial and time series models
* A discussion of the advantages and limitations of various SAE methods for model selection from data as well as comparisons of estimates derived from models to reliable values obtained from external sources, such as previous census or administrative data

Small Area Estimation, Second Edition is an excellent reference for practicing statisticians and survey methodologists as well as practitioners interested in learning SAE methods. The Second Edition is also an ideal textbook for graduate-level courses in SAE and reliable small area statistics.

Reviews

"The book is an excellent reference for practicing statisticians and survey methodologists as well as practitioners interested in learning SAE methods. The second edition is also an ideal textbook for graduate-level courses in SAE and reliable small area statistics." (Zentralblatt MATH 2016)

Contents

List of Figures
List of Tables
Foreword to the First Edition
Preface to the Second Edition
Preface to the First Edition

1 *Introduction
  1.1 What is a Small Area?
  1.2 Demand for Small Area Statistics
  1.3 Traditional Indirect Estimators
  1.4 Small Area Models
  1.5 Model-Based Estimation
  1.6 Some Examples

2 Direct Domain Estimation
  2.1 Introduction
  2.2 Design-Based Approach
  2.3 Estimation of Totals
  2.4 Domain Estimation
  2.5 Modified GREG Estimator
  2.6 Design Issues
  2.7 *Optimal Sample Allocation for Planned Domains
  2.8 Proofs

3 Indirect Domain Estimation
  3.1 Introduction
  3.2 Synthetic Estimation
  3.3 Composite Estimation
  3.4.1 Common Weight
  3.5 Proofs

4 Small Area Models
  4.1 Introduction
  4.2 Basic Area Level Model
  4.3 Basic Unit Level Model
  4.4 Extensions: Area Level Models
  4.5 Extensions: Unit Level Models
  4.6 Generalized Linear Mixed Models

5 Empirical Best Linear Unbiased Prediction (EBLUP): Theory
  5.1 Introduction
  5.2 General Linear Mixed Model
  5.3 Block Diagonal Covariance Structure
  5.4 *Model Identification and Checking
  5.5 *Software

6 Empirical Best Linear Unbiased Prediction (EBLUP): Basic Area Level Model
  6.1 EBLUP Estimation
  6.2 MSE Estimation
  6.3 *Robust Estimation in the Presence of Outliers
  6.4 *Practical Issues
  6.5 *Software

7 Basic Unit Level Model
  7.1 EBLUP Estimation
  7.2 MSE Estimation
  7.3 *Applications
  7.4 *Outlier Robust EBLUP Estimation
  7.5 *M-Quantile Regression
  7.6 *Practical Issues
  7.7 *Software
  7.8 *Proofs

8 EBLUP: Extensions
  8.1 *Multivariate Fay-Herriot Model
  8.2 Correlated Sampling Errors
  8.3 Time Series and Cross-Sectional Models
  8.4 *Spatial Models
  8.5 *Two-Fold Subarea Level Models
  8.6 *Multivariate Nested Error Regression Model
  8.7 Two-Fold Nested Error Regression Model
  8.8 *Two-Level Model
  8.9 *Models for Multinomial Counts
  8.10 *EBLUP for Vectors of Area Proportions
  8.11 *Software

9 Empirical Bayes (EB) Method
  9.1 Introduction
  9.2 Basic Area Level Model
  9.3 Linear Mixed Models
  9.4 *EB Estimation of General Finite Population Parameters
  9.5 Binary Data
  9.6 Disease Mapping
  9.7 *Design-Weighted EB Estimation: Exponential Family Models
  9.8 Triple-Goal Estimation
  9.9 Empirical Linear Bayes
  9.10 Constrained LB
  9.11 *Software
  9.12 Proofs

10 Hierarchical Bayes (HB) Method
  10.1 Introduction
  10.2 MCMC Methods
  10.3 Basic Area Level Model
  10.4 *Unmatched Sampling and Linking Area Level Models
  10.5 Basic Unit Level Model
  10.6 General ANOVA Model
  10.7 *HB Estimation of General Finite Population Parameters
  10.8 Two-Level Models
  10.9 Time Series and Cross-Sectional Models
  10.10 Multivariate Models
  10.11 Disease Mapping Models
  10.12 *Two-Part Nested Error Model
  10.13 Binary Data
  10.14 *Missing Binary Data
  10.15 Natural Exponential Family Models
  10.16 Constrained HB
  10.17 *Approximate HB Inference and Data Cloning
  10.18 Proofs

References
Author Index
Subject Index

Preface to the Second Edition

Small area estimation (SAE) deals with the problem of producing reliable estimates of parameters of interest and the associated measures of uncertainty for subpopulations (areas or domains) of a finite population for which samples of inadequate sizes or no samples are available. Traditional "direct estimates," based only on the area-specific sample data, are not suitable for SAE, and it is necessary to "borrow strength" across related small areas through supplementary information to produce reliable "indirect" estimates for small areas. Indirect model-based estimation methods, based on explicit linking models, are now widely used.

The first edition of Small Area Estimation (Rao 2003a) provided a comprehensive account of model-based methods for SAE up to the end of 2002. It is gratifying to see the enthusiastic reception it has received, as judged by the significant number of citations and the rapid growth in SAE literature over the past 12 years. Demand for reliable small area estimates has also greatly increased worldwide. As an example, the estimation of complex poverty measures at the municipality level is of current interest, and the World Bank uses a model-based method, based on simulating multiple censuses, in more than 50 countries worldwide to produce poverty statistics for small areas.

The main aim of the present second edition is to update the first edition by providing a comprehensive account of important theoretical developments from 2003 to 2014. New SAE literature is quite extensive and often involves complex theory to handle model misspecifications and other complexities. We have retained a large portion of the material from the first edition to make the book self-contained, and supplemented it with selected new developments in theory and methods of SAE. Notations and terminology used in the first edition are largely retained. As in the first edition, applications are included throughout the chapters. An added feature of the second edition is the inclusion of sections (Sections 5.5, 6.5, 7.7, 8.11, and 9.11) describing specific R software for SAE, specifically the R package sae (Molina and Marhuenda 2013; Molina and Marhuenda 2015). These sections include examples of SAE applications using data sets included in the package and provide all the necessary R code, so that the user can exactly replicate the applications. New sections and old sections with significant changes are indicated by an asterisk in the book. Chapter 3 on "Traditional Demographic Methods" from the first edition is deleted, partly because of page constraints and partly because the material is somewhat unrelated to mainstream model-based methods. Also, we have not been able to keep up to date with the new developments in demographic methods.

Chapter 1 introduces basic terminology related to SAE and presents selected important applications as motivating examples. Chapter 2, as in the first edition, presents a concise account of direct estimation of totals or means for small areas and addresses survey design issues that have a bearing on SAE. New Section 2.7 deals with optimal sample allocation for planned domains and the estimation of marginal row and column strata means in the presence of two-way stratification. Chapter 3 gives a fairly detailed account of traditional indirect estimation based on implicit linking models. The well-known James-Stein method of composite estimation is also studied in the context of sample survey data. The new section on generalized SPREE studies generalized structure preserving estimation (GSPREE), which relaxes some interaction assumptions made in the traditional SPREE. SPREE is often used in practice because it makes fuller use of reliable direct estimates at a higher level to produce synthetic estimates. Another important addition is the new section on weight-sharing (or weight-splitting) methods. These methods produce a two-way table of weights, with the units in the full sample as rows and the areas as columns, such that the cell weights in each row add up to the original sample weight. Such methods are especially useful in micro-simulation modeling, which can involve a large number of variables of interest.
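The row constraint on the weight table described above can be illustrated with a minimal Python sketch. The proportional splitting rule, the function names share_weights and area_totals, and the example numbers are all assumptions made for illustration; the book's own weight-sharing methods are not reproduced here.

```python
def share_weights(sample_weights, scores):
    """Split each unit's sample weight across areas.

    sample_weights: original design weights, one per sampled unit.
    scores: one row per unit, one non-negative score per area
        (hypothetical inputs; the splitting rule is an assumption).
    Returns a units-by-areas table whose rows sum to the original weights.
    """
    table = []
    for weight, row in zip(sample_weights, scores):
        total = sum(row)
        if total <= 0:
            raise ValueError("each unit needs a positive score for some area")
        # Proportional split: cell weight = unit weight * share of the score.
        table.append([weight * score / total for score in row])
    return table


def area_totals(table, y):
    """Estimate each area's total of y using the column of shared weights."""
    n_areas = len(table[0])
    return [sum(row[a] * value for row, value in zip(table, y))
            for a in range(n_areas)]


weights = [10.0, 20.0, 5.0]                     # original sample weights
scores = [[1.0, 1.0], [3.0, 1.0], [0.0, 2.0]]   # illustrative unit-by-area scores
table = share_weights(weights, scores)
totals = area_totals(table, [2.0, 1.0, 4.0])
```

Because every row of the table adds up to the unit's original weight, the area totals add up to the usual weighted total for the full sample, whatever variable of interest is plugged in.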

Explicit small area models that account for between-area variability are introduced in Chapter 4 (previous Chapter 5), including linear mixed models and generalized linear mixed models such as logistic linear mixed models with random area effects. The models are classified into two broad categories: (i) area level models that relate the small area means or totals to area level covariates; and (ii) unit level models that relate the unit values of a study variable to unit-specific auxiliary variables. Extensions of the models to handle complex data structures, such as spatial dependence and time series structures, are also considered. A new section introduces semi-parametric mixed models, which are studied later. Chapter 5 (previous Chapter 6) studies linear mixed models involving fixed and random effects. It gives general results on empirical best linear unbiased prediction (EBLUP) and the estimation of mean squared error (MSE) of the EBLUP. A detailed account of model identification and checking for linear mixed models is presented in the new Section 5.4. Available SAS software and R statistical software for linear mixed models are summarized in the new Section 5.5. The R package sae, specifically designed for SAE, is also described.
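To make the area level model concrete, the following Python sketch computes the best linear unbiased predictor under a basic (Fay-Herriot type) area level model with an intercept and one covariate. It assumes the model variance sigma2_v is known; the EBLUP replaces it with an estimate, which this sketch does not attempt. The function name fay_herriot_blup and the example data are hypothetical, and this is not the API of the R package sae.

```python
def fay_herriot_blup(direct, x, psi, sigma2_v):
    """BLUP of area means under direct_i = b0 + b1*x_i + v_i + e_i.

    direct: direct estimates, one per area.
    x: one area level covariate per area.
    psi: known sampling variances of the direct estimates.
    sigma2_v: variance of the random area effects v_i (assumed known here).
    """
    # Weighted least squares with weights 1 / (sigma2_v + psi_i).
    w = [1.0 / (sigma2_v + p) for p in psi]
    sw = sum(w)
    sx = sum(wi * xi for wi, xi in zip(w, x))
    sy = sum(wi * yi for wi, yi in zip(w, direct))
    sxx = sum(wi * xi * xi for wi, xi in zip(w, x))
    sxy = sum(wi * xi * yi for wi, xi, yi in zip(w, x, direct))
    det = sw * sxx - sx * sx
    if det == 0:
        raise ValueError("covariate must vary across areas")
    b1 = (sw * sxy - sx * sy) / det
    b0 = (sy - b1 * sx) / sw

    estimates = []
    for yi, xi, p in zip(direct, x, psi):
        gamma = sigma2_v / (sigma2_v + p)   # weight on the direct estimate
        synthetic = b0 + b1 * xi            # regression-synthetic estimate
        estimates.append(gamma * yi + (1.0 - gamma) * synthetic)
    return estimates


blup = fay_herriot_blup(direct=[2.0, 4.0, 9.0], x=[0.0, 1.0, 2.0],
                        psi=[1.0, 4.0, 0.5], sigma2_v=1.0)
```

Each estimate is a weighted average of the area's direct estimate and a regression-synthetic estimate, so areas with large sampling variance lean more on the model. This is one way to read "borrowing strength" across areas.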

Chapter 6 of the First Edition provided a detailed account of EBLUP estimation of small area means or totals for the basic area level and unit level models, using the general theory given in Chapter 5. In the past 10 years or so, researchers have done extensive work on those two models, especially addressing problems related to model misspecification and other practical issues. As a result, we decided to split the old Chapter 6 into two new chapters, with Chapter 6 focusing on area level models and Chapter 7 addressing unit level models. New topics covered in Chapter 6 include bootstrap MSE estimation and robust estimation in the presence of outliers (Section 6.3). Section 6.4 deals with practical issues related to the basic area level model. It includes important topics such as covariates subject to sampling errors (Section 6.4.4), misspecification of linking models (Section 6.4.7), benchmarking of model-based area estimators to ensure agreement with a reliable direct estimate when aggregated (Section 6.4.6), and the use of "big data" as possible covariates in area level models (Section 6.4.5). Functions of the R package sae designed for estimation under the area level model are described in Section 6.5. An example illustrating the use of these functions is provided. New topics introduced in Chapter 7 include bootstrap MSE estimation, outlier robust EBLUP estimation (Section 7.4), and M-quantile regression (Section 7.5). Section 7.6 deals with practical issues related to the basic unit level model. It presents methods to deal with important topics, including measurement errors in covariates (Section 7.6.4), model misspecification (Section 7.6.5), and semi-parametric nested error models (the sections on EBLUP and REBLUP for the semi-parametric nested error model). Most of the published literature assumes that the assumed model for the population values also holds for the sample. However, in many applications, this assumption may not be true due to informative sampling leading to sample selection bias. Section 7.6.3 gives a detailed treatment of methods to make valid inferences under informative sampling. Functions of the R package sae dealing with the basic unit level model are described in Section 7.7. The use of these functions is illustrated through an application to the County Crop Areas data of Battese, Harter, and Fuller (1988). This application includes calculation of model diagnostics and drawing residual plots. Several important applications are also presented in Chapters 6 and 7.
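The benchmarking idea mentioned above, making model-based area estimates agree with a reliable direct estimate once aggregated, can be sketched with a simple ratio adjustment in Python. This is only one possible adjustment, and it is an assumption that it resembles what the book discusses; the function name ratio_benchmark, the population shares, and the numbers are hypothetical.

```python
def ratio_benchmark(estimates, shares, target):
    """Scale area estimates so their weighted aggregate equals the target.

    estimates: model-based estimates of area means, one per area.
    shares: aggregation weights, e.g. area population shares (assumed known).
    target: reliable direct estimate at the aggregate level.
    """
    aggregate = sum(s * e for s, e in zip(shares, estimates))
    if aggregate == 0:
        raise ValueError("aggregate of the estimates must be non-zero")
    factor = target / aggregate   # common adjustment factor for all areas
    return [e * factor for e in estimates]


model_based = [10.0, 20.0, 30.0]   # illustrative area estimates
shares = [0.5, 0.3, 0.2]           # illustrative population shares
benchmarked = ratio_benchmark(model_based, shares, target=18.0)
```

After the adjustment, the share-weighted sum of the benchmarked estimates equals the aggregate direct estimate, while the relative sizes of the area estimates are left unchanged.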

New Chapters 8, 9, and 10 cover the same material as the corresponding chapters in the first edition. Chapter 8 contains EBLUP theory for various extensions of the basic area level and unit level models, providing updates to the sections in the first edition, in particular a more detailed account of spatial and two-level models. Section 8.4 on spatial models is updated, and functions of the R package sae dealing with spatial area level models are described in Section 8.11. An example illustrating the use of these functions is provided. Section 8.5 presents theory for two-fold subarea level models, which are natural extensions of the basic area level models. Chapter 9 presents empirical Bayes (EB) estimation. The EB method (also called empirical best) is more generally applicable than the EBLUP method. The new section on EB confidence intervals gives an account of methods for constructing confidence intervals for the basic area level model. EB estimation of general area parameters is the theme of Section *EB Estimation of General Finite Population...
