
Statistics by Simulation
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
An accessible guide to understanding statistics using simulations, with examples from a range of scientific disciplines Real-world challenges such as small sample sizes, skewed distributions of data, biased sampling designs, and more predictors than data points are pushing the limits of classical statistical analysis. This textbook provides a new tool for the statistical toolkit: data simulations. It shows that using simulation and data-generating models is an excellent way to validate statistical reasoning and to augment study design and statistical analysis with planning and visualization. Although data simulations are not new to professional statisticians, Statistics by Simulation makes the approach accessible to a broader audience, with examples from many fields. It introduces the reasoning behind data simulation and then shows how to apply it in planning experiments or observational studies, developing analytical workflows, deploying model diagnostics, and developing new indices and statistical methods. • Covers all steps of statistical practice, from planning projects to post-hoc analysis and model checking • Provides examples from disciplines including sociology, psychology, ecology, economics, physics, and medicine • Includes R code for all examples, with data and code freely available online • Offers bullet-point outlines and summaries of each chapter • Minimizes the use of jargon and requires only basic statistical background and skills
More details
Other editions
Additional editions


Persons
Content
- Cover
- Table of Contents
- Preface
- Acknowledgments
- Part I: Propositi: Why and how to Simulate
- 1. General Introduction
- 1.1. What are Simulated Data?
- 1.2. Simulated Data are Specific
- 1.3. Yes, Scientists Really Simulate Data
- 1.4. There Aremany Good Reasons to Simulate Data
- 1.5. Useful Background Knowledge to Use this Bookmost Effectively
- 1.6. Notational Conventions
- 1.7. Structure, Organisation, and Flow
- 1.8. Summary
- 2. The Basics of Simulating Data and the Need for Computational Competence
- 2.1. A Roadmap for Simulation in Statistics
- 2.2. Two Simple Examples
- 2.3. More Complex Examples
- 2.4. Simulating Autocorrelated Data
- 2.5. Simulation Versus Randomisation Techniques
- 2.6. Summary
- Part II: Ante Mensuram: Prospective Simulations of Study Designs and their Power
- 3. Think Before you Act
- 3.1. The Illusion of Truth:A Case Study
- 3.2. The Question Comes First
- 3.3. Setting Expectations, Defining Hypotheses
- 3.4. Testing Hypotheses and Assessing their Support
- 3.5. Pre-registration
- 3.6. Summary
- 4. Prospective Simulation of Statistical Power
- 4.1. Simple Group Comparisons
- 4.2. How many Data Points do we Need for a Simple Correlation?
- 4.3. Is "Recruit Until Significant" Problematic?
- 4.4. How Long does a Time Series Have to be?
- 4.5. Improving Estimates: Is the Experiment Powerful Enough?
- 4.6. Summary
- Part III: Post Mensuram: Simulations in Statistical Analysis
- 5. Assumptions: Is that One Important
- 5.1. Linear Regression Requires the data to be Normally Distributed
- 5.2. Regression Models Also Assume that Errors in Predictor Variables are Negligible or Unimportant
- 5.3. The Intended, Rather than the Realised, Manipulation is an Admissible Predictor Variable
- 5.4. ANOVA Requires Homoscedasticity
- 5.5. Multiple Testing and the Inflation of False Positives
- 5.6. Hyper-Distributions Inmixed-Effectmodels are Normal
- 5.7. Correlations Among Predictors are the Same Outside the Range of the Observed Data
- 5.8. Summary
- 6. Folklore: Is that Rule-of-Thumb True or Useful
- 6.1. Model Selection does Not Always Improve Interpretation
- 6.2. Selecting One of Two Correlated Predictors does Not Mitigate Collinearity in Regression and Machine Learning
- 6.3. It is Notok to Categorise Continuous Predictor Variables
- 6.4. Usemonte Carlo Simulation when Data are Heteroscedastic
- 6.5. Time Series Should Not be Detrended by Default
- 6.6. Machine Learning and Bigdata do Not Obviate Rules-of-Thumb
- 6.7. Summary
- 7. Workflows and Pipelines Can Introduce and Propagate Artefacts
- 7.1. What Can we do Aboutmissing Data?
- 7.2. Types Ofmissing Data
- 7.3. Imputation Ofmissing Predictors
- 7.4. Estimating Values for Censored Observations
- 7.5. Pre-Selecting Predictors
- 7.6. Regression on Residuals
- 7.7. Error Propagation
- 7.8. Workflow: Stringing Multiple Statistical Steps into an Analytical Pipeline
- 7.9. Summary
- Part IV: Post Exemplum: Diagnostic Simulations
- 8. Evaluating Models: How well do they Really Fit?
- 8.1. Learning from the Prior
- 8.2. What does Amodel Tell us, and what does it not tell us?
- 8.3. Visualising More Complex Effects: Conditional, Marginal, and Partial Plots
- 8.4. Model Diagnostics
- 8.5. Predicting with Confidence is Not the same as Confidence in Prediction
- 8.6. Iterative Learning: New Priors from Old Posteriors
- 8.7. Outlook
- 8.8. Summary
- 9. Post Hoc Alternatives to Retrospective Power Analysis
- 9.1. Reprise: Prospective Power Analysis
- 9.2. What is Retrospective Power Analysis?
- 9.3. Post Hoc Alternatives to Retrospective Power Analysis
- 9.4. Summary: Most Retrospective Analyses Should be Avoided
- 9.5. Coda: What Would a Bayesian do Instead?
- Part V: In Posterum: Simulations for New Methods
- 10. Combining Studies: Meta-Analysis and Federated Analysis
- 10.1. Whence the data?
- 10.2. From Meta-Analysis through Federated Analysis to Complete Aanalysis
- 10.3. Meta-Analysis
- 10.4. Individual Participant-Level Meta-Analysis
- 10.5. One-Step Federated Analysis
- 10.6. Multi-Step Federated Analysis
- 10.7. Complete Data Analysis
- 10.8. Conclusions and Outlook
- 10.9. Summary
- 11. Putting it through Its Paces: Does this New Method Work?
- 11.1. Unit Testing
- 11.2. Dimensional Analysis
- 11.3. Comparisons
- 11.4. Intellectual Advancement
- 11.5. Intuitive Understanding
- 11.6. Model-Agnostic Number of Parameters: Generalised Degrees of Freedom
- 11.7. Know your Limits
- 11.8. Summary
- 12. Outroduction: How Far Should we Push Simulations?
- 12.1. Stochastic Weather Forecasting
- 12.2. Infusing Fake Signals to Test the Workflow at LIGO
- 12.3. Virtual LIDAR Scanning
- 12.4. Advanced Simulation may be Neither Possible Nor Desirable
- Appendix A: Useful R Functions for data Simulations
- A.1 Drawing Random Values Froma Distribution
- A.2 Doing Things Repeatedly: For-Loops and Replicate
- A.3 Shuffling, Resampling, and Bootstrapping: Sample ()
- A.4 Little Helpers
- A.5 Dedicated Simulation Packages
- Index
System requirements
File format: ePUB
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use a reading software that can process the file format ePUB: e.g., Adobe Digital Editions or FBReader – both free (see eBook Help).
- Tablet/Smartphone (Android; iOS): Before downloading, install the free app Adobe Digital Editions (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (not Kindle).
The file format ePUB works well for novels and non-fiction books – i.e., „flowing” text without complex layout. On an e-reader or smartphone, line and page breaks automatically adjust to fit the small displays.
This eBook uses Watermark-DRM, a „soft” copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.