Statistics by Simulation

Name: Statistics by Simulation | A Synthetic Data Approach
Brand: Princeton University Press
Price: 43.99 EUR
Availability: OnlineOnly

A Synthetic Data Approach

Carsten F. Dormann Aaron M. Ellison(Author)

Princeton University Press

Published on 3. June 2025

456 pages

E-Book

ePUB with digital watermarking

System requirements

978-0-691-27546-8 (ISBN)

€43.99incl. 7% vat

System requirements

for ePUB with digital watermarking

E-Book Single Licence

Available for download

Description

More details

Other editions

Persons

Content

Cover
Table of Contents
Preface
Acknowledgments
Part I: Propositi: Why and how to Simulate
1. General Introduction
1.1. What are Simulated Data?
1.2. Simulated Data are Specific
1.3. Yes, Scientists Really Simulate Data
1.4. There Aremany Good Reasons to Simulate Data
1.5. Useful Background Knowledge to Use this Bookmost Effectively
1.6. Notational Conventions
1.7. Structure, Organisation, and Flow
1.8. Summary
2. The Basics of Simulating Data and the Need for Computational Competence
2.1. A Roadmap for Simulation in Statistics
2.2. Two Simple Examples
2.3. More Complex Examples
2.4. Simulating Autocorrelated Data
2.5. Simulation Versus Randomisation Techniques
2.6. Summary
Part II: Ante Mensuram: Prospective Simulations of Study Designs and their Power
3. Think Before you Act
3.1. The Illusion of Truth:A Case Study
3.2. The Question Comes First
3.3. Setting Expectations, Defining Hypotheses
3.4. Testing Hypotheses and Assessing their Support
3.5. Pre-registration
3.6. Summary
4. Prospective Simulation of Statistical Power
4.1. Simple Group Comparisons
4.2. How many Data Points do we Need for a Simple Correlation?
4.3. Is "Recruit Until Significant" Problematic?
4.4. How Long does a Time Series Have to be?
4.5. Improving Estimates: Is the Experiment Powerful Enough?
4.6. Summary
Part III: Post Mensuram: Simulations in Statistical Analysis
5. Assumptions: Is that One Important
5.1. Linear Regression Requires the data to be Normally Distributed
5.2. Regression Models Also Assume that Errors in Predictor Variables are Negligible or Unimportant
5.3. The Intended, Rather than the Realised, Manipulation is an Admissible Predictor Variable
5.4. ANOVA Requires Homoscedasticity
5.5. Multiple Testing and the Inflation of False Positives
5.6. Hyper-Distributions Inmixed-Effectmodels are Normal
5.7. Correlations Among Predictors are the Same Outside the Range of the Observed Data
5.8. Summary
6. Folklore: Is that Rule-of-Thumb True or Useful
6.1. Model Selection does Not Always Improve Interpretation
6.2. Selecting One of Two Correlated Predictors does Not Mitigate Collinearity in Regression and Machine Learning
6.3. It is Notok to Categorise Continuous Predictor Variables
6.4. Usemonte Carlo Simulation when Data are Heteroscedastic
6.5. Time Series Should Not be Detrended by Default
6.6. Machine Learning and Bigdata do Not Obviate Rules-of-Thumb
6.7. Summary
7. Workflows and Pipelines Can Introduce and Propagate Artefacts
7.1. What Can we do Aboutmissing Data?
7.2. Types Ofmissing Data
7.3. Imputation Ofmissing Predictors
7.4. Estimating Values for Censored Observations
7.5. Pre-Selecting Predictors
7.6. Regression on Residuals
7.7. Error Propagation
7.8. Workflow: Stringing Multiple Statistical Steps into an Analytical Pipeline
7.9. Summary
Part IV: Post Exemplum: Diagnostic Simulations
8. Evaluating Models: How well do they Really Fit?
8.1. Learning from the Prior
8.2. What does Amodel Tell us, and what does it not tell us?
8.3. Visualising More Complex Effects: Conditional, Marginal, and Partial Plots
8.4. Model Diagnostics
8.5. Predicting with Confidence is Not the same as Confidence in Prediction
8.6. Iterative Learning: New Priors from Old Posteriors
8.7. Outlook
8.8. Summary
9. Post Hoc Alternatives to Retrospective Power Analysis
9.1. Reprise: Prospective Power Analysis
9.2. What is Retrospective Power Analysis?
9.3. Post Hoc Alternatives to Retrospective Power Analysis
9.4. Summary: Most Retrospective Analyses Should be Avoided
9.5. Coda: What Would a Bayesian do Instead?
Part V: In Posterum: Simulations for New Methods
10. Combining Studies: Meta-Analysis and Federated Analysis
10.1. Whence the data?
10.2. From Meta-Analysis through Federated Analysis to Complete Aanalysis
10.3. Meta-Analysis
10.4. Individual Participant-Level Meta-Analysis
10.5. One-Step Federated Analysis
10.6. Multi-Step Federated Analysis
10.7. Complete Data Analysis
10.8. Conclusions and Outlook
10.9. Summary
11. Putting it through Its Paces: Does this New Method Work?
11.1. Unit Testing
11.2. Dimensional Analysis
11.3. Comparisons
11.4. Intellectual Advancement
11.5. Intuitive Understanding
11.6. Model-Agnostic Number of Parameters: Generalised Degrees of Freedom
11.7. Know your Limits
11.8. Summary
12. Outroduction: How Far Should we Push Simulations?
12.1. Stochastic Weather Forecasting
12.2. Infusing Fake Signals to Test the Workflow at LIGO
12.3. Virtual LIDAR Scanning
12.4. Advanced Simulation may be Neither Possible Nor Desirable
Appendix A: Useful R Functions for data Simulations
A.1 Drawing Random Values Froma Distribution
A.2 Doing Things Repeatedly: For-Loops and Replicate
A.3 Shuffling, Resampling, and Bootstrapping: Sample ()
A.4 Little Helpers
A.5 Dedicated Simulation Packages
Index

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

Statistics by Simulation

Description

More details

Other editions

Additional editions

Persons

Content

System requirements