The new edition of this book provides an easily accessible introduction to the statistical analysis of network data using R. It has been fully revised and can be used as a stand-alone resource in which multiple R packages are used to illustrate how to conduct a wide range of network analyses, from basic manipulation and visualization, to summary and characterization, to modeling of network data. The central package is igraph, which provides extensive capabilities for studying network graphs in R. The new edition of this book includes an overhaul to recent changes in igraph. The material in this book is organized to flow from descriptive statistical methods to topics centered on modeling and inference with networks, with the latter separated into two sub-areas, corresponding first to the modeling and inference of networks themselves, and then, to processes on networks.
The book begins by covering tools for the manipulation of network data. Next, it addresses visualization and characterization of networks. The book then examines mathematical and statistical network modeling. This is followed by a special case of network modeling wherein the network topology must be inferred. Network processes, both static and dynamic are addressed in the subsequent chapters. The book concludes by featuring chapters on network flows, dynamic networks, and networked experiments. Statistical Analysis of Network Data with R, 2nd Ed. has been written at a level aimed at graduate students and researchers in quantitative disciplines engaged in the statistical analysis of network data, although advanced undergraduates already comfortable with R should find the book fairly accessible as well.
Eric D. Kolaczyk is a professor of statistics and a data science faculty fellow at Boston University, in the Department of Mathematics and Statistics, where he also is an affiliated faculty member in the Bioinformatics Program, the Division of Systems Engineering, and the Center for Systems Neuroscience. Currently, he serves as the director of Boston University's Hariri Institute for Computing. His publications on network-based topics, beyond the development of statistical methodology and theory, include work on applications ranging from the detection of anomalous traffic patterns in computer networks to the prediction of biological function in networks of interacting proteins to the characterization of influence of groups of actors in social networks. He is an elected fellow of the American Association for the Advancement of Science (AAAS), the American Statistical Association (ASA), and the Institute of Mathematical Statistics, an elected member of the International Statistical Institute (ISI), and an elected senior member of the Institute of Electrical and Electronics Engineers (IEEE).
is a software engineer at RStudio, where he works on R infrastructure packages. He holds a PhD in Computer Science from Eötvös University, Hungary, and he has done postdocs at the Swiss Institute of Bioinformatics, the University of Lausanne, and Harvard University.
1 Introduction.- 2 Manipulating Network Data.- 3 Visualizing Network Data.- 4 Descriptive Analysis of Network Graph Characteristics.- 5 Mathematical Models for Network Graphs.- 6 Statistical Models for Network Graphs.- 7 Network Topology Inference.- 8 Modeling and Prediction for Processes on Network Graphs.- 9 Analysis of Network Flow Data.- 10 Networked Experiments.- 11 Dynamic Networks.- Index.