Addressing a broad range of big data analytics in cross-disciplinary applications, this essential handbook focuses on the statistical prospects offered by recent developments in this field. To do so, it covers statistical methods for high-dimensional problems, algorithmic designs, computation tools, analysis flows and the software-hardware co-designs that are needed to support insightful discoveries from big data. The book is primarily intended for statisticians, computer experts, engineers and application developers interested in using big data analytics with statistics. Readers should have a solid background in statistics and computer science.
Wolfgang Karl Härdle is Ladislaus von Bortkievicz Professor of Statistics at the Humboldt University of Berlin and director of C.A.S.E. (Center for Applied Statistics and Economics), director of the Collaborative Research Center 649 "Economic Risk" and also of the IRTG 1792 "High Dimensional Nonstationary Time Series". He teaches quantitative finance and semiparametric statistics. Professor Härdle's research focuses on dynamic factor models, multivariate statistics in finance and computational statistics. He is an elected member of the International Statistical Institute (ISI) and advisor to the Guanghua School of Management, Peking University, China.
Henry Horng-Shing Lu is Professor at the Institute of Statistics of the National Chiao Tung University, Taiwan and serves as the Vice President of Academic Affairs. He received his Ph.D. in Statistics from Cornell University, NY in 1994. He is an elected member of the International Statistical Institute (ISI). His research interests include statistics, applications and big data analytics. Professor Lu analyzes different types of data by developing statistical methodologies for machine learning with the power of statistical inference and computation algorithms. His findings were published in a wide spectrum of journals and conference papers. He also co-edited the Handbook of Statistical Bioinformatics, published by Springer in 2011.
Xiaotong Shen is John Black Johnston Distinguished Professor at the School of Statistics of the University of Minnesota, MN. He received his Ph.D. in Statistics from the University of Chicago, IL in 1991. He is Fellow of the American Statistical Association (ASA), the Institute of Mathematical Statistics (IMS), and the American Association for the Advancement of Science (AAAS) as well as an elected member of the International Statistical Institute (ISI). Professor Shen's areas of interest include machine learning and data mining, likelihood-based inference, semiparametric and nonparametric models, model selection and averaging. His current research efforts are mainly devoted to the further development of structured learning as well as high-dimensional/high-order analysis. The targeted application areas are biomedical sciences and engineering.
Preface.- Statistics, Statisticians, and the Internet of Things (John M. Jordan and Dennis K. J. Lin).- Cognitive Data Analysis for Big Data (Jing Shyr, Jane Chu and Mike Woods).- Statistical Leveraging Methods in Big Data (Xinlian Zhang, Rui Xie and Ping Ma).- Scattered Data and Aggregated Inference (Xiaoming Huo, Cheng Huang and Xuelei Sherry Ni).- Nonparametric Methods for Big Data Analytics (Hao Helen Zhang).- Finding Patterns in Time Series (James E. Gentle and Seunghye J. Wilson).- Variational Bayes for Hierarchical Mixture Models (Muting Wan, James G. Booth and Martin T. Wells).- Hypothesis Testing for High-Dimensional Data (Wei Biao Wu, Zhipeng Lou and Yuefeng Han).- High-Dimensional Classification (Hui Zou).- Analysis of High-Dimensional Regression Models Using Orthogonal Greedy Algorithms (Hsiang-Ling Hsu, Ching-Kang Ing and Tze Leung Lai).- Semi-Supervised Smoothing for Large Data Problems (Mark Vere Culp, Kenneth Joseph Ryan and George Michailidis).- Inverse Modeling: A Strategy to Cope with Non-Linearity (Qian Lin, Yang Li and Jun S. Liu).- Sufficient Dimension Reduction for Tensor Data (Yiwen Liu, Xin Xing and Wenxuan Zhong).- Compressive Sensing and Sparse Coding (Kevin Chen and H. T. Kung).- Bridging Density Functional Theory and Big Data Analytics with Applications (Chien-Chang Chen, Hung-Hui Juan, Meng-Yuan Tsai and Henry Horng-Shing Lu).- A Tutorial on Libra: R Package for the Linearized Bregman Algorithm in High-Dimensional Statistics (Jiechao Xiong, Feng Ruan and Yuan Yao).- Q3-D3-LSA: D3.js and generalized vector space models for Statistical Computing (Lukas Borke and Wolfgang Karl Härdle).- Functional Data Analysis for Big Data: A Case Study on California Temperature Trends (Pantelis Zenon Hadjipantelis and Hans-Georg Müller).- Bayesian Spatiotemporal Modeling for Detecting Neuronal Activation via Functional Magnetic Resonance Imaging (Martin Bezener, Lynn E. Eberly, John Hughes, Galin Jones and Donald R. Musgrove).- Construction of Tight Frames on Graphs and Application to Denoising (Franziska Göbel, Gilles Blanchard and Ulrike von Luxburg).- Beta-Boosted Ensemble for Big Credit Scoring Data (Maciej Zieba and Wolfgang Karl Härdle).-