
Linear Dimensionality Reduction
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
This book provides an overview of some classical linear methods in Multivariate Data Analysis. This is an old domain, well established since the 1960s, and refreshed timely as a key step in statistical learning. It can be presented as part of statistical learning, or as dimensionality reduction with a geometric flavor. Both approaches are tightly linked: it is easier to learn patterns from data in low-dimensional spaces than in high-dimensional ones. It is shown how a diversity of methods and tools boil down to a single core method, PCA with SVD, so that the efforts to optimize codes for analyzing massive data sets like distributed memory and task-based programming, or to improve the efficiency of algorithms like Randomized SVD, can focus on this shared core method, and benefit all methods.
This book is aimed at graduate students and researchers working on massive data who have encountered the usefulness of linear dimensionality reduction and are looking for a recipe to implement it. It has been written according to the view that the best guarantee of a proper understanding and use of a method is to study in detail the calculations involved in implementing it. With an emphasis on the numerical processing of massive data, it covers the main methods of dimensionality reduction, from linear algebra foundations to implementing the calculations. The basic requisite elements of linear and multilinear algebra, statistics and random algorithms are presented in the appendix.
More details
Other editions
Additional editions

Person
Alain Franc is a senior researcher at INRAE (National Research Institute for Agriculture, Food and the Environment) and INRIA (National Institute for Research in Digital Science and Technology). He works on dimension reduction and statistical modelling with applications to the discovery of patterns in biodiversity. His focus is on the development of methods for handling massive data sets, which is a challenge for high-performance computing.
Content
- 1. Introduction.- 2. Principal Component Analysis (PCA).- 3. Complements on PCA.- 4. PCA with Metrics on Rows and Columns.- 5. Correspondence Analysis.- 6. PCA with Instrumental Variables.- 7. Canonical Correlation Analysis.- 8. Multiple Canonical Correlation Analysis.- 9. Multidimensional Scaling.
System requirements
File format: PDF
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use the free software Adobe Reader, Adobe Digital Editions, or any other PDF viewer of your choice (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or another reading app for eBooks, e.g., PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Watermark-DRM, a „soft” copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.