Applied Mathematics for the Analysis of Biomedical Data

Name: Applied Mathematics for the Analysis of Biomedical Data | Models, Methods, and MATLAB
Brand: Wiley
Price: 113.99 EUR
Availability: OnlineOnly

Models, Methods, and MATLAB

Peter J. Costa(Author)

Wiley (Publisher)

Published on 21. February 2017

448 pages

E-Book

ePUB with Adobe-DRM

System requirements

978-1-119-26951-9 (ISBN)

€113.99incl. 7% vat

System requirements

for ePUB with Adobe-DRM

E-Book Single Licence

Available for download

Description

Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.

Alles über E-Books, Kopierschutz & Dateiformate finden Sie in unserem Info- & Hilfebereich.

Features a practical approach to the analysis of biomedical data via mathematical methods and provides a MATLAB® toolbox for the collection, visualization, and evaluation of experimental and real-life data Applied Mathematics for the Analysis of Biomedical Data: Models, Methods, and MATLAB® presents a practical approach to the task that biological scientists face when analyzing data. The primary focus is on the application of mathematical models and scientific computing methods to provide insight into the behavior of biological systems. The author draws upon his experience in academia, industry, and government-sponsored research as well as his expertise in MATLAB to produce a suite of computer programs with applications in epidemiology, machine learning, and biostatistics. These models are derived from real-world data and concerns. Among the topics included are the spread of infectious disease (HIV/AIDS) through a population, statistical pattern recognition methods to determine the presence of disease in a diagnostic sample, and the fundamentals of hypothesis testing. In addition, the author uses his professional experiences to present unique case studies whose analyses provide detailed insights into biological systems and the problems inherent in their examination. The book contains a well-developed and tested set of MATLAB functions that act as a general toolbox for practitioners of quantitative biology and biostatistics. This combination of MATLAB functions and practical tips amplifies the book's technical merit and value to industry professionals. Through numerous examples and sample code blocks, the book provides readers with illustrations of MATLAB programming. Moreover, the associated toolbox permits readers to engage in the process of data analysis without needing to delve deeply into the mathematical theory. This gives an accessible view of the material for readers with varied backgrounds. As a result, the book provides a streamlined framework for the development of mathematical models, algorithms, and the corresponding computer code. In addition, the book features: * Real-world computational procedures that can be readily applied to similar problems without the need for keen mathematical acumen * Clear delineation of topics to accelerate access to data analysis * Access to a book companion website containing the MATLAB toolbox created for this book, as well as a Solutions Manual with solutions to selected exercises Applied Mathematics for the Analysis of Biomedical Data: Models, Methods, and MATLAB® is an excellent textbook for students in mathematics, biostatistics, the life and social sciences, and quantitative, computational, and mathematical biology. This book is also an ideal reference for industrial scientists, biostatisticians, product development scientists, and practitioners who use mathematical models of biological systems in biomedical research, medical device development, and pharmaceutical submissions.

More details

Other editions

Person

Content

Preface xi Acknowledgements xiii About the Companion Website xv Introduction xvii 1 Data 1 1.1 Data Visualization 1 1.2 Data Transformations 3 1.3 Data Filtering 7 1.4 Data Clustering 17 1.5 Data Quality and Data Cleaning 25 References 28 2 Some Examples 29 2.1 Glucose-Insulin Interaction 30 2.2 Transition from HIV to AIDS 33 2.3 Real-Time Polymerase Chain Reaction 37 References 45 Further Reading 45 3 SEIR Models 47 3.1 Practical Applications of SEIR Models 50 References 88 Further Reading 90 4 Statistical Pattern Recognition and Classification 93 4.1 Measurements and Data Classes 94 4.2 Data Preparation Normalization and Weighting Matrix 98 4.3 Principal Components 104 4.4 Discriminant Analysis 107 4.5 Regularized Discriminant Analysis and Classification 112 4.6 Minimum Bayes Score Maximum Likelihood and Minimum Bayes Risk 116 4.7 The Confusion Matrix Receiver-Operator Characteristic Curves and Assessment Metrics 122 4.8 An Example 127 4.9 Nonlinear Methods 131 References 139 Further Reading 140 5 Biostatistics and Hypothesis Testing 141 5.1 Hypothesis Testing Framework 142 5.2 Test of Means 157 5.3 Tests of Proportions 179 5.4 Tests of Variances 212 5.5 Other Hypothesis Tests 232 References 268 Further Reading 270 6 Clustered Data and Analysis of Variance 271 6.1 Clustered Matched-Pair Data and Non-Inferiority 273 6.2 Clustered Data Assessment Metrics and Diagnostic Likelihood Ratios 278 6.3 Relative Diagnostic Likelihood Ratios 286 6.4 Analysis of Variance for Clustered Data 291 6.5 Examples for Anova 300 6.6 Bootstrapping and Confidence Intervals 314 References 316 Further Reading 316 Appendix: Mathematical Matters 317 Glossary of MATLAB Functions 335 Index 407

Introduction

The phrase, mathematical analysis of biomedical data, at first glance seems impossibly ambitious. Does the author assert that inherently irregular biological systems can be described with any consistency via the rigid rules of mathematics? Add to this expression applied mathematics and a great deal of skepticism will likely fill the reader's mind. What is the intention of this work?

The answer is, in part, to provide a record of the author's 30-year career in academics, government, and private industry. Much of that career has involved the analysis of biological systems and data via mathematics. More than this, however, is the desire to provide the reader with a set of tools and examples that can be used as a basis for solving the problems he/she is facing. Some uncommon "tricks of the trade" and methodologies rarely broached by university instruction are provided.

Too often, books are written with only an academic audience in mind. This effort is aimed at working scientists and aspiring apprentices. It can be viewed as a combined textbook, reference work, handbook, and user's guide. The program presented here will be example driven. It would be disingenuous to say that the mathematics will not be emphasized (the author is, after all, a mathematician). Nevertheless, each section will be motivated by the underlying biology. Each example will contain the MATLAB® code required to produce a figure, result, and/or numerical table.

The book is guided by the idea that applied mathematical models are iterative. Develop a set of equations to describe a phenomenon, measure its effectiveness against data collected to measure the phenomenon, and then modify the model to improve its accuracy. The focus is on solving real examples by way of a mathematical method. Sophistication is not the primary goal. A symbiosis between the rigors of mathematical techniques and the unpredictable nature of biological systems is the point of emphasis.

The book reflects the formula that "mathematics + data + scientific computing = genuine insight into biological systems." The computing software of choice in this work is MATLAB. The reader can think of MATLAB as another important mathematical tool, akin to the Fourier transform. It (that is, MATLAB) helps transform data into mathematical forms and vice versa.

The presentation of concepts is as follows.

This introduction gives an overview of the book and ends with a representative example of the "mathematics + data + software" paradigm. The first chapter lists a set of guidelines and methods for obtaining, filtering, deciphering, and ultimately analyzing data. These techniques include data visualization, data transformations, data filtering/smoothing, data clustering (i.e., splitting one collection of samples into two or more subclasses), and data quality/data cleaning. In each case, a topic is introduced along with a data set. Mathematical methods used to examine the data are explained. Specific MATLAB programs, developed for use in an industrial setting, are applied to the data. The underlying assumption of this book is that, unlike most academic texts, data must be examined, verified, and/or filtered before a model is applied.

Following the discussion of data, the second chapter provides a view of the utility of differential equations as a modeling method on three distinct medical issues. The interaction of glucose and insulin levels within a human body is described by way of an elementary interaction model. This same approach is applied to the transition of HIV to AIDS within a patient. The HIV/AIDS example portends the susceptible-exposed-infected-recovered/removed models detailed in Chapter 3. The renowned polymerase chain reaction is presented as a coupled set of differential equations. In all of the cases above, the models are either applied to real clinical data or tested for their predictive value. MATLAB functions and code segments are included.

Chapter 3 focuses on mathematical epidemiology. The approach here is decidedly more involved than the examples provided in Chapter 2. The first section concerns a model, built on reported clinical data, that governs the transmission of HIV/AIDS through a population. It is, to the author's knowledge, the only such unified approach to the spread of a contagious disease. The second example within Chapter 3 concerns a mathematical method developed to predict the outbreak of a contagious disease based on simulated data (that mimic clinical data) of respiratory infections recorded at Boston Children's Hospital (BCH). Due to HIPAA (Health Insurance Portability and Accountability Act) laws, the author was unable to include the actual BCH data in the analysis. The simulated data, however, very closely resemble the clinical data. In each case, the need to estimate certain parameters crucial to the overall model reflecting real measurements and recorded populations is emphasized.

The fourth chapter concerns statistical pattern recognition methods used in the classification of human specimen samples for disease identification. Again, the application of mathematics to clinical data is the central focus of the exposition. All of the methods described are applied to data. Numerous figures and MATLAB code segments are included to aid the reader. The chapter ends with a presentation of support vector machines and their applications to the classification problem. Special software, developed at the University of Newcastle (Australia), is used with the permission of the design team to calculate the support vector boundaries. This cooperation itself is an example of how things are done in industry: collaborate with other experts; do not reinvent the wheel.

Chapter 5 is dedicated to a key component of biostatistics: hypothesis testing. The mathematical infrastructure is developed to produce the calculations (sample size, test statistic, hypothesis test, p-values) required by review agencies for submissions. This is an encyclopedic chapter listing the most important hypothesis tests and their variations including equivalence, non-inferiority, and superiority tests. As a point of reference, note that the author is on the organizing committee for the annual statistical issues workshop attended by industrial and review agency scientists, statisticians, and policy analysts. Thus, some of the key statistical matters as presented in these workshops are included in the chapter.

The final chapter examines clustered (that is, multi-reader/multi-category) data and the mathematical methods developed to render scientifically justified conclusions. The techniques include hypothesis testing and analysis of variance on clustered data.

0.1 How to Use This Book

Throughout these chapters, every effort has been made to present the material in as direct and clear a manner as possible. It is assumed that the reader is familiar with elementary differential equations, linear algebra, and statistics. An appendix includes a brief review of the mathematical underpinnings of these subjects. Further, it is hoped that the reader has some familiarity with MATLAB. In order to use the M-files written for this text, the reader must have access to MATLAB and the MATLAB Statistics Toolbox. A summary of most of the pertinent M-files and MAT-files which contain the data sets are provided in the Glossary of MATLAB Functions located at the end of the book. Also within this section is a recommendation for setting up a workspace to access the M- and MAT-files associated with this text. The reader is strongly urged to follow the recommendations contained therein. To gain access to the quadratic programming solver implemented in Chapter 4, the reader must contact Professor Brett Ninness of the University of Newcastle in New South Wales, Australia (http://sigpromu.org/quadprog/index.html). Expertise in any of the aforementioned areas, however, is not crucial to the use and understanding of this work.

The chapters are, by design, independent units. While methods developed in each chapter can be applied throughout the book (especially the chapter on data), each topic can be read without reliance on its predecessor. Whenever possible, it is recommended that the reader have MATLAB at the avail so that the examples can be traced along with the provided code.

0.2 Data and Solutions

With these ideas in mind, a few words about the source of all modeling efforts are presented: data. How does an industrial scientist deal with data? As a theoretical physicist once noted, if the data do not fit your model, change the data. Naturally, this comment was made with tongue firmly implanted in cheek. Here are some guidelines.

Data Normalization. When dealing with time-dependent data, it is often advisable to "center time" based upon the given interval. That is, if t ranges over the discrete values {t0, t1, ., tn}, then make calculations on times which start at 0 by subtracting off the initial time t0. More precisely, map t into t´ = t - t0: tk ? t´k = tk - t0. Similarly, some large measurements (say, population) can be given as 6,123,000, 6,730,000, etc. Rather than reporting such large "raw" numbers (which can cause overflow errors in...

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

Applied Mathematics for the Analysis of Biomedical Data

Description

More details

Other editions

Additional editions

Person

Content

Introduction

0.1 How to Use This Book

0.2 Data and Solutions

System requirements