Nonlinear Filters

Name: Nonlinear Filters | Theory and Applications
Brand: Wiley
Price: 117.99 EUR
Availability: OnlineOnly

Theory and Applications

Peyman Setoodeh Saeid Habibi Simon Haykin(Author)

Wiley (Publisher)

1st Edition

Published on 4. March 2022

304 pages

E-Book

ePUB with Adobe-DRM

System requirements

978-1-119-07815-9 (ISBN)

€117.99incl. 7% vat

System requirements

for ePUB with Adobe-DRM

E-Book Single Licence

Available for download

Description

Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.

Alles über E-Books, Kopierschutz & Dateiformate finden Sie in unserem Info- & Hilfebereich.

NONLINEAR FILTERS

Discover the utility of using deep learning and (deep) reinforcement learning in deriving filtering algorithms with this insightful and powerful new resource

Nonlinear Filters: Theory and Applications delivers an insightful view on state and parameter estimation by merging ideas from control theory, statistical signal processing, and machine learning. Taking an algorithmic approach, the book covers both classic and machine learning-based filtering algorithms.

Readers of Nonlinear Filters will greatly benefit from the wide spectrum of presented topics including stability, robustness, computability, and algorithmic sufficiency. Readers will also enjoy:

* Organization that allows the book to act as a stand-alone, self-contained reference

* A thorough exploration of the notion of observability, nonlinear observers, and the theory of optimal nonlinear filtering that bridges the gap between different science and engineering disciplines

* A profound account of Bayesian filters including Kalman filter and its variants as well as particle filter

* A rigorous derivation of the smooth variable structure filter as a predictor-corrector estimator formulated based on a stability theorem, used to confine the estimated states within a neighborhood of their true values

* A concise tutorial on deep learning and reinforcement learning

* A detailed presentation of the expectation maximization algorithm and its machine learning-based variants, used for joint state and parameter estimation

* Guidelines for constructing nonparametric Bayesian models from parametric ones

Perfect for researchers, professors, and graduate students in engineering, computer science, applied mathematics, and artificial intelligence, Nonlinear Filters: Theory and Applications will also earn a place in the libraries of those studying or practicing in fields involving pandemic diseases, cybersecurity, information fusion, augmented reality, autonomous driving, urban traffic network, navigation and tracking, robotics, power systems, hybrid technologies, and finance.

More details

Other editions

Persons

Content

List of Figures xiii

List of Table xv

Preface xvii

Acknowledgments xix

Acronyms xxi

1 Introduction 1

1.1 State of a Dynamic System 1

1.2 State Estimation 1

1.3 Construals of Computing 2

1.4 Statistical Modeling 3

1.5 Vision for the Book 4

2 Observability 7

2.1 Introduction 7

2.2 State-Space Model 7

2.3 The Concept of Observability 9

2.4 Observability of Linear Time-Invariant Systems 10

2.4.1 Continuous-Time LTI Systems 10

2.4.2 Discrete-Time LTI Systems 12

2.4.3 Discretization of LTI Systems 14

2.5 Observability of Linear Time-Varying Systems 14

2.5.1 Continuous-Time LTV Systems 14

2.5.2 Discrete-Time LTV Systems 16

2.5.3 Discretization of LTV Systems 17

2.6 Observability of Nonlinear Systems 17

2.6.1 Continuous-Time Nonlinear Systems 18

2.6.2 Discrete-Time Nonlinear Systems 21

2.6.3 Discretization of Nonlinear Systems 22

2.7 Observability of Stochastic Systems 23

2.8 Degree of Observability 25

2.9 Invertibility 26

2.10 Concluding Remarks 27

3 Observers 29

3.1 Introduction 29

3.2 Luenberger Observer 30

3.3 Extended Luenberger-Type Observer 31

3.4 Sliding-Mode Observer 33

3.5 Unknown-Input Observer 35

3.6 Concluding Remarks 39

4 Bayesian Paradigm and Optimal Nonlinear Filtering 41

4.1 Introduction 41

4.2 Bayes' Rule 42

4.3 Optimal Nonlinear Filtering 42

4.4 Fisher Information 45

4.5 Posterior Cramér-Rao Lower Bound 46

4.6 Concluding Remarks 47

5 Kalman Filter 49

5.1 Introduction 49

5.2 Kalman Filter 50

5.3 Kalman Smoother 53

5.4 Information Filter 54

5.5 Extended Kalman Filter 54

5.6 Extended Information Filter 54

5.7 Divided-Difference Filter 54

5.8 Unscented Kalman Filter 60

5.9 Cubature Kalman Filter 60

5.10 Generalized PID Filter 64

5.11 Gaussian-Sum Filter 65

5.12 Applications 67

5.12.1 Information Fusion 67

5.12.2 Augmented Reality 67

5.12.3 Urban Traffic Network 67

5.12.4 Cybersecurity of Power Systems 67

5.12.5 Incidence of Influenza 68

5.12.6 COVID-19 Pandemic 68

5.13 Concluding Remarks 70

6 Particle Filter 71

6.1 Introduction 71

6.2 Monte Carlo Method 72

6.3 Importance Sampling 72

6.4 Sequential Importance Sampling 73

6.5 Resampling 75

6.6 Sample Impoverishment 76

6.7 Choosing the Proposal Distribution 77

6.8 Generic Particle Filter 78

6.9 Applications 81

6.9.1 Simultaneous Localization and Mapping 81

6.10 Concluding Remarks 82

7 Smooth Variable-Structure Filter 85

7.1 Introduction 85

7.2 The Switching Gain 86

7.3 Stability Analysis 90

7.4 Smoothing Subspace 93

7.5 Filter Corrective Term for Linear Systems 96

7.6 Filter Corrective Term for Nonlinear Systems 102

7.7 Bias Compensation 105

7.8 The Secondary Performance Indicator 107

7.9 Second-Order Smooth Variable Structure Filter 108

7.10 Optimal Smoothing Boundary Design 108

7.11 Combination of SVSF with Other Filters 110

7.12 Applications 110

7.12.1 Multiple Target Tracking 111

7.12.2 Battery State-of-Charge Estimation 111

7.12.3 Robotics 111

7.13 Concluding Remarks 111

8 Deep Learning 113

8.1 Introduction 113

8.2 Gradient Descent 114

8.3 Stochastic Gradient Descent 115

8.4 Natural Gradient Descent 119

8.5 Neural Networks 120

8.6 Backpropagation 122

8.7 Backpropagation Through Time 122

8.8 Regularization 122

8.9 Initialization 125

8.10 Convolutional Neural Network 125

8.11 Long Short-Term Memory 127

8.12 Hebbian Learning 129

8.13 Gibbs Sampling 131

8.14 Boltzmann Machine 131

8.15 Autoencoder 135

8.16 Generative Adversarial Network 136

8.17 Transformer 137

8.18 Concluding Remarks 139

9 Deep Learning-Based Filters 141

9.1 Introduction 141

9.2 Variational Inference 142

9.3 Amortized Variational Inference 144

9.4 Deep Kalman Filter 144

9.5 Backpropagation Kalman Filter 146

9.6 Differentiable Particle Filter 148

9.7 Deep Rao-Blackwellized Particle Filter 152

9.8 Deep Variational Bayes Filter 158

9.9 Kalman Variational Autoencoder 167

9.10 Deep Variational Information Bottleneck 172

9.11 Wasserstein Distributionally Robust Kalman Filter 176

9.12 Hierarchical Invertible Neural Transport 178

9.13 Applications 182

9.13.1 Prediction of Drug Effect 182

9.13.2 Autonomous Driving 183

9.14 Concluding Remarks 183

10 Expectation Maximization 185

10.1 Introduction 185

10.2 Expectation Maximization Algorithm 185

10.3 Particle Expectation Maximization 188

10.4 Expectation Maximization for Gaussian Mixture Models 190

10.5 Neural Expectation Maximization 191

10.6 Relational Neural Expectation Maximization 194

10.7 Variational Filtering Expectation Maximization 196

10.8 Amortized Variational Filtering Expectation Maximization 198

10.9 Applications 199

10.9.1 Stochastic Volatility 199

10.9.2 Physical Reasoning 200

10.9.3 Speech, Music, and Video Modeling 200

10.10 Concluding Remarks 201

11 Reinforcement Learning-Based Filter 203

11.1 Introduction 203

11.2 Reinforcement Learning 204

11.3 Variational Inference as Reinforcement Learning 207

11.4 Application 210

11.4.1 Battery State-of-Charge Estimation 210

11.5 Concluding Remarks 210

12 Nonparametric Bayesian Models 213

12.1 Introduction 213

12.2 Parametric vs Nonparametric Models 213

12.3 Measure-Theoretic Probability 214

12.4 Exchangeability 219

12.5 Kolmogorov Extension Theorem 221

12.6 Extension of Bayesian Models 223

12.7 Conjugacy 224

12.8 Construction of Nonparametric Bayesian Models 226

12.9 Posterior Computability 227

12.10 Algorithmic Sufficiency 228

12.11 Applications 232

12.11.1 Multiple Object Tracking 233

12.11.2 Data-Driven Probabilistic Optimal Power Flow 233

12.11.3 Analyzing Single-Molecule Tracks 233

12.12 Concluding Remarks 233

References 235

Index 253

1
Introduction

1.1 State of a Dynamic System

In many branches of science and engineering, deriving a probabilistic model for sequential data plays a key role. System theory provides guidelines for studying the underlying dynamics of sequential data (time series). In describing a dynamic system, the notion of state is a key concept [1]:

According to the principle of causality, any dynamic system may be described from the state perspective. Deploying a state-transition model allows for determining the future state of a system, , at any time instant , given its initial state, , at time instant as well as the inputs to the system, , for . The output of the system, , is a function of the state, which can be computed using a measurement model. In this regard, state-space models are powerful tools for analysis and control of dynamic systems.

1.2 State Estimation

Observability is a key concept in system theory, which refers to the ability to reconstruct the hidden or latent state variables that cannot be directly measured, from the measured variables in the minimum possible length of time [1]. In building state-space models, two key questions deserve special attention [2]:

(i) Is it possible to identify the governing dynamics from data?
(ii) Is it possible to perform inference from observables to the latent state variables?

At time instant , the inference problem to be solved is to find the estimate of in the presence of noise, which is denoted by . Depending of the value of , estimation algorithms are categorized into three groups [3]:

(i) Prediction: ,
(ii) Filtering: ,
(iii) Smoothing: .

Regarding the mentioned two challenging questions, in order to improve performance, sophisticated representations can be deployed for the system under study. However, the corresponding inference algorithms may become computationally demanding. Hence, for designing efficient data-driven inference algorithms, the following points must be taken into account [2]:

(i) The underlying assumptions for building a state-space model must allow for reliable system identification and plausible long-term prediction of the system behavior.
(ii) The inference mechanism must be able to capture rich dependencies.
(iii) The algorithm must be able to inherit the merit of learning machines to be trainable on raw data such as sensory inputs in a control system.
(iv) The algorithm must be scalable to big data regarding the optimization of model parameters based on the stochastic gradient descent method.

Regarding the important role of computation in inference problems, Section 1.3 provides a brief account of the foundations of computing.

1.3 Construals of Computing

According to [4], a comprehensive theory of computing must meet three criteria:

(i) Empirical criterion: Doing justice to practice by keeping the analysis grounded in real-world examples.
(ii) Conceptual criterion: Being understandable in terms of what it says, where it comes from, and what it costs.
(iii) Cognitive criterion: Providing an intelligible foundation for the computational theory of mind that underlies both artificial intelligence and cognitive science.

Following this line of thinking, it was proposed in [4] to distinguish the following construals of computation:

Formal symbol manipulation is rooted in formal logic and metamathematics. The idea is to build machines that are capable of manipulating symbolic or meaningful expressions regardless of their interpretation or semantic content.
Effective computability deals with the question of what can be done, and how hard it is to do it mechanically.
Execution of an algorithm or rule following focuses on what is involved in following a set of rules or instructions, and what behavior would be produced.
Calculation of a function considers the behavior of producing the value of a mathematical function as output, when a set of arguments is given as input.
Digital state machine is based on the idea of a finite-state automaton.
Information processing focuses on what is involved in storing, manipulating, displaying, and trafficking of information.
Physical symbol systems is based on the idea that the way computers interact with symbols depends on their mutual physical embodiment. In this regard, computers may be assumed to be made of symbols.
Dynamics must be taken into account in terms of the roles that nonlinear elements, attractors, criticality, and emergence play in computing.
Interactive agents are capable of interacting and communicating with other agents and even people.
Self-organizing or complex adaptive systems are capable of adjusting their organization or structure in response to changes in their environment in order to survive and improve their performance.
Physical implementation emphasizes on the occurrence of computational practice in real-world systems.

1.4 Statistical Modeling

Statistical modeling aims at extracting information about the underlying data mechanism that allows for making predictions. Then, such predictions can be used to make decisions. There are two cultures in deploying statistical models for data analysis [5]:

Data modeling culture is based on the idea that a given stochastic model generates the data.
Algorithmic modeling culture uses algorithmic models to deal with an unknown data mechanism.

An algorithmic approach has the advantage of being able to handle large complex datasets. Moreover, it can avoid irrelevant theories or questionable conclusions.

Figure 1.1 The encoder of an asymmetric autoencoder plays the role of a nonlinear filter.

Taking an algorithmic approach, in machine learning, statistical models can be classified as [6]:

(i) Generative models predict visible effects from hidden causes, .
(ii) Discriminative models infer hidden causes from visible effects, .

While the former is associated with the measurement process in a state-space model, the latter is associated with the state estimation or filtering problem. Deploying machine learning, a wide range of filtering algorithms can be developed that are able to learn the corresponding state-space models. For instance, an asymmetric autoencoder can be designed by combining a generative model and a discriminative model as shown in Figure 1.1 [7]. Deep neural networks can be used to implement both the encoder and the decoder. Then, the resulting autoencoder can be trained in an unsupervised manner. After training, the encoder can be used as a filter, which estimates the latent state variables.

1.5 Vision for the Book

This book provides an algorithmic perspective on the nonlinear state/parameter estimation problem for discrete-time systems, where measurements are available at discrete sampling times and estimators are implemented using digital processors. In Chapter 2, guidelines are provided for discretizing continuous-time linear and nonlinear state-space models. The rest of the book is organized as follows:

Chapter 2 presents the notion of observability for deterministic and stochastic systems.

Chapters 3-7 cover classic estimation algorithms:

Chapter 3 is dedicated to observers as state estimators for deterministic systems.
Chapter 4 presents the general formulation of the optimal Bayesian filtering for stochastic systems.
Chapter 5 covers the Kalman filter as the optimal Bayesian filter in the sense of minimizing the mean-square estimation error for linear systems with Gaussian noise. Moreover, Kalman filter variants are presented that extend its applicability to nonlinear or non-Gaussian cases.
Chapter 6 covers the particle filter, which handles severe nonlinearity and non-Gaussianity by approximating the corresponding distributions using a set of particles (random samples).
Chapter 7 covers the smooth variable-structure filter, which provides robustness against bounded uncertainties and noise. In addition to the innovation vector, this filter benefits from a secondary set of performance indicators.

Chapters 8-11 cover learning-based estimation algorithms:

Chapter 8 covers the basics of deep learning.
Chapter 9 covers deep-learning-based filtering algorithms using supervised and unsupervised learning.
Chapter 10 presents the expectation maximization algorithm and its variants, which are used for joint state and parameter estimation.
Chapter 11 presents the reinforcement learning-based filter, which is built on viewing variational inference and reinforcement learning as instances of a generic expectation maximization problem.

The last chapter is dedicated to nonparametric Bayesian models:

Chapter 12...

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

Nonlinear Filters

Description

More details

Other editions

Additional editions

Persons

Content

1
Introduction

1.1 State of a Dynamic System

1.2 State Estimation

1.3 Construals of Computing

1.4 Statistical Modeling

1.5 Vision for the Book

System requirements