3D Shape Analysis

Name: 3D Shape Analysis | Fundamentals, Theory, and Applications
Brand: Jossey-Bass
Price: 111.99 EUR
Availability: OnlineOnly

Fundamentals, Theory, and Applications

Hamid Laga Yulan Guo Hedi Tabia Robert B. Fisher Mohammed Bennamoun(Autor*in)

Jossey-Bass (Verlag)

1. Auflage

Erschienen am 14. Dezember 2018

368 Seiten

E-Book

ePUB mit Adobe-DRM

Systemvoraussetzungen

978-1-119-40519-1 (ISBN)

111,99 €inkl. 7% MwSt.

Systemvoraussetzungen

für ePUB mit Adobe-DRM

E-Book Einzellizenz

Als Download verfügbar

Beschreibung

An in-depth description of the state-of-the-art of 3D shape analysis techniques and their applications This book discusses the different topics that come under the title of &quote;3D shape analysis&quote;. It covers the theoretical foundations and the major solutions that have been presented in the literature. It also establishes links between solutions proposed by different communities that studied 3D shape, such as mathematics and statistics, medical imaging, computer vision, and computer graphics. The first part of 3D Shape Analysis: Fundamentals, Theory, and Applications provides a review of the background concepts such as methods for the acquisition and representation of 3D geometries, and the fundamentals of geometry and topology. It specifically covers stereo matching, structured light, and intrinsic vs. extrinsic properties of shape. Parts 2 and 3 present a range of mathematical and algorithmic tools (which are used for e.g., global descriptors, keypoint detectors, local feature descriptors, and algorithms) that are commonly used for the detection, registration, recognition, classification, and retrieval of 3D objects. Both also place strong emphasis on recent techniques motivated by the spread of commodity devices for 3D acquisition. Part 4 demonstrates the use of these techniques in a selection of 3D shape analysis applications. It covers 3D face recognition, object recognition in 3D scenes, and 3D shape retrieval. It also discusses examples of semantic applications and cross domain 3D retrieval, i.e. how to retrieve 3D models using various types of modalities, e.g. sketches and/or images. The book concludes with a summary of the main ideas and discussions of the future trends. 3D Shape Analysis: Fundamentals, Theory, and Applications is an excellent reference for graduate students, researchers, and professionals in different fields of mathematics, computer science, and engineering. It is also ideal for courses in computer vision and computer graphics, as well as for those seeking 3D industrial/commercial solutions.

Weitere Details

Weitere Ausgaben

Inhalt

Cover
Title Page
Copyright
Contents
Preface
Acknowledgments
Chapter 1 Introduction
1.1 Motivation
1.2 The 3D Shape Analysis Problem
1.3 About This Book
1.4 Notation
Part I Foundations
Chapter 2 Basic Elements of 3D Geometry and Topology
2.1 Elements of Differential Geometry
2.1.1 Parametric Curves
2.1.2 Continuous Surfaces
2.1.2.1 Differential Properties of Surfaces
2.1.2.1.1 First Fundamental Form
2.1.2.1.2 Second Fundamental Form and Shape Operator
2.1.2.2 Curvatures
2.1.2.3 Laplace and Laplace-Beltrami Operators
2.1.3 Manifolds, Metrics, and Geodesics
2.1.4 Discrete Surfaces
2.1.4.1 Representations of Discrete Surfaces
2.1.4.2 Mesh Data Structures
2.1.4.3 Discretization of the Differential Properties of Surfaces
2.2 Shape, Shape Transformations, and Deformations
2.2.1 Shape-Preserving Transformations
2.2.1.1 Normalization for Translation
2.2.1.2 Normalization for Scale
2.2.1.3 Normalization for Rotation
2.2.1.3.1 Rotation Normalization Using Principal Component Analysis (PCA)
2.2.1.3.2 Rotation Normalization Using Planar Reflection Symmetry Analysis
2.2.2 Shape Deformations
2.2.3 Bending
2.2.4 Stretching
2.3 Summary and Further Reading
Chapter 3 3D Acquisition and Preprocessing
3.1 Introduction
3.2 3D Acquisition
3.2.1 Contact 3D Acquisition
3.2.1.1 Coordinate Measuring Machine (CMM)
3.2.1.2 Arm-Based 3D Scanner
3.2.2 Noncontact 3D Acquisition
3.2.2.1 Time-of-Flight
3.2.2.1.1 Pulse-Based TOF
3.2.2.1.2 PhaseShift-BasedTOF
3.2.2.2 Triangulation
3.2.2.3 Stereo
3.2.2.4 Structured Light
3.2.2.4.1 Temporal Coded Patterns
3.2.2.4.2 Spatial Coded Patterns
3.2.2.4.3 Direct Coded Patterns
3.2.2.5 Shape from X
3.3 Preprocessing 3D Models
3.3.1 Surface Smoothing and Fairing
3.3.1.1 Laplacian Smoothing
3.3.1.2 Taubin Smoothing
3.3.1.3 Curvature Flow Smoothing
3.3.2 Spherical Parameterization of 3D Surfaces
3.4 Summary and Further Reading
Part II 3D Shape Descriptors
Chapter 4 Global Shape Descriptors
4.1 Introduction
4.2 Distribution-Based Descriptors
4.2.1 Point Sampling
4.2.2 Geometric Features
4.2.2.1 Geometric Attributes
4.2.2.2 Differential Attributes
4.2.3 Signature Construction and Comparison
4.3 View-Based 3D Shape Descriptors
4.3.1 The Light Field Descriptors (LFD)
4.3.2 Feature Extraction
4.3.3 Properties
4.4 Spherical Function-Based Descriptors
4.4.1 Spherical Shape Functions
4.4.2 Comparing Spherical Functions
4.4.2.1 Spherical Harmonic Descriptors
4.4.2.2 Spherical Wavelet Transforms
4.4.2.2.1 Wavelet Coefficients as a Shape Descriptor
4.4.2.2.2 Spherical Wavelet Energy Signatures
4.5 Deep Neural Network-Based 3D Descriptors
4.5.1 CNN-Based Image Descriptors
4.5.2 Multiview CNN for 3D Shapes
4.5.2.1 Network Architecture
4.5.2.2 View Aggregation using CNN
4.5.3 Volumetric CNN
4.6 Summary and Further Reading
Chapter 5 Local Shape Descriptors
5.1 Introduction
5.2 Challenges and Criteria
5.2.1 Challenges
5.2.2 Criteria for 3D Keypoint Detection
5.2.3 Criteria for Local Feature Description
5.3 3D Keypoint Detection
5.3.1 Fixed-Scale Keypoint Detection
5.3.1.1 Curvature-Based Methods
5.3.1.1.1 Local Surface Patch (LSP)
5.3.1.2 Other Surface Variation-Based Methods
5.3.1.2.1 Matei's Method
5.3.1.2.2 Intrinsic Shape Signatures (ISS)
5.3.1.2.3 Harris 3D
5.3.2 Adaptive-Scale Keypoint Detection
5.3.2.1 Extrinsic Scale-Space Based Methods
5.3.2.1.1 3D Shape Filtering
5.3.2.1.2 Multiscale Surface Variation
5.3.2.2 Intrinsic Scale-Space Based Methods
5.3.2.2.1 Scale-Space on 2D Parameterized Images
5.3.2.2.2 Scale-Space on 3D Shapes
5.3.2.2.3 Scale-Space on Transformed Domains
5.4 Local Feature Description
5.4.1 Signature-Based Methods
5.4.1.1 Splash
5.4.1.2 Point Signature
5.4.2 Histogram Based Methods
5.4.2.1 Histogram of Spatial Distributions
5.4.2.1.1 Spin Images
5.4.2.1.2 3D Shape Context
5.4.2.1.3 Intrinsic Shape Signature (ISS)
5.4.2.1.4 Rotational Projection Statistics (RoPS)
5.4.2.2 Histogram of Geometric Attributes
5.4.2.2.1 Point Feature Histograms (PFH)
5.4.2.2.2 Fast Point Feature Histograms (FPFH)
5.4.2.2.3 Signature of Histograms of Orientations (SHOT)
5.4.2.3 Histogram of Oriented Gradients
5.4.3 Covariance-Based Methods
5.5 Feature Aggregation Using Bag of Feature Techniques
5.5.1 Dictionary Construction
5.5.1.1 Feature Extraction
5.5.1.2 Codebook Construction
5.5.2 Coding and Pooling Schemes
5.5.2.1 Sparse Coding
5.5.2.2 Fisher Vectors
5.5.3 Vector of Locally Aggregated Descriptors (VLAD)
5.5.4 Vector of Locally Aggregated Tensors (VLAT)
5.6 Summary and Further Reading
5.6.1 Summary of 3D Keypoint Detection
5.6.2 Summary of Local Feature Description
5.6.3 Summary of Feature Aggregation
Part III 3D Correspondence and Registration
Chapter 6 Rigid Registration
6.1 Introduction
6.2 Coarse Registration
6.2.1 Point Correspondence-Based Registration
6.2.1.1 The Typical Pipeline
6.2.1.2 Transformation Estimation from a Group of Correspondences
6.2.1.3 Transformation Estimation from Three Correspondences
6.2.1.4 Transformation Estimation from Two Correspondences
6.2.1.5 Transformation Estimation from One Correspondence
6.2.2 Line-Based Registration
6.2.2.1 Line Matching Method
6.2.2.2.1 Rotation Estimation
6.2.2.2.2 Translation Estimation
6.2.2.2 Line Clustering Method
6.2.3 Surface-Based Registration
6.2.3.1 Principal Component Analysis (PCA)
6.2.3.2 RANSAC-Based DARCES
6.2.3.3 Four-Points Congruent Sets (4PCS)
6.2.3.3.1 AffineInvariants of 4-Points Set
6.2.3.3.2 Congruent 4-Points Extraction
6.2.3.3.3 The 4PCS Algorithm
6.3 Fine Registration
6.3.1 Iterative Closest Point (ICP)
6.3.1.1 Closest Point Search
6.3.1.2 Transformation Estimation
6.3.1.3 Summary of the ICP Method
6.3.2 ICP Variants
6.3.2.1 Point Selection
6.3.2.2 Point Matching
6.3.2.3 Point Pair Weighting
6.3.2.4 Point Pair Rejection
6.3.2.5 Error Metrics
6.3.3 Coherent Point Drift
6.4 Summary and Further Reading
Chapter 7 Nonrigid Registration
7.1 Introduction
7.2 Problem Formulation
7.3 Mathematical Tools
7.3.1 The Space of Diffeomorphisms
7.3.2 Parameterizing Spaces
7.4 Isometric Correspondence and Registration
7.4.1 Möbius Voting
7.4.2 Examples
7.5 Nonisometric (Elastic) Correspondence and Registration
7.5.1 Surface Deformation Models
7.5.1.1 Linear Deformation Model
7.5.1.2 Elastic Deformation Models
7.5.2 Square-Root Normal Fields (SRNF) Representation
7.5.3 Numerical Inversion of SRNF Maps
7.5.3.1 SRNF Inversion Algorithm
7.5.4 Correspondence
7.5.4.1 Optimization Over SO(3)
7.5.4.2 Optimization Over
7.5.4.3 Differential of ?[184]
7.5.4.4 Initialization of the Gradient[184]
7.5.5 Elastic Registration and Geodesics
7.5.6 Coregistration
7.6 Summary and Further Reading
Chapter 8 Semantic Correspondences
8.1 Introduction
8.2 Mathematical Formulation
8.3 Graph Representation
8.3.1 Characterizing the Local Geometry and the Spatial Relations
8.3.1.1 Unary Descriptors
8.3.1.2 Binary Descriptors
8.3.2 Cross Mesh Pairing of Patches
8.4 Energy Functions for Semantic Labeling
8.4.1 The Data Term
8.4.2 Smoothness Terms
8.4.2.1 Smoothness Constraints
8.4.2.2 Geometric Compatibility
8.4.2.3 Label Compatibility
8.4.3 The Intermesh Term
8.5 Semantic Labeling
8.5.1 Unsupervised Clustering
8.5.2 Learning the Labeling Likelihood
8.5.2.1 GentleBoost Classifier
8.5.2.2 Training GentleBoost Classifiers
8.5.3 Learning the Remaining Parameters
8.5.4 Optimization Using Graph Cuts
8.6 Examples
8.7 Summary and Further Reading
Part IV Applications
Chapter 9 Examples of 3D Semantic Applications
9.1 Introduction
9.2 Semantics: Shape or Status
9.3 Semantics: Class or Identity
9.4 Semantics: Behavior
9.5 Semantics: Position
9.6 Summary and Further Reading
Chapter 10 3D Face Recognition
10.1 Introduction
10.2 3D Face Recognition Tasks, Challenges and Datasets
10.2.1 3D Face Verification
10.2.2 3D Face Identification
10.2.3 3D Face Recognition Challenges
10.2.3.1 Intrinsic Transformations
10.2.3.2 Acquisition Conditions
10.2.3.3 Data Acquisition
10.2.3.4 Computation Time
10.2.4 3D Face Datasets
10.3 3D Face Recognition Methods
10.3.1 Holistic Approaches
10.3.1.1 Eigenfaces and Fisherfaces
10.3.1.1.1 Eigenfaces
10.3.1.1.2 Fisherfaces
10.3.1.2 Iterative Closest Point
10.3.1.3 Hausdorff Distance
10.3.1.4 Canonical Form
10.3.2 Local Feature-Based Matching
10.3.2.1 Keypoint-Based Methods
10.3.2.1.1 Landmark-Based Methods
10.3.2.1.2 SIFT-Like Keypoints
10.3.2.2 Curve-Based Features
10.3.2.3 Patch-Based Features
10.3.2.4 Other Features
10.4 Summary
Chapter 11 Object Recognition in 3D Scenes
11.1 Introduction
11.2 Surface Registration-Based Object Recognition Methods
11.2.1 Feature Matching
11.2.2 Hypothesis Generation
11.2.2.1 Geometric Consistency-Based Hypothesis Generation
11.2.2.2 Pose Clustering-Based Hypothesis Generation
11.2.2.3 Constrained Interpretation Tree-Based Hypothesis Generation
11.2.2.4 RANdom SAmple Consensus-Based Hypothesis Generation
11.2.2.5 Game Theory-Based Hypothesis Generation
11.2.2.5.1 Preliminary on Game Theory
11.2.2.5.2 Matching Game for Transformation Hypothesis Generation
11.2.2.6 Generalized Hough Transform-Based Hypothesis Generation
11.2.3 Hypothesis Verification
11.2.3.1 Individual Verification
11.2.3.2 Global Verification
11.3 Machine Learning-Based Object Recognition Methods
11.3.1 Hough Forest-Based 3D Object Detection
11.3.1.1 3D Local Patch Extraction
11.3.1.2 3D Local Patch Representation
11.3.1.3 Hough Forest Training and Testing
11.3.1.3.1 Offline Training
11.3.1.3.2 Online detection
11.3.2 Deep Learning-Based 3D Object Recognition
11.3.2.1 Hand-crafted Feature-Based Methods
11.3.2.2 2D View-Based Methods
11.3.2.3 3D Voxel-Based Methods
11.3.2.4 3D Point Cloud-Based Methods
11.4 Summary and Further Reading
Chapter 12 3D Shape Retrieval
12.1 Introduction
12.2 Benchmarks and Evaluation Criteria
12.2.1 3D Datasets and Benchmarks
12.2.2 Performance Evaluation Metrics
12.2.2.1 Precision
12.2.2.2 Recall
12.2.2.3 Precision-Recall Curves
12.2.2.4 F- and E-Measures
12.2.2.5 Area Under Curve (AUC) or Average Precision (AP)
12.2.2.6 Mean Average Precision (mAP)
12.2.2.7 Cumulated Gain-Based Measure
12.2.2.8 Nearest Neighbor (NN), First-Tier (FT), and Second-Tier (ST)
12.3 Similarity Measures
12.3.1 Dissimilarity Measures
12.3.2 Hashing and Hamming Distance
12.3.3 Manifold Ranking
12.4 3D Shape Retrieval Algorithms
12.4.1 Using Handcrafted Features
12.4.2 Deep Learning-Based Methods
12.5 Summary and Further Reading
Chapter 13 Cross-domain Retrieval
13.1 Introduction
13.2 Challenges and Datasets
13.2.1 Datasets
13.2.2 Training Data Synthesis
13.2.2.1 Photo Synthesis from 3D Models
13.2.2.2 2D Sketch Synthesis from 3D Models
13.3 Siamese Network for Cross-domain Retrieval
13.4 3D Shape-centric Deep CNN
13.4.1 Embedding Space Construction
13.4.1.1 Principal Component Analysis
13.4.1.2 Multi-dimensional Scaling
13.4.1.3 Kernel-Based Analysis
13.4.2 Learning Shapes from Synthesized Data
13.4.3 Image and Sketch Projection
13.5 Summary and Further Reading
Chapter 14 Conclusions and Perspectives
References
Index
EULA

1
Introduction

1.1 Motivation

Shape analysis is an old topic that has been studied, for many centuries, by scientists from different boards, including philosophers, psychologists, mathematicians, biologists, and artists. However, in the past two decades, we have seen a renewed interest in the field motivated by the recent advances in 3D acquisition, modeling, and visualization technologies, and the substantial increase in the computation and storage power. Nowadays, 3D scanning devices are accessible not only to domain-specific experts but also to the general public. Users can scan the real world at high resolution, using devices that are as cheap as video cameras, edit the 3D data using 3D modeling software, share them across the web, and host them in online repositories that are growing in size and in number. Such repositories can include millions of every day objects, cultural heritage artifacts, buildings, as well as medical, scientific, and engineering models.

The increase in the availability of 3D data comes with new challenges in terms of storage, classification, and retrieval of such data. It also brings unprecedented opportunities for solving long-standing problems; First, the rich variability of 3D content in existing shape repositories makes it possible to directly reuse existing 3D models, in whole or in part, to construct new 3D models with rich variations. In many situations, 3D designers and content creators will no more need to scan or model a 3D object or scene from scratch. They can query existing repositories, retrieve the desired models, and fine-tune their geometry and appearance to suit their needs. This concept of context reuse is not specific to 3D models but has been naturally borrowed from other types of media. For instance, one can translate sentences to different languages by performing cross-language search. Similarly, one can create an image composite or a visual art piece by querying images, copying parts of them and pasting them into their own work.

Second, these large amounts of 3D data can be used to learn computational models that effectively reason about properties and relationships of shapes without relying on hard-coded rules or explicitly programmed instructions. For instance, they can be used to learn 3D shape variation in medical data in order to model physiological abnormalities in anatomical organs, model their natural growth, and learn how shape is affected by disease progression. They can be also used to model 3D shape variability using statistical models, which, in turn, can be used to facilitate 3D model creation with minimum user interaction.

Finally, data-driven methods facilitate high-level shape understanding by discovering geometric and structural patterns among collections of shapes. These patterns can serve as strong priors not only in various geometry processing applications but also in solving long-standing computer vision problems, ranging from low-level 3D reconstruction to high-level scene understanding.

These technological developments and the opportunities they bring have motivated researchers to take a fresh look at the 3D shape analysis problem. Although most of the recent developments are application-driven, many of them aim to answer fundamental, sometimes philosophical, questions such as: What is shape? Can we mathematically formulate the concept of shape? How to compare the shape of objects? How to quantify and localize shape similarities and differences? This book synthesizes the critical mass of 3D shape analysis research that has accumulated over the past 15 years. This rapidly developing field is both profound and broad, with a wide range of applications and many open research questions that are yet to be answered.

1.2 The 3D Shape Analysis Problem

Shape is the external form, outline or surface, of someone or something as opposed to other properties such as color, texture, or material composition.

Source: Wikipedia and Oxford dictionaries.

Humans can easily abstract the form of an object, describe it with a few geometrical attributes or even with words, relate it to the form of another object, and group together, in multiple ways and using various criteria, different objects to form clusters that share some common shape properties. Shape analysis is the general term used to refer to the process of automating these tasks, which are trivial to humans but very challenging to computers. It has been investigated under the umbrella of many applications and has multiple facets. Below, we briefly summarize a few of them.

3D shape retrieval, clustering, and classification. Similar to other types of multimedia information, e.g. text documents, images, and videos, the demand for efficient clustering and classification tools that can organize, automatically or semi-automatically, the continuously expanding collections of 3D models is growing. Likewise, users, whether they are experts, e.g. graphics designers who are increasingly relying on the reuse of existing 3D contents, or novice, will benefit from a search engine that will enable them to search for 3D data of interest in the same way they search for text documents or images.
Correspondence and registration. This problem, which can be summarized as the ability to say which part of an object matches which part on another object, and the ability to align one object onto another, arises in many domains of computer vision, computer graphics, and medical imaging. Probably, one of the most popular examples is the 3D reconstruction problem where usually a 3D object is scanned by multiple sensors positioned at different locations around the object. To build the complete 3D model of the object, one needs to merge the partial scans produced by each sensor. This operation requires a correct alignment, i.e. registration, step that brings all the acquired 3D data into a common coordinate frame. Note also that, in many cases, 3D objects move and deform, in a nonrigid way, during the scanning process. This makes the alignment process even more complex. Another example is in computer graphics where a 3D designer creates a triangulated 3D mesh model, hereinafter referred to as the reference, and assigns to each of its triangular faces some attributes, e.g. color and material properties. The designer then can create additional models with the same attributes but instead of manually setting them, they can be automatically transferred from the reference model if there is a mechanism which finds for each point on the reference model its corresponding points on the other models.
Detection and recognition. This includes the detection of low level features such as corners or regions of high curvatures, as well as the localization and recognition of parts in 3D objects, or objects in 3D scenes. The latter became very popular in the past few years with the availability of cheap 3D scanning devices. In fact, instead of trying to localize and recognize objects in a scene from 2D images, one can develop algorithms that operate on the 3D scans of the scene, eventually acquired using commodity devices. This has the advantage that 3D data are less affected than 2D images by the occlusions and ambiguities, which are inherent to the loss of dimensionality when projecting the 3D world onto 2D images. 3D face and 3D action recognition are, among others, examples of applications that have benefited from the recent advances in 3D technologies.
Measurement and characterization of the geometrical and topological properties of objects on one hand and of the spatial relations between objects on the other hand. This includes the identification of similar regions and finding recurrent patterns within and across 3D objects.
Summarization and exploration of collections of 3D models. Given a set of objects, one would like to compute a representative 3D model, e.g. the average or median shape, as well as other summary statistics such as covariances and modes of variation of their shapes. One would like also to characterize the collection using probability distributions and sample from these distributions new instances of shapes to enrich the collection. In other words, one needs to manipulate 3D models in the same way one manipulates numbers.

Implementing these representative analysis tasks requires solving a set of challenges, and each has been the subject of important research and contributions. The first challenge is the mathematical representation of the shape of objects. 3D models, acquired with laser scanners or created using some modeling software, can be represented with point clouds, polygonal soup models, or as volumetric images. Such representations are suitable for storage and visualization but not for high-level analysis tasks. For instance, scanning the same object from two different viewpoints or using different devices will often result in two different point clouds but the shape remains the same. The challenge is in designing mathematical representations that capture the essence of shape. A good representation should be independent of (or invariant to) the pose of the 3D object, the way it is scanned or modeled, and the way it is stored. It is also important to ensure that two different shapes cannot have the same representation.

Figure 1.1Complexity of the shape similarity problem. (a) Nonrigid deformations. (b) Partial similarity. (c) Semantic similarity.

Second, almost every shape analysis task requires a measure that quantifies shape similarities and differences....

Systemvoraussetzungen

Als PDF speichern Als Link merken

3D Shape Analysis

Beschreibung

Weitere Details

Weitere Ausgaben

Inhalt

1 Introduction

1.1 Motivation

1.2 The 3D Shape Analysis Problem

Systemvoraussetzungen

1
Introduction