
MultiMedia Modeling
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
More details
Other editions
Additional editions

Content
- Intro
- Preface
- Organization
- Contents - Part II
- Contents - Part I
- Special Session Poster Papers (continued)
- Transfer Nonnegative Matrix Factorization for Image Representation
- 1 Introduction
- 2 Related Work
- 3 Preliminaries
- 3.1 Nonnegative Matrix Factorization
- 3.2 Hessian Regularization
- 4 Transfer Nonnegative Matrix Factorization
- 4.1 Problem Definition
- 4.2 Proposed Approach
- 4.3 Optimization
- 5 Experiments
- 5.1 Dataset Description
- 5.2 Performance on Cross-Domain Datasets
- 6 Conclusion
- References
- Sentiment Analysis on Multi-View Social Data
- 1 Introduction
- 2 Related Works
- 2.1 Sentiment Analysis Datasets
- 2.2 Sentiment Analysis Approaches
- 3 The MVSA Dataset
- 3.1 Data Collection and Annotation
- 3.2 Data Analysis
- 4 Predicting Sentiment in Multi-view Data
- 4.1 Text-Based Approaches
- 4.2 Visual-Based Approaches
- 4.3 Multi-view Sentiment Analysis
- 5 Experiments
- 5.1 Results on Textual Messages
- 5.2 Results on Images
- 5.3 Results on Multi-View Data
- 6 Conclusion and Future Work
- References
- Single Image Super-Resolution via Convolutional Neural Network and Total Variation Regularization
- Abstract
- 1 Introduction
- 2 Overview of the SR Algorithm
- 3 Convolutional Neural Network for SR
- 3.1 Training Set Generation
- 3.2 Convolutional Neural Network for SR
- 4 Regularization Constraints for SR
- 4.1 Non-Local Similarity Regularization Constraint
- 4.2 Local Similarity Regularization Constraint
- 4.3 Fundamental Formula
- 5 Experimental Results
- 5.1 Datasets
- 5.2 Results and Comparison
- 6 Conclusion
- References
- An Effective Face Verification Algorithm to Fuse Complete Features in Convolutional Neural Network
- Abstract
- 1 Introduction
- 2 Related Work
- 3 Methodology
- 3.1 Network Structure
- 3.2 Feature Extraction
- 3.3 Verification
- 4 Experiments
- 5 Conclusion
- References
- Driver Fatigue Detection System Based on DSP Platform
- Abstract
- 1 Introduction
- 2 Methodology
- 2.1 Face Detection
- 2.2 Eye Detection
- 2.3 Eye State Estimation
- 3 Experiment and Results
- 3.1 Experiment Setting
- 3.2 Experimental Results
- 4 Conclusion
- References
- Real-Time Grayscale-Thermal Tracking via Laplacian Sparse Representation
- 1 Introduction
- 2 Related Work
- 3 Bayesian Filtering for Object Tracking
- 4 Observation Model
- 4.1 Laplacian Sparse Representation
- 4.2 Candidate Likelihood
- 5 Experiments
- 5.1 Evaluation Settings
- 5.2 Evaluation Metrics
- 5.3 Comparison Results
- 5.4 Component Analysis
- 6 Conclusion
- References
- Efficient Perceptual Region Detector Based on Object Boundary
- 1 Introduction
- 2 Related Work
- 2.1 Superpixel
- 2.2 Local Detectors
- 2.3 General Object Proposal
- 3 CAR: The Method
- 3.1 Contour-Aware Superpixel (CAS)
- 3.2 Perceptual Regions Detection
- 4 Experiments
- 4.1 Under-Segmentation Error
- 4.2 Boundary Recall
- 4.3 CAR Detector Repeatability
- 5 Conclusions
- References
- 1D Barcode Region Detection Based on the Hough Transform and Support Vector Machine
- Abstract
- 1 Introduction
- 2 Proposed Method
- 2.1 Barcode Detection
- 2.2 Support Vector Machine
- 2.3 Hough Transform
- 2.4 Using the SVM Classifier to Judge Pieces of the Image
- 2.5 Post-processing
- 3 Experiments and Results Analysis
- 3.1 Datasets
- 3.2 Result
- 4 Conclusion
- Acknowledgment
- References
- Special Session Papers
- Client-Driven Strategy of Large-Scale Scene Streaming
- 1 Introduction
- 2 Related Works
- 3 Overview
- 4 Multiple-resolution 3D Space Adaptive Grid Creation
- 5 Scene Streaming Assemble Strategy
- 5.1 Dynamic Double Layer AOI (D-DLAOI)
- 5.2 Object Priority Determination and LOD Resolution
- 6 Experimental Results
- 7 Conclusion and Future Work
- References
- SELSH: A Hashing Scheme for Approximate Similarity Search with Early Stop Condition
- 1 Introduction
- 2 Preliminaries
- 2.1 Problem Definition
- 2.2 Notations
- 3 Related Work
- 4 Our Method
- 4.1 LSH Function
- 4.2 Distance Measure, Linear Order and Early Stop Condition
- 4.3 Index Structure
- 4.4 Search Process
- 4.5 Complexity Analysis
- 5 Experimental Results
- 5.1 Set up
- 5.2 Selection of Appropriate Parameters
- 5.3 Comparison with SK-LSH and C2LSH
- 6 Conclusion
- References
- Learning Hough Transform with Latent Structures for Joint Object Detection and Pose Estimation
- 1 Introduction
- 2 Related Work
- 3 Our Approach
- 3.1 Hough-Based Object Detection
- 3.2 Latent Deformable Feature Model
- 3.3 Multiple Instance Learning for M2HT
- 4 Experiment
- 4.1 Experiment Settings
- 4.2 Object Detection
- 4.3 Car Pose Estimation
- 5 Conclusion
- References
- Consensus Guided Multiple Match Removal for Geometry Verification in Image Retrieval
- 1 Introduction
- 2 Approximate Feature Matching
- 3 Geometric Verification by Hough Voting
- 4 Consensus Guided Multiple Match Removal
- 5 Experiments
- 5.1 Datasets and Evaluation
- 5.2 Experimental Setup
- 5.3 Experimental Results
- 6 Conclusion
- References
- Locality Constrained Sparse Representation for Cat Recognition
- 1 Introduction
- 2 The Methodology
- 2.1 Overview
- 2.2 Problem Formalism for Sparse Feature Representation
- 2.3 Supervised Dictionary Learning Approach
- 2.4 Recognition Task
- 3 Experimental Results
- 3.1 Dataset and Experimental Settings
- 3.2 Performance Evaluation
- 4 Conclusion and Future Work
- References
- User Profiling by Combining Topic Modeling and Pointwise Mutual Information (TM-PMI)
- Abstract
- 1 Introduction
- 2 The Proposed Approach
- 2.1 Framework of the Proposed Approach
- 2.2 Data Preprocessing
- 2.3 LDA from Description of User Pins
- 2.4 Pointwise Mutual Information (PMI)
- 2.5 Personal Topic Words Extraction
- 2.6 Pins Recommended Based User Profile
- 3 Experiments
- 3.1 Dataset
- 3.2 Perplexity
- 3.3 User Study
- 3.4 Influence of the Number of Topic Words on the Result
- 4 Conclusion and Future Work
- References
- Image Retrieval Using Color-Aware Tag on Progressive Image Search and Recommendation System
- 1 Introduction
- 2 Related Works and Preliminaries
- 2.1 Related Works
- 2.2 PISAR System and WAS Algorithm
- 3 CAT Algorithm
- 3.1 Offline Phase
- 3.2 Online Phase
- 4 Experiment
- 4.1 Experimental Environment
- 4.2 Optimizing the Parameters in the CAT Algorithm
- 4.3 Example of Image Retrieval
- 4.4 Image Retrieval Result
- 5 Conclusions
- References
- Advancing Iterative Quantization Hashing Using Isotropic Prior
- 1 Introduction
- 2 Related Work
- 3 Isotropic Iterative Quantization
- 3.1 Preliminaries
- 3.2 Improving ITQ Using the Isotropic Prior
- 4 Experiments
- 5 Conclusion
- References
- An Improved RANSAC Image Stitching Algorithm Based Similarity Degree
- Abstract
- 1 Introduction
- 2 The Improved RANSAC Based Similarity Degree
- 2.1 Transformation Matrix of Image Registration
- 2.2 Calculation Method for RANSAC Sampled
- 2.3 Feature Points Matching in Coarse Matching Step
- 3 Pretreatment
- 4 Experimental Results and Analysis
- 5 Conclusions
- References
- A Novel Emotional Saliency Map to Model Emotional Attention Mechanism
- Abstract
- 1 Introduction
- 2 Emotional Saliency Map
- 2.1 Color Emotion Space
- 2.2 Emotional Saliency Map Computation
- 3 Experiments
- 3.1 Data Set and Error Measure
- 3.2 Experiments on Horror Image Set
- 3.3 Experiments on MS Image Set
- 4 Conclusion
- Acknowlegment
- References
- Automatic Endmember Extraction Using Pixel Purity Index for Hyperspectral Imagery
- Abstract
- 1 Introduction
- 2 Pixel Purity Index
- 3 Automatic Endmember Extraction Using Pixel Purity Index
- 3.1 Determining the Number of Endmembers Based on Noise Subspace Projection
- 3.2 Data Dimensionality Reduction by Improving Noise Covariance Matrix (NCM) Estimation for MNF Transformation
- 3.3 Experimental Results and Analysis
- 4 Conclusions
- Acknowledgment
- References
- A Fast 3D Indoor-Localization Approach Based on Video Queries
- 1 Introduction
- 2 Related Work
- 3 Fast 3D Indoor-Localization
- 3.1 Pipeline
- 3.2 Deblurring Query Images
- 3.3 Interactive Foreground Segmentation
- 3.4 Dynamic Scene Query for Localization
- 4 Graph Matching Verification
- 5 Experiments
- 6 Conclusion
- References
- Smart Ambient Sound Analysis via Structured Statistical Modeling
- 1 Introduction
- 2 Multilayer Based Ambient Sound Understanding
- 2.1 Audio Preprocessing
- 2.2 Structured Environmental Sound Modelling
- 2.3 Segment Based Adaptation
- 2.4 Audio Concept Estimation Using SVM
- 3 Experimental Configuration
- 3.1 Data Collections
- 3.2 Methodology and Evaluation Metrics
- 3.3 Competitors for Performance Comparison
- 4 Experiment Results
- 5 Conclusions
- References
- Discriminant Manifold Learning via Sparse Coding for Image Analysis
- 1 Introduction
- 2 Discriminant Manifold Learning via Sparse Coding (DML_SC)
- 2.1 Motivation
- 2.2 Dictionary Learning and Feature Regrouping
- 2.3 Graph Embedding
- 3 Experiment Results
- 3.1 Data Preparation and Representation
- 3.2 Face Recognition Results
- 3.3 Clustering Experiment on COIL20 Database
- 4 Conclusion
- References
- A Very Deep Sequences Learning Approach for Human Action Recognition
- Abstract
- 1 Introduction
- 2 Related Work
- 2.1 Convolutional Neural Networks
- 2.2 Long Short-Term Memory Block
- 2.3 Recent Researches
- 3 Essential Knowledge
- 3.1 Convolutional Neural Networks
- 3.2 Sequence Features Extraction
- 4 Multi-Feature Framework
- 4.1 Experiments Details
- 5 Evaluation
- 5.1 Single Frame Models
- 5.2 Sequence-Related Models
- 5.3 Fusion Models
- 5.4 Discussion of the Time Cost
- 6 Conclusions
- Acknowledgements
- References
- Attribute Discovery for Person Re-Identification
- 1 Introduction
- 2 Related Work
- 3 The Proposed Method
- 3.1 Key Idea: Ideal Case
- 3.2 Method Detail: Real Case
- 4 Experiments
- 4.1 Experiment Setup
- 4.2 Result on PRID
- 4.3 Result on VIPeR
- 5 Conclusion
- References
- What are the Limits to Time Series Based Recognition of Semantic Concepts?
- 1 Introduction and Background
- 2 Experimental Datasets
- 3 Methods
- 4 Results
- 5 Discussion
- 6 Conclusions
- References
- Ten Research Questions for Scalable Multimedia Analytics
- 1 Introduction
- 1.1 Multimedia Analytics
- 1.2 Scalability Challenges
- 1.3 Scalable Multimedia Analytics
- 1.4 Contributions of this Paper
- 2 Multimedia Analytics
- 2.1 From Multimedia Analysis: Multimedia Representation
- 2.2 From Visual Analytics: Multimedia Analytics Process
- 2.3 Scalability Considerations
- 3 Database Support
- 3.1 Multimedia Search
- 3.2 Analytics Workloads
- 3.3 Database Management
- 4 Research Questions
- 4.1 Volume
- 4.2 Variety
- 4.3 Velocity
- 4.4 Visual Interaction
- 5 Conclusions
- References
- Shaping-Up Multimedia Analytics: Needs and Expectations of Media Professionals
- 1 Introduction
- 2 Methodology
- 2.1 The UCD Approach
- 2.2 Interview Stage #1: Knowing the Crowd
- 2.3 Interview Stage #2: Assessing Design Decisions
- 3 Perceived Usefulness of Analytics Functionalities
- 3.1 Transcripts, Keywords and Entities
- 3.2 Social Networks and Opinions
- 3.3 Links and Recommendation
- 3.4 Fast Access to Information
- 4 Lessons for Multimedia Analytics
- References
- Informed Perspectives on Human Annotation Using Neural Signals
- 1 Introduction
- 2 Background to EEG in Annotation Tasks
- 3 User Annotation Experiment
- 3.1 EEG Setup and Configuration
- 3.2 Experiment Outline
- 4 Results of the User Annotation Experiment
- 5 Perspectives on the Experiment and Suggestions
- References
- Demo Session Papers
- GrillCam: A Real-Time Eating Action Recognition System
- 1 Introduction
- 2 System Overview
- 3 Experiments
- 4 Conclusions
- References
- Searching in Video Collections Using Sketches and Sample Images -- The Cineast System
- 1 Introduction
- 2 Cineast
- 2.1 Architecture
- 2.2 Features
- 2.3 User Interaction
- 2.4 Implementation
- 3 Cineast in Action
- 4 Conclusion
- References
- LoggerMan, a Comprehensive Logging and Visualization Tool to Capture Computer Usage
- 1 Introduction
- 2 Background
- 3 LoggerMan Overview
- 3.1 Privacy and Technical Specification
- 3.2 Modules
- 3.3 Reporting and Insight
- 4 Demonstration and Evaluation
- 5 Conclusions
- References
- E2SGM: Event Enrichment and Summarization by Graph Model
- 1 Introduction
- 2 Our Solution
- 2.1 Dataset
- 2.2 Coarse Query
- 2.3 Graph-Based Ranking
- 2.4 Illustration
- 3 Implement and Result
- 3.1 Implement Configuration
- 3.2 Results
- 4 Conclusions
- References
- METU-MMDS: An Intelligent Multimedia Database System for Multimodal Content Extraction and Querying
- 1 Introduction
- 2 System Architecture
- 2.1 Semantic Content Extraction
- 2.2 Storage and Retrieval
- 3 Demonstration Details
- 4 Conclusion
- References
- Applying Visual User Interest Profiles for Recommendation and Personalisation
- 1 Introduction
- 2 Visual User Interest Modeling
- 2.1 Visual Feature Extraction
- 3 Utilising the Visual Profile
- 4 Applications of the Prototype Visual Profile
- 4.1 Hotel Booking System: An Example Application
- 5 Conclusion and Future Work
- References
- Cross-Modal Fashion Search
- 1 Introduction
- 2 Functionality Overview
- 3 Methodology
- 4 Conclusions
- References
- Video Browser Showdown
- IMOTION -- Searching for Video Sequences Using Multi-Shot Sketch Queries
- 1 Introduction
- 2 The IMOTION System
- 2.1 Architecture
- 2.2 Implementation
- 3 New Functionality and User Interaction
- 3.1 Multi-Shot Queries
- 3.2 Object Recognition and Retrieval
- 3.3 Result Limitation and Collaborative Search
- 3.4 Result Presentation and Browsing
- 3.5 Additional and Improved Video Features
- 4 Conclusions
- References
- iAutoMotion -- an Autonomous Content-Based Video Retrieval Engine
- 1 Introduction
- 2 System Architecture
- 2.1 Perspective Undistortion and Color Correction
- 2.2 Visual Search and Text Search
- 2.3 Query Composition
- 2.4 Result Aggregator and Submitter
- 3 User Interaction
- 3.1 Setup and Configuration
- 3.2 Query Initialization
- 4 Implementation
- 5 Conclusions
- References
- Selecting User Generated Content for Use in Media Productions
- 1 Introduction
- 2 Related Work
- 3 Content Preparation
- 4 Browsing User Interface
- 5 Conclusion
- References
- VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval
- Abstract
- 1 Introduction
- 2 Video Retrieval System
- 2.1 Video Temporal Segmentation
- 2.2 Visual Similarity Search
- 2.3 Object-Based Visual Search
- 2.4 High Level Concept Retrieval Module
- 2.5 Hierarchical Clustering
- 3 VERGE Interface and Interaction Modes
- 4 Future Work
- Acknowledgements
- References
- Collaborative Video Search Combining Video Retrieval with Human-Based Visual Inspection
- 1 Introduction and Related Work
- 2 Proposed Approach
- 2.1 Content-Based Video Retrieval (CBVR) Tool
- 2.2 Tablet App
- 2.3 Collaboration Mechanism
- References
- Multi-sketch Semantic Video Browser
- 1 Introduction
- 2 Multi-sketch Filtering
- 2.1 Feature Signatures
- 2.2 Edge Histograms
- 3 Browsing
- 3.1 Semantic Similarity Search
- 3.2 Browsing Features
- 4 Conclusion
- References
- Faceted Navigation for Browsing Large Video Collection
- 1 Introduction
- 2 Video Segmentation and Representation
- 3 Visual Content Analysis
- 4 Faceted Navigation
- 5 Browsing Interface
- References
- Navigating a Graph of Scenes for Exploring Large Video Collections
- Abstract
- 1 Introduction
- 2 Generating Semantic Features
- 3 Enriching a Graph of Scenes with Semantic Information
- 4 Projecting and Navigating a Graph in a 2D Manner
- 5 Vibro System
- References
- Mental Visual Browsing
- 1 Introduction
- 2 Video Visual Features
- 3 Statistical Feedback Model
- 3.1 Update
- 3.2 Click
- 3.3 Display
- References
- Author Index
System requirements
File format: PDF
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use the free software Adobe Reader, Adobe Digital Editions, or any other PDF viewer of your choice (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or another reading app for eBooks, e.g., PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Watermark-DRM, a „soft” copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.