
Database Systems for Advanced Applications
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
The volume contains six workshops, each focusing on specific research issues that contribute to the main themes of the DASFAA conference: The First International Workshop on Graph-structured Data Bases (GDB 2011); the First International Workshop on Spatial Information Modeling, Management and Mining (SIM3 2011); the International Workshop on Flash-based Database Systems (FlashDB 2011); the Second International Workshop on Social Networks and Social Media Mining on the Web (SNSMW 2011); the First International Workshop on Data Management for Emerging Network Infrastructures (DaMEN 2011); and the Fourth International Workshop on Data Quality in Integration Systems (DQIS 2011).
More details
Other editions
Additional editions

Content
- Title
- Preface
- Organization
- Table of Contents
- The 1st International Workshop on Graph-structured Data Bases (GDB 2011)
- Invited Talk
- Privacy-Preserved Network Data Publishing
- Systems
- Towards Efficient Subgraph Search in Cloud Computing Environments
- Introduction
- Related Work
- Graph Search and Indexing
- MapReduce-Based Computing
- Cloud-Based Subgraph Search: An Overview
- Implementation Techniques
- Index Building
- Subgraph Search
- Experimental Evaluation
- Experimental Settings
- Experimental Results
- Conclusion and Future Work
- References
- Latency-Optimal Walks in Replicated and Partitioned Graphs
- Introduction
- Problem
- Definitions
- Optimal Partition Walks
- Fast-Forward-Search
- System Model
- Query Model
- Fast-Forward-Search
- Cost Analysis
- Lowering the Fan-Out
- Proof of Optimality
- Evaluation
- Generating Graph Partitionings
- Experimental Setup
- Experimental Results
- Related Work
- Conclusion
- References
- Graph-Based Matching of Composite OWL-S Services
- Introduction
- Graph Representation of OWL-S Processes
- Matching OWL-S Processes
- Matching Atomic Components
- Matching Process Structure
- Experimental Evaluation
- Related Work
- Conclusions and Future Work
- References
- Theories
- Design Non-recursive and Redundant-Free XML Conceptual Schema with Hypergraph
- Introduction
- Related Work
- Methodology
- Conclusion
- References
- Classifying Graphs Using Theoretical Metrics: A Study of Feasibility
- Introduction
- Representing Graphs with Theoretical Metrics
- Graph-Theoretical Metrics
- Feature Selection and Graph Transformation
- Graph Classification
- Experimental Evaluations
- Tasks and Data Sets
- Results
- Related Work
- Conclusions
- References
- The First International Workshop on Spatial Information Modeling, Management and Mining (SIM3)
- Spatial Data Management: Compression, Storage and Query
- A GML Documents Stream Compressor
- Introduction
- Related Work
- The GDScomp Method
- GDScomp Architecture
- Event Handler
- Dynamic Structure Compression
- Delta Compression
- Experimental Evaluation
- Compression Ratio
- Compression Time and Decompression Time
- Conclusion and Future Work
- References
- A Query-Friendly Compression for GML Documents
- Introduction
- Background
- An Example of GML Documents
- An Example of GML Queries
- Query-Friendly GML Compression
- Compression Model
- SAX Event Dictionary.
- SAX Events Hierarchy.
- SAX Event Wavelet Tree.
- Document Content Blocks.
- Query Resolution Process
- Compression Algorithm
- Conclusion
- References
- Storing GML Documents: A Model-Mapping Based Approach
- Introduction
- Related Work
- Model-Mapping Storage Method Based on Nodes and Edges
- GML Document Storing Architecture
- GML Document Data Schema
- Constructing of GML Document Tree
- GML Document Database Model
- The Experimental Analyzing of GML Document Data Storing Time
- GML Query Processing
- Experimental Analysis
- Conclusion
- References
- GML Data Management: Framework and Prototype
- Introduction
- GML Structure and Query Language
- GML Structure and Model
- GML Query Language
- The Framework of GML Data Management
- The Processing Center
- Storing GML Data in Object-Relational Database
- GQL Query in Object-Relation Database
- GQL Processor
- GML Indexing
- The Prototype
- Conclusion and Future Work
- References
- An Efficient Multi-layer Grid Method for Skyline Queries in Distributed Environments
- Introduction
- Related Work
- The Proposed Method
- Motivation
- The Processing of MGDS Algorithm
- Experiment Evaluation
- Experimental Environment
- Experimental Results
- Conclusions
- References
- Spatial Planning, Visualization, Mining and System
- 3D Indoor Route Planning for Arbitrary-Shape Objects
- Introduction
- Related Work
- The LEGO Model
- Checking the Accessibility for Arbitrary-Shape Objects
- The Maximum Widths
- The Maximum Heights
- The Maximum Lengths
- The LEGO Graph
- Conclusions and Future Work
- References
- A Web-Based Visualisation Tool for Analysing Mouse Movements to Support Map Personalisation
- Introduction
- Related Work
- System Description
- System Architecture and Technologies
- Discussion and On-Going Developments
- Conclusions
- References
- On the Requirements for User-Centric Spatial Data Warehousing and SOLAP
- Introduction
- Related Work
- Requirements for User-Centric Spatial OLAP
- A Meta-Framework for Spatial Data Warehouse Design
- Conclusions and Future Work
- References
- Optimal Bandwidth Selection for Density-Based Clustering
- Introduction
- Related Principles
- Basic Idea of Density Based Clustering Algorithm
- Parameter Estimation Model
- Density-Based Clustering Algorithm Using the Optimal Bandwidth Selection
- The Structure of the Algorithm
- Optimal Bandwidth Selection Model
- Case Study
- The Procedure of Optimal Bandwidth Selection
- Clustering Analysis
- Conclusions
- References
- Developing an Oracle-Based Spatio-Temporal Information Management System
- Introduction
- Overview of STOC
- Implementation of STOC
- Moving Data Types in STOC
- Spatio-Temporal Operations in STOC
- Case Study: A Traffic Information System
- Create BerlinMOD Database
- Spatio-Temporal Queries
- Related Work
- Conclusions
- References
- The First International Workshop on Flash-Based Database Systems (FlashDB)
- Storage Management for SSD
- Invited Talk I
- Some Research Directions in FlashDB
- References
- Regular Papers
- Page-Level Log Mapping: From Many-to-Many Mapping to One-to-One Mapping
- Introduction
- Design Overview
- Basic Concepts
- System Architecture
- The Implementations of the PLM Approach
- The Block Associative Log Mapping
- The Fully Associative Log Mapping
- Experimental Evaluation
- Related Work
- Conclusion
- References
- A Novel Method to Extend Flash Memory Lifetime in Flash-Based DBMS
- Introduction
- Characteristics of Flash Memory
- The Methods Used in Traditional Free Space Management
- Our Solution
- Overview
- Free Space Management
- Write Buffer
- Merge Operation
- Evaluation Experiments
- Experiment Setup
- Performance Results and Analysis
- Related Work
- Conclusions
- References
- Log-Compact R-Tree: An Efficient Spatial Index for SSD
- Introduction
- Preliminaries
- Introduction to SSD
- Related Work
- The LCR-Tree
- Overview of LCR-Tree
- Design Details of LCR-Tree
- Experimental Results
- Experiments on Synthetic Data Sets
- Experiments on Real Spatial Data Sets
- Conclusion and Future Work
- References
- An FTL-Agnostic Layer to Improve Random Write on Flash Memory
- Introduction
- NAND Flash Memories
- Write Spatial Locality for FTL-Based Devices
- Gathering Random Writes
- Model
- Results
- Related Works
- Conclusion
- References
- Energy Efficiency & Hybrid Storage
- Invited Talk II
- Energy Efficiency Is Not Enough, Energy Proportionality Is Needed!
- Introduction
- Experimental Results and Critical Observations
- SSD Performance Measurements
- Result Interpretation
- Findings in DBMS Buffer Management
- Objectives of Flash-Aware Replacement Algorithms
- Experiments
- Energy-Proportional Computing
- Design Considerations of WattDB
- Architecture Overview
- Storage Mapping and Partitioning
- Query Processing
- Cluster Coordination
- Conclusion and Future Work
- References
- Invited Talk III
- Flash-Based Database Systems: Experiences from the FlashDB Project
- References
- Regular Papers
- Trading Memory for Performance and Energy
- Introduction
- Related Work
- The 3LA Storage System
- The LOC Algorithm
- The GLB Algorithm
- Discussion
- Experiment
- Simulations
- Running a Real-Life Trace on Real Devices
- Conclusion and Future Work
- References
- Design of Embedded Database Based on Hybrid Storage of PRAM and NAND Flash Memory
- Introduction
- Related Work
- Hybrid Storage Architecture
- Transaction on Hybrid Storage
- Implementation Issue
- Experiment
- Experimental Environment
- Experimental Result
- Conclusion
- Future Work
- References
- Hybrid Storage with Disk Based Write Cache
- Introduction
- Related Work
- Flash Translation Layer
- Log-Block-Based FTL
- Hybrid Storage Policy
- The Hybrid Storage System Model
- The Migration Algorithm
- Page Placement
- Block Level Hybrid Algorithm
- Page Level HSLRU-2 Algorithm
- Performance Evaluation
- Conclusions and Future Work
- References
- The 2nd International Workshop on Social Networks and Social Media Mining on the Web (SNSMW)
- Social Networking and Community Structure
- An Analysis of Network Structure and Post Content for Blog Post Recommendation
- Introduction
- Literature Review
- Proposed Approaches
- Evaluation
- Conclusions
- References
- Extracting Local Community Structure from Local Cores
- Introduction
- Preliminaries
- Local Community
- Previous Algorithms
- Our Contribution
- Extracting Local Core
- Merging Vertices
- Pruning Phase
- Experiment Results
- Zachary's Karate Club Network
- GN Networks
- The NCAA Football Network
- Conclusions
- References
- On Summarizing Graph Homogeneously
- Introduction
- Problem Statement
- An Approximately Homogeneous Grouping Based on Information Theory
- Homogeneous Graph Summarization
- Experimental Results
- Related Works
- Conclusions
- References
- Expansion Properties of Large Social Graphs
- Introduction
- Related Work
- Measuring Expansion Properties
- Subgraph Centrality
- Experimental Results
- Conclusions
- References
- Text Representation Using Dependency Tree Subgraphs for Sentiment Analysis
- Introduction
- Our Method
- Subgraph Representation
- Feature Construction
- Discounting Scheme
- Experiments and Results
- Data and Evaluation Setup
- Results
- Related Work
- Conclusion
- References
- A Local Information Passing Clustering Algorithm for Tagging Systems
- Introduction
- Preliminaries
- Social Tagging System Model
- Tag Vector and Tag Similarity
- Local Information Passing Clustering Algorithm
- KNN Directed Graph and Local Information
- Local Information Passing Clustering Algorithm
- Experimental Evaluations
- Experimental Datasets
- Evaluation Measurements
- Experiments and Discussion
- Conclusion
- References
- Social Media and Data Mining
- What's in a Name: A Study of Names, Gender Inference, and Gender Behavior in Facebook
- Introduction
- Related Work
- Crawling and Data Gathering
- Using Facebook to Generate an Annotated Name List
- Combining Names with Their Nicknames
- Analysis of Annotated Name List
- Design of Gender Predictors
- Offline Name List Predictor (OFL)
- Facebook Generated Name List Predictor (FB)
- Local Information Predictor (LCL)
- Friend Information Predictor (FRND)
- Hybrid Predictors
- Evaluation of Gender Predictors
- Experimental Setup
- Effectiveness of Gender Predictors
- Inferring Gender for NYC Facebook Users
- User Partitioning
- Applying Gender Predictors to Group A
- Gender Inference Results
- Gender Characteristics and Behavior
- Privacy of Attributes
- Targeted Advertising and Privacy Implications
- Conclusions
- References
- Realtime Social Sensing of Support Rate for Microblogging
- Introduction
- Problem Setting
- Data Preparation
- Approach
- Preprocess
- Classification via Support Vector Machine
- Event Detection
- Experiments and Evaluations
- Training Data
- Support Rate Results
- Analysis
- Verifying Realtimeness
- Results of Event Detection
- Conclusion and Future Work
- References
- Searching Consultants in Web Forum
- Introduction
- Problem Statement
- Objects in Web Forum
- Definitions
- Approaches to Find Consultants in Web Forum
- Modeling Consultants Search
- Algorithms
- Experiments
- Data Collection
- Experiment Results
- Related Work
- Conclusion and Future Work
- References
- Comparing Similarity of HTML Structures and Affiliate IDs in Splog Analysis
- Introduction
- Similarity of HTML Structures
- Extracting DOM Sequences of an HTML Document
- Ratio of the Differences in DOM Sequences
- Automatic Collection of Splogs with High Similarities of HTML Structures
- Seed Splog Data Set
- The Procedure
- Analysis on Splog Rate
- Splogs and Affiliate IDs
- Analysis on Identifying Spammers
- Identifying Spammers Based on the Similarity of HTML Structures
- Comparison of the Similarity of HTML Structures and Affiliate IDs
- Concluding Remarks
- References
- Crowd-Powered TV Viewing Rates: Measuring Relevancy between Tweets and TV Programs
- Introduction
- A Twitter-Based TV Rating Platform
- Looking for Audiences on Twitter
- Twitter-Based TV Rating Platform
- Related Work
- Semantic Linking from Tweets to Relevant TV Programs
- Experiment
- Experimental Dataset
- Experimental Results
- Conclusions
- References
- The First International Workshop on Data Management for Emerging Network Infrastructures (DaMEN)
- Invited Talk
- GreenOrbs: Lessons Learned from Extremely Large Scale Sensor Network Deployment
- Query and Stream Processing
- Adapting Skyline Computation to the MapReduce Framework: Algorithms and Experiments
- Introduction
- Preliminaries
- Skyline: Definition and Properties
- The MapReduce Framework
- MapReduce-Based Skyline Computation Algorithms
- MR-BNL
- MR-SFS
- MR-Bitmap
- Performance Evaluation
- Experimental Setting
- Experimental Results
- Related Work
- Skyline Computation
- Data Management and Query Processing under the MapReduce Framework
- Conclusion
- References
- Efficient Event Stream Processing: Handling Ambiguous Events and Patterns with Negation
- Introduction
- Background
- Constructing NFA for Pattern Queries with Negation
- Constructing DFA for Pattern Queries and Ambiguous Events
- Performance Evaluation
- Related Work
- Conclusions and Future Work
- References
- Effective Keyword Search for Candidate Fragments of XML Documents
- Introduction
- Related Work
- Query Semantics
- XML Data Model
- CAF Semantics
- Query Algorithms
- Node Match Algorithm
- Path Match Algorithm
- Experimental Evaluation
- Experimental Setup
- Datasets and Keyword Queries
- Query Effectiveness
- Query Efficiency
- Conclusion
- References
- Storage and Scheduling
- Optimized Data Placement for Column-Oriented Data Store in the Distributed Environment
- Introduction
- Related Work
- Problem Statement
- Data Placement
- An Overview
- Content-Aware Bitmap Index Key Generation
- Index Construction
- Data Placement
- Segment Split
- Query Processing
- Multi-dimensional Range Query and Multi-attribute Range Query
- Aggregation Query and Approximate Aggregation Query
- Performance Evaluation
- Evaluation on Access Efficiency
- Evaluation on Aggregation Accuracy
- A Comparison
- Conclusion
- References
- Two-Step Joint Scheduling Scheme for Road Side Units (RSUs)-Based Vehicular Ad Hoc Networks (VANETs)
- Introduction
- Related Work
- Background and Preliminaries
- System Model
- Notation and Assumptions
- Scheduling Schemes
- First-Step Scheduling
- Our Proposed Scheduling Algorithm
- Performance Metrics
- Performance Evaluation
- Experimental Setup
- Effect of Deadline Miss Rate
- Effect of
- Effect of Data Item Size Distribution
- Conclusion and Future Work
- References
- A Content-Aware Adaptive Storage Approach for XML in PXRDB
- Introduction
- Presentation of Adaptive XML Storage Schema
- Storage Scheme Selector
- Storage Scheme Selector Function
- Implementation of the Selector-CASF
- Experiments
- Datasets
- Accuracy of Choosing Suitable Storage
- Related Work
- Conclusion
- References
- Fourth International Workshop on Data Quality in Integration Systems (DQIS)
- Invited Talk
- The Flamingo Software Package on Approximate String Queries
- Session I
- Invited Paper
- A Framework for Data Quality Aware Query Systems
- Introduction
- Existing Literature
- Framework for DQ Aware Query Systems
- Data Quality Profiling
- Capture User Preference on Data Quality
- Query Planning
- Conclusions
- References
- Regular Papers
- SemGen-Towards a Semantic Data Generator for Benchmarking Duplicate Detectors
- Introduction
- Qualitative Description of Duplicate Semantics
- Approach
- Related Work
- Discussion and Further Work
- References
- Estimating a Transit Passenger Trip Origin-Destination Matrix Using Automatic Fare Collection System
- Introduction
- Estimating Passenger Trajectory
- Data Analysis
- Trajectory Search Algorithms
- Travel Demand Matrix
- Case Studies
- Conclusions
- References
- Session II
- Invited Paper
- An Approach to Assess the Quality of Web Pages in the Deep Web
- Introduction
- Related Works
- Preprocessing for the Assessment
- The Schema Model of Web Data
- The Annotation of Web Data
- The Method for Quality Assessment
- Analyzing the Structure Complexity
- Analyzing the Text Complexity
- The Quality Level
- An XQuery-Based Wrapper
- Experimental Results
- Conclusion
- References
- Regular Papers
- Using Machine Learning to Support Resource Quality Assessment: An Adaptive Attribute-Based Approach for Health Information Portals
- Introduction
- Research Context: The BCKOnline Portal
- An Adaptive Attribute-Based Approach for Resource Quality Assessment
- An Attribute-Based Data Model for the Healthcare Domain
- Machine Learning for Predicting Quality Attributes
- ML Procedures for Intelligent Quality Assessment
- Selection of ML Scheme
- Selection of Data Attributes
- Data Cleaning and Transforming
- Evaluation of Prediction Performance
- Statistical Evaluation Method
- Datasets for Experiments
- Comparison of Prediction Performance
- Predicting Accuracy of SVMs
- Conclusion and Future Work
- References
- Grid-Based Probabilistic Skyline Retrieval on Distributed Uncertain Data
- Introduction
- Problem Definition
- The Grid-Based Probabilistic Skyline Algorithm
- The Framework
- Loading Data
- Merge and Sharing
- Local Pruning
- Further Optimization
- Related Works
- Experimental Evaluations
- Experimental Setup
- Experimental Results
- Conclusions
- References
- Author Index
System requirements
File format: PDF
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use the free software Adobe Reader, Adobe Digital Editions, or any other PDF viewer of your choice (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or another reading app for eBooks, e.g., PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Watermark-DRM, a „soft” copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.