Database Systems for Advanced Applications

Name: Database Systems for Advanced Applications | 16th International Conference, DASFAA 2011 International Workshops: GDB, SIM3, FlashDB, SNSMW, DaMEN, DQIS, Hong Kong, China, April 22-25, 2011, Proceedings
Brand: Springer
Price: 53.49 EUR
Availability: OnlineOnly

16th International Conference, DASFAA 2011 International Workshops: GDB, SIM3, FlashDB, SNSMW, DaMEN, DQIS, Hong Kong, China, April 22-25, 2011, Proceedings

Jianliang Xu Ge Yu Shuigeng Zhou Rainer Unland(Editor)

Springer (Publisher)

Published on 12. April 2011

XXIV, 550 pages

E-Book

PDF with digital watermarking

System requirements

978-3-642-20244-5 (ISBN)

€53.49incl. 7% vat

System requirements

for PDF with digital watermarking

E-Book Single Licence

Available for download

Description

More details

Other editions

Content

Title
Preface
Organization
Table of Contents
The 1st International Workshop on Graph-structured Data Bases (GDB 2011)
Invited Talk
Privacy-Preserved Network Data Publishing
Systems
Towards Efficient Subgraph Search in Cloud Computing Environments
Introduction
Related Work
Graph Search and Indexing
MapReduce-Based Computing
Cloud-Based Subgraph Search: An Overview
Implementation Techniques
Index Building
Subgraph Search
Experimental Evaluation
Experimental Settings
Experimental Results
Conclusion and Future Work
References
Latency-Optimal Walks in Replicated and Partitioned Graphs
Introduction
Problem
Definitions
Optimal Partition Walks
Fast-Forward-Search
System Model
Query Model
Fast-Forward-Search
Cost Analysis
Lowering the Fan-Out
Proof of Optimality
Evaluation
Generating Graph Partitionings
Experimental Setup
Experimental Results
Related Work
Conclusion
References
Graph-Based Matching of Composite OWL-S Services
Introduction
Graph Representation of OWL-S Processes
Matching OWL-S Processes
Matching Atomic Components
Matching Process Structure
Experimental Evaluation
Related Work
Conclusions and Future Work
References
Theories
Design Non-recursive and Redundant-Free XML Conceptual Schema with Hypergraph
Introduction
Related Work
Methodology
Conclusion
References
Classifying Graphs Using Theoretical Metrics: A Study of Feasibility
Introduction
Representing Graphs with Theoretical Metrics
Graph-Theoretical Metrics
Feature Selection and Graph Transformation
Graph Classification
Experimental Evaluations
Tasks and Data Sets
Results
Related Work
Conclusions
References
The First International Workshop on Spatial Information Modeling, Management and Mining (SIM3)
Spatial Data Management: Compression, Storage and Query
A GML Documents Stream Compressor
Introduction
Related Work
The GDScomp Method
GDScomp Architecture
Event Handler
Dynamic Structure Compression
Delta Compression
Experimental Evaluation
Compression Ratio
Compression Time and Decompression Time
Conclusion and Future Work
References
A Query-Friendly Compression for GML Documents
Introduction
Background
An Example of GML Documents
An Example of GML Queries
Query-Friendly GML Compression
Compression Model
SAX Event Dictionary.
SAX Events Hierarchy.
SAX Event Wavelet Tree.
Document Content Blocks.
Query Resolution Process
Compression Algorithm
Conclusion
References
Storing GML Documents: A Model-Mapping Based Approach
Introduction
Related Work
Model-Mapping Storage Method Based on Nodes and Edges
GML Document Storing Architecture
GML Document Data Schema
Constructing of GML Document Tree
GML Document Database Model
The Experimental Analyzing of GML Document Data Storing Time
GML Query Processing
Experimental Analysis
Conclusion
References
GML Data Management: Framework and Prototype
Introduction
GML Structure and Query Language
GML Structure and Model
GML Query Language
The Framework of GML Data Management
The Processing Center
Storing GML Data in Object-Relational Database
GQL Query in Object-Relation Database
GQL Processor
GML Indexing
The Prototype
Conclusion and Future Work
References
An Efficient Multi-layer Grid Method for Skyline Queries in Distributed Environments
Introduction
Related Work
The Proposed Method
Motivation
The Processing of MGDS Algorithm
Experiment Evaluation
Experimental Environment
Experimental Results
Conclusions
References
Spatial Planning, Visualization, Mining and System
3D Indoor Route Planning for Arbitrary-Shape Objects
Introduction
Related Work
The LEGO Model
Checking the Accessibility for Arbitrary-Shape Objects
The Maximum Widths
The Maximum Heights
The Maximum Lengths
The LEGO Graph
Conclusions and Future Work
References
A Web-Based Visualisation Tool for Analysing Mouse Movements to Support Map Personalisation
Introduction
Related Work
System Description
System Architecture and Technologies
Discussion and On-Going Developments
Conclusions
References
On the Requirements for User-Centric Spatial Data Warehousing and SOLAP
Introduction
Related Work
Requirements for User-Centric Spatial OLAP
A Meta-Framework for Spatial Data Warehouse Design
Conclusions and Future Work
References
Optimal Bandwidth Selection for Density-Based Clustering
Introduction
Related Principles
Basic Idea of Density Based Clustering Algorithm
Parameter Estimation Model
Density-Based Clustering Algorithm Using the Optimal Bandwidth Selection
The Structure of the Algorithm
Optimal Bandwidth Selection Model
Case Study
The Procedure of Optimal Bandwidth Selection
Clustering Analysis
Conclusions
References
Developing an Oracle-Based Spatio-Temporal Information Management System
Introduction
Overview of STOC
Implementation of STOC
Moving Data Types in STOC
Spatio-Temporal Operations in STOC
Case Study: A Traffic Information System
Create BerlinMOD Database
Spatio-Temporal Queries
Related Work
Conclusions
References
The First International Workshop on Flash-Based Database Systems (FlashDB)
Storage Management for SSD
Invited Talk I
Some Research Directions in FlashDB
References
Regular Papers
Page-Level Log Mapping: From Many-to-Many Mapping to One-to-One Mapping
Introduction
Design Overview
Basic Concepts
System Architecture
The Implementations of the PLM Approach
The Block Associative Log Mapping
The Fully Associative Log Mapping
Experimental Evaluation
Related Work
Conclusion
References
A Novel Method to Extend Flash Memory Lifetime in Flash-Based DBMS
Introduction
Characteristics of Flash Memory
The Methods Used in Traditional Free Space Management
Our Solution
Overview
Free Space Management
Write Buffer
Merge Operation
Evaluation Experiments
Experiment Setup
Performance Results and Analysis
Related Work
Conclusions
References
Log-Compact R-Tree: An Efficient Spatial Index for SSD
Introduction
Preliminaries
Introduction to SSD
Related Work
The LCR-Tree
Overview of LCR-Tree
Design Details of LCR-Tree
Experimental Results
Experiments on Synthetic Data Sets
Experiments on Real Spatial Data Sets
Conclusion and Future Work
References
An FTL-Agnostic Layer to Improve Random Write on Flash Memory
Introduction
NAND Flash Memories
Write Spatial Locality for FTL-Based Devices
Gathering Random Writes
Model
Results
Related Works
Conclusion
References
Energy Efficiency & Hybrid Storage
Invited Talk II
Energy Efficiency Is Not Enough, Energy Proportionality Is Needed!
Introduction
Experimental Results and Critical Observations
SSD Performance Measurements
Result Interpretation
Findings in DBMS Buffer Management
Objectives of Flash-Aware Replacement Algorithms
Experiments
Energy-Proportional Computing
Design Considerations of WattDB
Architecture Overview
Storage Mapping and Partitioning
Query Processing
Cluster Coordination
Conclusion and Future Work
References
Invited Talk III
Flash-Based Database Systems: Experiences from the FlashDB Project
References
Regular Papers
Trading Memory for Performance and Energy
Introduction
Related Work
The 3LA Storage System
The LOC Algorithm
The GLB Algorithm
Discussion
Experiment
Simulations
Running a Real-Life Trace on Real Devices
Conclusion and Future Work
References
Design of Embedded Database Based on Hybrid Storage of PRAM and NAND Flash Memory
Introduction
Related Work
Hybrid Storage Architecture
Transaction on Hybrid Storage
Implementation Issue
Experiment
Experimental Environment
Experimental Result
Conclusion
Future Work
References
Hybrid Storage with Disk Based Write Cache
Introduction
Related Work
Flash Translation Layer
Log-Block-Based FTL
Hybrid Storage Policy
The Hybrid Storage System Model
The Migration Algorithm
Page Placement
Block Level Hybrid Algorithm
Page Level HSLRU-2 Algorithm
Performance Evaluation
Conclusions and Future Work
References
The 2nd International Workshop on Social Networks and Social Media Mining on the Web (SNSMW)
Social Networking and Community Structure
An Analysis of Network Structure and Post Content for Blog Post Recommendation
Introduction
Literature Review
Proposed Approaches
Evaluation
Conclusions
References
Extracting Local Community Structure from Local Cores
Introduction
Preliminaries
Local Community
Previous Algorithms
Our Contribution
Extracting Local Core
Merging Vertices
Pruning Phase
Experiment Results
Zachary's Karate Club Network
GN Networks
The NCAA Football Network
Conclusions
References
On Summarizing Graph Homogeneously
Introduction
Problem Statement
An Approximately Homogeneous Grouping Based on Information Theory
Homogeneous Graph Summarization
Experimental Results
Related Works
Conclusions
References
Expansion Properties of Large Social Graphs
Introduction
Related Work
Measuring Expansion Properties
Subgraph Centrality
Experimental Results
Conclusions
References
Text Representation Using Dependency Tree Subgraphs for Sentiment Analysis
Introduction
Our Method
Subgraph Representation
Feature Construction
Discounting Scheme
Experiments and Results
Data and Evaluation Setup
Results
Related Work
Conclusion
References
A Local Information Passing Clustering Algorithm for Tagging Systems
Introduction
Preliminaries
Social Tagging System Model
Tag Vector and Tag Similarity
Local Information Passing Clustering Algorithm
KNN Directed Graph and Local Information
Local Information Passing Clustering Algorithm
Experimental Evaluations
Experimental Datasets
Evaluation Measurements
Experiments and Discussion
Conclusion
References
Social Media and Data Mining
What's in a Name: A Study of Names, Gender Inference, and Gender Behavior in Facebook
Introduction
Related Work
Crawling and Data Gathering
Using Facebook to Generate an Annotated Name List
Combining Names with Their Nicknames
Analysis of Annotated Name List
Design of Gender Predictors
Offline Name List Predictor (OFL)
Facebook Generated Name List Predictor (FB)
Local Information Predictor (LCL)
Friend Information Predictor (FRND)
Hybrid Predictors
Evaluation of Gender Predictors
Experimental Setup
Effectiveness of Gender Predictors
Inferring Gender for NYC Facebook Users
User Partitioning
Applying Gender Predictors to Group A
Gender Inference Results
Gender Characteristics and Behavior
Privacy of Attributes
Targeted Advertising and Privacy Implications
Conclusions
References
Realtime Social Sensing of Support Rate for Microblogging
Introduction
Problem Setting
Data Preparation
Approach
Preprocess
Classification via Support Vector Machine
Event Detection
Experiments and Evaluations
Training Data
Support Rate Results
Analysis
Verifying Realtimeness
Results of Event Detection
Conclusion and Future Work
References
Searching Consultants in Web Forum
Introduction
Problem Statement
Objects in Web Forum
Definitions
Approaches to Find Consultants in Web Forum
Modeling Consultants Search
Algorithms
Experiments
Data Collection
Experiment Results
Related Work
Conclusion and Future Work
References
Comparing Similarity of HTML Structures and Affiliate IDs in Splog Analysis
Introduction
Similarity of HTML Structures
Extracting DOM Sequences of an HTML Document
Ratio of the Differences in DOM Sequences
Automatic Collection of Splogs with High Similarities of HTML Structures
Seed Splog Data Set
The Procedure
Analysis on Splog Rate
Splogs and Affiliate IDs
Analysis on Identifying Spammers
Identifying Spammers Based on the Similarity of HTML Structures
Comparison of the Similarity of HTML Structures and Affiliate IDs
Concluding Remarks
References
Crowd-Powered TV Viewing Rates: Measuring Relevancy between Tweets and TV Programs
Introduction
A Twitter-Based TV Rating Platform
Looking for Audiences on Twitter
Twitter-Based TV Rating Platform
Related Work
Semantic Linking from Tweets to Relevant TV Programs
Experiment
Experimental Dataset
Experimental Results
Conclusions
References
The First International Workshop on Data Management for Emerging Network Infrastructures (DaMEN)
Invited Talk
GreenOrbs: Lessons Learned from Extremely Large Scale Sensor Network Deployment
Query and Stream Processing
Adapting Skyline Computation to the MapReduce Framework: Algorithms and Experiments
Introduction
Preliminaries
Skyline: Definition and Properties
The MapReduce Framework
MapReduce-Based Skyline Computation Algorithms
MR-BNL
MR-SFS
MR-Bitmap
Performance Evaluation
Experimental Setting
Experimental Results
Related Work
Skyline Computation
Data Management and Query Processing under the MapReduce Framework
Conclusion
References
Efficient Event Stream Processing: Handling Ambiguous Events and Patterns with Negation
Introduction
Background
Constructing NFA for Pattern Queries with Negation
Constructing DFA for Pattern Queries and Ambiguous Events
Performance Evaluation
Related Work
Conclusions and Future Work
References
Effective Keyword Search for Candidate Fragments of XML Documents
Introduction
Related Work
Query Semantics
XML Data Model
CAF Semantics
Query Algorithms
Node Match Algorithm
Path Match Algorithm
Experimental Evaluation
Experimental Setup
Datasets and Keyword Queries
Query Effectiveness
Query Efficiency
Conclusion
References
Storage and Scheduling
Optimized Data Placement for Column-Oriented Data Store in the Distributed Environment
Introduction
Related Work
Problem Statement
Data Placement
An Overview
Content-Aware Bitmap Index Key Generation
Index Construction
Data Placement
Segment Split
Query Processing
Multi-dimensional Range Query and Multi-attribute Range Query
Aggregation Query and Approximate Aggregation Query
Performance Evaluation
Evaluation on Access Efficiency
Evaluation on Aggregation Accuracy
A Comparison
Conclusion
References
Two-Step Joint Scheduling Scheme for Road Side Units (RSUs)-Based Vehicular Ad Hoc Networks (VANETs)
Introduction
Related Work
Background and Preliminaries
System Model
Notation and Assumptions
Scheduling Schemes
First-Step Scheduling
Our Proposed Scheduling Algorithm
Performance Metrics
Performance Evaluation
Experimental Setup
Effect of Deadline Miss Rate
Effect of
Effect of Data Item Size Distribution
Conclusion and Future Work
References
A Content-Aware Adaptive Storage Approach for XML in PXRDB
Introduction
Presentation of Adaptive XML Storage Schema
Storage Scheme Selector
Storage Scheme Selector Function
Implementation of the Selector-CASF
Experiments
Datasets
Accuracy of Choosing Suitable Storage
Related Work
Conclusion
References
Fourth International Workshop on Data Quality in Integration Systems (DQIS)
Invited Talk
The Flamingo Software Package on Approximate String Queries
Session I
Invited Paper
A Framework for Data Quality Aware Query Systems
Introduction
Existing Literature
Framework for DQ Aware Query Systems
Data Quality Profiling
Capture User Preference on Data Quality
Query Planning
Conclusions
References
Regular Papers
SemGen-Towards a Semantic Data Generator for Benchmarking Duplicate Detectors
Introduction
Qualitative Description of Duplicate Semantics
Approach
Related Work
Discussion and Further Work
References
Estimating a Transit Passenger Trip Origin-Destination Matrix Using Automatic Fare Collection System
Introduction
Estimating Passenger Trajectory
Data Analysis
Trajectory Search Algorithms
Travel Demand Matrix
Case Studies
Conclusions
References
Session II
Invited Paper
An Approach to Assess the Quality of Web Pages in the Deep Web
Introduction
Related Works
Preprocessing for the Assessment
The Schema Model of Web Data
The Annotation of Web Data
The Method for Quality Assessment
Analyzing the Structure Complexity
Analyzing the Text Complexity
The Quality Level
An XQuery-Based Wrapper
Experimental Results
Conclusion
References
Regular Papers
Using Machine Learning to Support Resource Quality Assessment: An Adaptive Attribute-Based Approach for Health Information Portals
Introduction
Research Context: The BCKOnline Portal
An Adaptive Attribute-Based Approach for Resource Quality Assessment
An Attribute-Based Data Model for the Healthcare Domain
Machine Learning for Predicting Quality Attributes
ML Procedures for Intelligent Quality Assessment
Selection of ML Scheme
Selection of Data Attributes
Data Cleaning and Transforming
Evaluation of Prediction Performance
Statistical Evaluation Method
Datasets for Experiments
Comparison of Prediction Performance
Predicting Accuracy of SVMs
Conclusion and Future Work
References
Grid-Based Probabilistic Skyline Retrieval on Distributed Uncertain Data
Introduction
Problem Definition
The Grid-Based Probabilistic Skyline Algorithm
The Framework
Loading Data
Merge and Sharing
Local Pruning
Further Optimization
Related Works
Experimental Evaluations
Experimental Setup
Experimental Results
Conclusions
References
Author Index

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

Database Systems for Advanced Applications

Description

More details

Other editions

Additional editions

Content

System requirements