
Advanced Data Mining and Applications

Content
- Intro
- Title
- Preface
- Organization
- Table of Contents
- Retrieval in CBR Using a Combination of Similarity and Association Knowledge
- Introduction
- Motivation
- Related Work
- Background of Similarity and Association Knowledge
- Background of Similarity Knowledge
- Background of Association Knowledge
- Association Knowledge Formalization
- The USIMSCAR Algorithm
- Evaluation
- Results and Analysis
- Conclusion and Future Work
- References
- A Clustering Approach Using Weighted Similarity Majority Margins
- Introduction
- Dual Similarity-Dissimilarity Modelling
- Pairwise Similarity and Dissimilarity Statements
- Taking into Account Strong Dissimilarities
- The Condorcet Similarity Graph
- Definition of the Clusters
- Clustering Algorithm
- Results
- Conclusions and Perspectives
- References
- A False Negative Maximal Frequent Itemset Mining Algorithm over Stream
- Introduction
- Preliminaries
- Frequent Itemsets
- Maximal Frequent Itemsets
- Chernoff Bound
- A Naive Method
- FNMFIMoDS Algorithm
- Data Structure
- Mining Strategies
- FNMFIMoDS
- Experimental Results
- Running Time Cost and Memory Cost Evaluation
- Precision and Recall
- Conclusions
- References
- A Graph Enrichment Based Clustering over Vertically Partitioned Data
- Introduction
- Problem Statement
- Graph Enrichment Based Clustering Approach
- Graph Construction
- Graph Updating
- Graph Partitioning
- Experimental Results
- Quality of the Obtained Partition
- Effect of Noise on the Enrichment Performance
- Conclusion
- References
- A Method for Finding Groups of Related Herbs in Traditional Chinese Medicine
- Introduction
- Method Overview
- Obtaining Dataset of Paired Herbs
- Data Preprocessing for TCM Digital Books
- Group Detection
- Herbal Network
- Partitioning the Network into Groups
- Experiments
- Experiments on Combinational Rule Mining
- Experiments on Community Detection
- Conclusions
- References
- A New Hybrid Clustering Method for Reducing Very Large Spatio-temporal Dataset
- Introduction
- Related Work
- Hybrid Clustering Algorithm
- Issues of SNNdegree
- New SNN-DBSCAN Algorithm
- Evaluation and Analysis
- Experiment Setup
- Analysis
- Conclusion and Future Work
- References
- A Normal Distribution-Based Over-Sampling Approach to Imbalanced Data Classification
- Introduction
- Performance Evaluation Metrics
- The Normal Distribution Model
- Experimental Results
- Conclusions
- References
- A Novel Genetic Algorithm for Overlapping Community Detection
- Introduction
- Related Work
- The New Genetic Algorithm for Community Detection
- Framework of the Algorithm
- Objective Function
- Genetic Representation
- Operators
- Discussion
- Experiments
- Experiments on Artificial Networks
- Experiments on Real Networks
- Conclusion
- References
- A Probabilistic Topic Model with Social Tags for Query Reformulation in Informational Search
- Introduction
- Related Work
- The Probabilistic Topic Model for Query Reformulation
- Topic Model with Social Tags
- Estimation of Parameters
- Retrieval and Ranking of URLs
- Experiments
- Datasets
- Methodology
- Results
- Conclusions and Future Work
- References
- A QoS-Aware Web Services Selection Model Using AND/OR Graph
- Introduction
- Basic Concepts
- Quality Criteria for Elementary Web Service
- Meta-control Logical Relation Between Services
- Composite Service Model Using AND/OR Graph
- AND/OR Graph for Composite Services
- Description of Composite Service Selection Model
- Web Service Composition Selection Algorithm Based on ACO
- Experiments and Analysis
- Example Analysis
- Experimental Results
- Conclusion and Future Work
- References
- A Tweet-Centric Approach for Topic-Specific Author Ranking in Micro-Blog
- Introduction
- Motivation
- Our Work
- Related Work
- A User-Tweet Interaction Model
- Topic-Specific Author Ranking
- Reader Score Reflecting Reader's Topic-Focus Degree
- Tweet Score Measuring Tweet's Quality
- User Score Containing Reader Score and Author Score
- Author Ranking
- Experiments
- Empirical Evaluation of Author Ranking
- Effectiveness of Author Ranking
- Efficiency and Analysis of Timely Author Ranking
- Effect of Weight Settings in the User-Tweet Graph
- Conclusion
- References
- An Algorithm for Sample and Data Dimensionality Reduction Using Fast Simulated Annealing
- Introduction
- Methodological Preliminaries
- Basic Exploratory Data Mining Tasks
- Fast Simulated Annealing
- Algorithm Description
- Dimensionality Reduction
- Weighting and Sample Length Reduction
- Experimental Results
- Conclusion
- References
- An Investigation of Recursive Auto-associative Memory in Sentiment Detection
- Introduction
- Related Work
- Sentiment Detection
- Artificial Neural Networks in Natural Language Processing
- System Design and Methodology
- Implementation of RAAM
- A Single RAAM
- A RAAM Ensemble
- Triplet Tree Generation
- Data Collection and Parameter Determination
- Experiments and Results
- Similarity Detection
- Sentiment Detection
- Results on Ambiguous Words
- Conclusion and Future Work
- References
- APPECT: An Approximate Backbone-Based Clustering Algorithm for Tags
- Introduction
- Related Work
- Preliminaries
- Social Tagging System Model
- Similarity Measure
- Approximate Backbone of Tag Clustering Results
- The Details of APPECT
- Capture the Approximate Backbone
- Details of APPECT
- Time Analysis of APPECT
- Experiments and Evaluations
- Data Sets
- Evaluation Metrics
- Experimental Results and Discussions
- Conclusion and Future Work
- References
- Bi-clustering Gene Expression Data Using Co-similarity
- Introduction
- The χ-Sim Algorithm
- The Algorithm
- χ-Sim as a Bi-clustering Algorithm
- Experimentation
- Experimental Methodology
- Sample Cluster Analysis
- Gene Cluster Analysis
- Related Work
- Conclusion
- References
- CCE: A Chinese Concept Encyclopedia Incorporating the Expert-Edited Chinese Concept Dictionary with Online Cyclopedias
- Introduction
- Related Work
- Chinese Concept Dictionary
- English Concept Cyclopedia Enhancement
- Construction Methodology for CCE
- Notation and Definition
- The Structure of Baidu Baike
- Method for Ontology Generation
- Semantic Distances and Similarity
- Experiments
- Experiment Setup
- Case Study of CCE
- Document Classification
- Document Clustering
- Relevance Analysis in BlogSphere
- Conclusion and Future Work
- References
- Cluster Ensembles via Weighted Graph Regularized Nonnegative Matrix Factorization
- Introduction
- Related Work
- WGNMF
- Notation
- NMF and Extensions
- Weighted Graph Regularized NMF
- Optimization
- Experiments
- Datasets
- Evaluation Criteria
- Comparison Settings
- Experimental Results
- Individual Clustering Selection
- Impact of Parameter
- Conclusion and Future Work
- References
- Continuously Identifying Representatives Out of Massive Streams
- Introduction
- Related Work
- Problem Statement
- Continuously Identifying Representatives
- Core Clustering
- Extracting Representatives
- Representatives Adjustment
- Experimental Evaluations
- Datasets
- Effectiveness
- Efficiency
- Conclusion
- References
- Cost-Sensitive Decision Tree for Uncertain Data
- Introduction
- Related Work
- Problem Definition
- Cost-Sensitive Decision Tree for Uncertain Data (CSDTU)
- Data Uncertainty
- Training Algorithm for CSDTU
- Testing Algorithm for CSDTU
- Experiments
- Conclusions and Future Work
- References
- Direct Marketing with Fewer Mistakes
- Introduction
- Related Work
- Most-Certain Learning (MCL) Paradigm
- MCL-b Learning Strategy
- MCL-1 Learning Strategy
- Experiment
- MCL-b Experiment
- MCL-1 Experiment
- Conclusion and Discussion
- References
- Discovering Collective Viewpoints on Micro-blogging Events Based on Community and Temporal Aspects
- Introduction
- Problem Statement
- The Framework of Mining Viewpoints
- Term-Tweet-User Graph
- Problem Definition
- Mining Viewpoints on TWU Graph
- Our Solution
- Modeling Community and Temporal Aspects
- Random Walk on TWU Graph
- Our Algorithm
- Experiments
- Dataset Description
- Evaluation Metric
- Comparison Methods
- Related Work
- Conclusions
- References
- Discriminatory Confidence Analysis in Pattern Mining
- Introduction
- Related Work
- Problem Definition
- The Design of the Information Score Filter
- Discriminatory Confidence vs. Information Score
- Empirical Study
- Discriminatory Confidence Analysis
- Correlation Analysis
- Discriminatory Rule Analysis
- Conclusion
- References
- Dominance-Based Soft Set Approach in Decision-Making Analysis
- Introduction
- Information System
- Soft Set Theory
- Dominance Relation Based Soft Set Theory
- Experiments
- Conclusion
- References
- Efficient Computation of Measurements of Correlated Patterns in Uncertain Data
- Introduction
- Related Work
- Preliminaries
- The Model for Uncertain Data
- All-Confidence and Bond
- Efficient Computation of Expected All-Confidence and Bond
- Definition of Expected All-Confidence and Bond
- Efficient Computation of Expected Bond
- Efficient Computation of All-Confidence
- Experiments and Evaluations
- Evaluation Datasets
- Evaluation Result
- Conclusions
- References
- Efficient Subject-Oriented Evaluating and Mining Methods for Data with Schema Uncertainty
- Introduction
- Related Works
- Problem Definition
- Hierarchical Monte Carlo Possible World Analysis
- Experiment Discusses and Applications
- Conclusion
- References
- An Empirical Evaluation of Bagging with Different Algorithms on Imbalanced Data
- Introduction
- Designed Framework
- Different Levels of Sample Distributions to Form a ROC
- Statistical Test
- Classify Base Learners
- Evaluation Metrics
- Experimental Setting
- Data-Sets
- Selection of Base Learners
- Experimental Results Analysis
- Compare the Performance of Bagging and Single Learners
- Compare the ROC Curves of Bagging MLP with a Single Learner MLP
- Compare Bagging Predictors against One Another
- Conclusion
- References
- Exploiting Concept Clumping for Efficient Incremental News Article Categorization
- Introduction
- Related Work
- Concept Clumping for Multi-label Categorization
- Categorization Methods
- Base Algorithms
- Thresholding Techniques
- Term-Category Weight Boosting
- Experimental Evaluation
- The RCV1 Datasets
- Experimental Setup
- Discussion of Results
- Conclusion
- References
- Extracting Rocks from Mars Images with Data Fields
- Introduction
- Data Field
- Mathematical Model
- Mars Image Field
- Principles
- Differ Foreground Rocks from Background Information
- Initial Grids
- Identify Clustering Centers
- Detect Edges
- Find Full Clusters
- Discover Target Rocks
- Experiments and Comparisons
- Conclusions
- References
- Finding a Wise Group of Experts in Social Networks
- Introduction
- Problems
- Preliminaries
- Problem Definition
- Discussion
- Computation of Social Influence
- Social Influence
- Approach for Computation
- Algorithms
- RarestFirst Algorithm
- Simplified RarestFirst Algorithm
- Experiments
- Other Algorithms
- Data Preparation
- Performance Evaluation
- Related Works
- Conclusions and Future Work
- References
- Fully Utilize Feedbacks: Language Model Based Relevance Feedback in Information Retrieval
- Introduction
- Related Work
- Language Modeling Approach
- Relevance Feedback Methods on Language Modeling Approach
- Model Definition
- Incorporating Labeled Relevant Documents
- Incorporating Unlabeled Documents
- Incorporating Labeled Irrelevant Documents
- Rank the Documents
- Experiment
- Dataset and Competitors
- Experiment Results
- Conclusion and Future Work
- References
- FXProj - A Fuzzy XML Documents Projected Clustering Based on Structure and Content
- Introduction
- Related Works
- Preliminaries
- XProj Algorithm
- LAC Algorithm
- The FXProj Algorithm
- Phase I: Structure Mining
- Phase II: Content Mining
- Phase III: Clustering
- Experiments
- Data Description and Evaluation Measures
- The Accuracy of FXProj Algorithm
- The Scalability of FXProj Algorithm
- Conclusion
- References
- Author Index
System requirements
File format: PDF
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use the free software Adobe Reader, Adobe Digital Editions, or any other PDF viewer of your choice (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or another reading app for eBooks, e.g., PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, PocketBook, Sony, Tolino and many more (Kindle: limited support only).
The PDF file format always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). On the small screens of e-readers or smartphones, however, PDFs are awkward to read because they require a lot of scrolling.
This eBook uses Watermark-DRM, a "soft" copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.