
Big Data Technology and Applications
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
This book constitutes the refereed proceedings of the First National Conference on Big Data Technology and Applications, BDTA 2015, held in Harbin, China, in December 2015.
The 26 revised papers presented were carefully reviewed and selected from numerous submissions. The papers address issues such as the storage technology of Big Data; analysis of Big Data and data mining; visualization of Big Data; the parallel computing framework under Big Data; the architecture and basic theory of Big Data; collection and preprocessing of Big Data; innovative applications in some areas, such as internet of things and cloud computing.
More details
Other editions
Additional editions

Content
- Intro
- Preface
- Organization
- Contents
- A General MHSS Iteration Method for a Class of Complex Symmetric Linear Systems
- Abstract
- 1 Introduction
- 2 The General MHSS Method 4E49 MSSOR
- Funds
- References
- Big Data Storage Architecture Design in Cloud Computing
- Abstract
- 1 Introduction
- 2 Big Data Platform Design
- 2.1 Platform Environment
- 2.2 Architecture Design
- 3 Key Technologies
- 3.1 Storage System Design Based on Cloud Computing
- 3.2 Updating Algorithm Design Based on File Block
- 3.3 Fault Recovery Mechanism Design Based on Cloud Storage
- 4 Conclusions
- References
- Visual Analysis for Civil Aviation Passenger Reservation Data Characteristics Based on Uncertainty M ...
- Abstract
- 1 Introduction
- 2 Uncertainty Measurement of Multidimensional Data
- 2.1 Measurement Method
- 2.2 Uncertainty Measurement of Properties and Data Record
- 3 Visual Analytic
- 3.1 Parallel Coordinates with Histogram
- 3.1.1 Parallel Coordinates
- 3.1.2 Histogram
- 3.2 Radar Chart
- 3.3 Pixel Chart
- 3.4 Interaction Method
- 3.4.1 Hidden
- 3.4.2 Zooming and Moving
- 4 Conclusion
- References
- The Mobile Personalized Recommendation Model Containing Implicit Intention
- Abstract
- 1 Introduction
- 2 Mobile Personalized Recommendation Model with Implicit Intention
- 3 Research on Key Technologies Based on MPRI Method
- 3.1 Constructing Explicit Demand Knowledge Database
- 3.2 Calculate Implicit Will Influence Value
- 3.3 Calculate the Predictive Value of the Composite Purchase Behavior
- 4 Experimental Analysis
- 5 Conclusion
- References
- Personalized Recommendation System on Hadoop and HBase
- Abstract
- 1 Introduction
- 2 Personalized Recommendation System and Hadoop, HBase
- 2.1 Personalized Recommendation System
- 2.2 Hadoop Platform
- 2.3 HBase Platform
- 3 Personalized Recommendation System Algorithm Optimization
- 4 Data and Experiment
- 4.1 Experimental Data
- 4.2 Experiment and Result Analysis
- 4.2.1 Data Storage Optimization Experiment
- 4.2.2 Recommended Algorithm Optimization Experiments
- 4.2.3 Recommended System Results
- 5 Conclusion
- References
- A Social Stability Analysis System Based on Web Sensitive Information Mining
- Abstract
- 1 Introduction
- 2 Related Work
- 3 Social Stability Analysis Technology Framework
- 3.1 The Web Mining Layer
- 3.2 The Knowledge Discovery Layer
- 3.3 Quantitative Calculation of Social Stability Index
- 3.4 The Data Presentation Layer
- 4 Prototype System Demonstration
- 4.1 System Construction
- 4.2 Case Study
- 5 Evaluation and Analysis
- 6 Conclusions and Future Work
- Acknowledgments
- References
- Energy Conservation Strategy for Big News Data on HDFS
- Abstract
- 1 Overview
- 2 Access Pattern of News Data
- 3 HDFS Energy-Conservation Storage Strategy
- 3.1 Data Nodes Partition Strategy
- 3.2 Two Strategies of Priority Allocation
- 3.2.1 Active State Node Priority (ASNP)
- 3.2.2 Lower Than Average Utilization Rate Node Priority (LANP)
- 3.3 File Migration Strategy
- 3.4 Nodes Standby Strategy
- 4 Experiment Results and Analysis
- 4.1 Analysis of Cluster Power Consumption
- 4.2 Analysis of Cold Data Nodes Utilization Rate
- 4.3 Analysis of File Migration
- 4.4 Impact on the Response Time of Reading
- 5 Conclusions and Future Work
- References
- Research on Jukes-Cantor Model Parallel Algorithm Based on OpenMP
- Abstract
- 1 Introduction
- 2 Jukes-Cantor Model
- 3 Parallel Algorithm of Distance Model
- 3.1 The Design of Parallel Algorithm
- 3.1.1 OpenMP Parallel Technique
- 3.1.2 The Design of Parallel Algorithm
- 3.2 Parallel Algorithm Based on OpenMP
- 4 Results of Experiments
- 5 Conclusion
- Acknowledgment
- References
- Rough Control Rule Mining Model Based on Decision Interval Concept Lattice and Its Application
- Abstract
- 1 Introduction
- 2 Decision Interval Concept Lattice
- 2.1 Basic Concepts
- 2.2 Decision Interval Rule
- 2.3 Construction Algorithm for Decision Interval Concept Lattice
- 2.4 Mining Algorithm for Decision Interval Rule
- 3 Mining Model of Decisions Interval Rule in Rough Control
- 3.1 Model Design
- 3.2 Model Analysis
- 4 Case Study
- 5 Conclusions
- Acknowledgements
- References
- Research on Association Analysis Technology of Network User Accounts
- 1 Introduction
- 2 Basis of Association Analysis on Network User Accounts
- 2.1 User Naming Conventions
- 2.2 User Profiles
- 2.3 User Writing Styles
- 2.4 User Online Behavior
- 2.5 User Community Relationship
- 3 Method of Association Analysis on Network User Accounts
- 3.1 The Analysis Method Based on User Naming Conventions
- 3.2 The Analysis Method Based on User Profiles
- 3.3 The Analysis Method Based on User Writing Styles
- 3.4 The Analysis Method Based on User Online Behavior
- 3.5 The Analysis Method Based on User Community Relationship
- 4 Future Direction
- References
- A Distributed Query Method for RDF Data on Spark
- Abstract
- 1 Introduction
- 2 Preliminaries
- 2.1 RDF and SPARQL
- 2.2 Spark and RDD
- 2.3 MemSQL
- 3 RQCCP Algorithm
- 3.1 Split and Storage
- 3.2 Index and Query
- 4 Experiment and Evaluation
- 4.1 Cluster Configuration
- 4.2 Datasets
- 4.3 Experimental Results
- 5 Conclusions
- References
- Real-Time Monitoring and Forecast of Active Population Density Using Mobile Phone Data
- 1 Introduction
- 2 Related Work
- 3 Data Description
- 4 Real-Time Active Population Monitoring and Forecasting
- 4.1 Area of Service Coverage for a Base Station
- 4.2 Active Users Within Service Coverage of a Base Station
- 4.3 Real-Time Monitoring and Forecast of Active Population Density
- 4.4 Forecast Methods
- 5 Experiments
- 5.1 Environment Setup
- 5.2 Evaluation Method
- 5.3 Results
- 6 Conclusion
- References
- Ranking-Based Recommendation System with Text Modeling
- Abstract
- 1 Introduction
- 2 Related Work
- 2.1 Rating Predicted Model
- 2.2 HFT Model
- 3 Model
- 3.1 Ranking-Based Matrix Factorization Model
- 3.2 Text Modeling
- 3.3 Model Merging
- 4 Experiment
- 4.1 Rating-Based Model and Ranking-Based Model
- 4.2 Text Modeling
- 4.3 Compare to HFT
- 5 Conclusions and Future Work
- References
- Design and Implementation of a Project Management System Based on Product Data Management on the Baidu Cloud Computing Platform
- Abstract
- 1 Introduction
- 2 Project Management in the PDM
- 2.1 The Basic Concept of Project Management
- 2.2 Organization Structure Model of Project Management
- 2.3 Function Analysis of PMS
- 2.4 Data Analysis in the PMS
- 3 System Architecture: Baidu Cloud Computing Platform
- 4 Software Architecture Design of a PMS System Based on BAE
- 5 Use-Case Testing and Implementation of the PMS
- 6 Conclusion
- Acknowledgments
- References
- A Ring Signature Based on LDGM Codes
- Abstract
- 1 Introduction
- 2 Ring Signature Using LDGM Codes
- 2.1 Security Model
- 2.2 Scheme Description
- 2.3 Analysis of the Scheme
- 3 Conclusion
- References
- SparkSCAN: A Structure Similarity Clustering Algorithm on Spark
- Abstract
- 1 Introduction
- 2 Preliminary
- 2.1 Spark
- 2.2 Rdd
- 2.3 PDirSCAN Algorithm
- 3 SparkSCAN
- 3.1 Data Structure of SparkSCAN
- 3.2 Parallel Recognition epsilon Neighbors and Core Nodes
- 3.3 Extension and Synchronization of Cluster Label in Parallel
- 3.4 Clustering Result Analysis
- 4 Evaluation
- 4.1 Data-sets
- 4.2 Algorithm Evaluation Index
- 4.3 Environment of Experiment
- 4.4 Parameters of Cluster
- 4.5 Results and Analysis of Experiment
- 5 Conlusion
- References
- Using Distant Supervision and Paragraph Vector for Large Scale Relation Extraction
- 1 Introduction
- 2 Related Works
- 2.1 Distant Supervision
- 2.2 Paragraph Vector
- 3 Task Definition
- 4 Modeling Framework
- 4.1 Fuzzy Classification Based Multi-instance Multi-label Learning
- 4.2 Paragraph Vector
- 4.3 Training & Inference
- 5 Experiments
- 5.1 FC-MIML-RE
- 5.2 Paragraph Vector
- 5.3 Comprehensive Experiment
- 6 Conclusions
- References
- Computer Assisted Language Testing and the Washback Effect on Language Learning
- Abstract
- 1 Introduction
- 2 Theoretical Framework
- 2.1 Computer Assisted Language Testing
- 2.2 Washback Effect
- 3 Preliminary Practice of Computer Assisted Spoken English Testing
- 3.1 Preparation
- 3.2 Implementation
- 3.3 Evaluation
- 4 Result and Discussion
- 4.1 Result
- 4.2 Discussion
- 5 Washback Effect on Language Learning
- 5.1 Positive Effect
- 5.2 Negative Effect
- 6 Suggestions
- 6.1 To Improve Teachers' Level of Using Network Technology in Teaching
- 6.2 To Improve Students' Autonomous Learning Competence
- 6.3 The Relevant Teaching Management Departments Will Do Their Duties
- 7 Conclusion
- References
- Using Class Based Document Frequency to Select Features in Text Classification
- Abstract
- 1 Introduction
- 2 Related Work
- 3 Class Based Document Frequency for Feature Selection
- 4 Experiments and Discussion
- 4.1 Datasets
- 4.2 Experimental Settings
- 4.3 Evaluation Metric
- 4.4 Results and Discussion
- 5 Conclusions and Future Work
- Acknowledgments
- References
- A Generalized Location Privacy Protection Scheme in Location Based Services
- Abstract
- 1 Introduction
- 2 Related Works
- 3 Our Scheme
- 3.1 Basic Definition
- 3.2 LPPS-GKA
- 4 Discussion and Analysis
- 4.1 Geo-Indistinguishability
- 4.2 Feasibility of Services
- 4.3 Performance
- 5 Conclusion and Future Work
- References
- Cooperation Oriented Computing: A Computing Model Based on Emergent Dynamics of Group Cooperation
- Abstract
- 1 Introduction
- 2 Motivation
- 3 Related Works
- 4 Cooperation Rules
- 5 Evolution Algorithm Based on Cooperation Rules
- 5.1 Algorithm Framework
- 5.2 Theoretical Analyses
- 6 Experimental Results
- 6.1 Experiment Setting
- 6.2 Experiments on Synthetic Data
- 6.3 Experiments on Real Data
- 7 Conclusion and Future Work
- References
- SVR Recommendation Algorithm of Civil Aviation Auxiliary Service Based on Context-Awareness
- Abstract
- 1 Introduction
- 2 Construct the User Preference Model
- 2.1 Construct the User Attributes Model
- 2.2 Construct the User Preference Model
- 2.3 Construct the Contextual User Preference Model
- 3 Nonlinear SVR Algorithm
- 4 SVR Recommendation Algorithm Based on Context
- 5 The Application of the Passenger Auxiliary Service of Civil Aviation
- 5.1 Analysis of the Passenger Ticket Booking Data Set
- 5.2 Pre-processing of the Passenger Ticket Booking Data Set of Civil Aviation
- 5.3 Evaluation Metric
- 5.4 Experimental Results and Analyses
- 5.4.1 Analysis of the Influence of SVR Parameters' Selection on MAE
- 5.4.2 Experimental Results
- 6 Conclusion
- Acknowledgment
- References
- Crystal MD: Molecular Dynamic Simulation Software for Metal with BCC Structure
- Abstract
- 1 Introduction
- 2 Crystal Structure and MD Calculation
- 2.1 Crystal Structure
- 2.2 MD Calculation
- 3 Data Structure in Crystal MD
- 3.1 Data Structure Design for MD Simulation with BCC Structure
- 4 Communication Scheme in Crystal MD
- 5 Performance Analysis and Discussion
- 5.1 Performance Test and Discussion
- 5.2 Memory Usage Test
- 6 Conclusions
- Acknowledgements
- References
- GPU Acceleration of the Locally Selfconsistent Multiple Scattering Code for First Principles Calculation of the Ground State and Statistical Physics of Materials
- 1 Multiple Scattering Theory
- 1.1 The LSMS Algorithm
- 1.2 Scattering Matrix Construction
- 1.3 Matrix Inversion
- 2 Wang-Landau Monte-Carlo Sampling
- 3 Scaling and Performance
- 4 Applications
- 5 Conclusions
- References
- Kernel Optimization on Short-Range Potentials Computations in Molecular Dynamics Simulations
- Abstract
- 1 Introduction
- 2 Related Work and Background
- 2.1 Related Work
- 2.1.1 Multi-threading for Molecular Dynamics
- 2.1.2 SIMD for Molecular Dynamics
- 2.2 Background
- 3 Kernel Optimization of Molecular Dynamics
- 3.1 Efficient Multi-threading Implementation
- 3.1.1 Principle of PTS Algorithm
- 3.1.2 Implement the PTS Method Using OpenMP
- 3.1.3 Limitation of PTS Method
- 3.2 Improved SIMD Utilization
- 3.2.1 Cut-Off if Statements
- 3.2.2 Modified Pre-searching Neighbor Method
- 4 Experiments Results and Analysis
- 4.1 Experiment Result and Analysis of Multi-threading Optimization
- 4.2 Experiment Result and Analysis of Improved SIMD Implementation
- 5 Conclusions
- References
- Optimizing Parallel Kinetic Monte Carlo Simulation by Communication Aggregation and Scheduling
- Abstract
- 1 Introduction
- 2 Related Work
- 3 Communication Aggregation
- 3.1 Surface Adjacent Sectors Situation
- 3.2 Edge Adjacent Sectors Situation
- 3.3 Diagonally Adjacent Sectors Situation
- 4 Neighborhood Collective Communication Optimization
- 4.1 Graph Topology Processes
- 4.2 Optimizing Communication Scheduling
- 5 Experimental Evaluation
- 5.1 Experimental Environment
- 5.2 Parallel Performance
- 5.3 Scalability
- 6 Conclusions and Future Work
- References
- A Study on Process Model of Computing Similarity Between Product Features and Online Reviews
- Abstract
- 1 Introduction
- 2 Research Design
- 3 Framework of Process Model of Review Similarity Computing
- 3.1 Text Preprocessing
- 3.2 Feature Selection and Weighting
- 3.3 Similarity Calculation [14, 15]
- 4 Model Application and Effect Analysis
- 4.1 Treatment Effect of Product Descriptions
- 4.2 Treatment Effect of Product Reviews
- 4.3 Results of Similarity Calculation and Effects
- 5 Conclusion
- References
- Research on the Tendency of Consumer Online Shopping Based on Improved TOPSIS Method
- Abstract
- 1 Introduction
- 2 Improvement on the TOPSIS
- 2.1 Traditional TOPSIS
- 2.2 Improve on TOPSIS
- 2.2.1 Determine the Weight Using Entropy Method
- 2.2.2 The TOPSIS Algorithm Based on Fuzzy Number
- 3 Algorithm Application
- 3.1 Product Category
- 3.2 Determination of the Consumers' Attributes Value and the Corresponding Fuzzy Weights
- 4 Conclusion
- Acknowledgement
- References
- Author Index
System requirements
File format: PDF
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use the free software Adobe Reader, Adobe Digital Editions, or any other PDF viewer of your choice (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or another reading app for eBooks, e.g., PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Watermark-DRM, a „soft” copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.