
Oracle Big Data Handbook
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
More details
Other editions
Additional editions

Content
- Cover
- Title Page
- Copyright Page
- About the Authors
- Contents at a Glance
- Contents
- Acknowledgments
- Introduction
- Part I: Introduction
- Chapter 1: Introduction to Big Data
- Big Data
- Google's MapReduce Algorithm and Apache Hadoop
- Oracle's Big Data Platform
- Summary
- Chapter 2: The Value of Big Data
- Am I Big Data, or Is Big Data Me?
- Big Data, Little Data-It's Still Me
- What Happened?
- Now What?
- Reality, Check Please!
- What Do You Make of It?
- Information Chain Reaction (ICR)
- Big Data, Big Numbers, Big Business?
- Internal Source
- ICR: Connect
- ICR: Change
- Wanted: Big Data Value
- Big Data Example 1: Clinical Trial Research Within the Healthcare Industry
- Example 2: Improvements in Car Design for Driver Safety Within the Automotive Industry
- Summary
- Part II: Big Data Platform
- Chapter 3: The Apache Hadoop Platform
- Software vs. Hardware
- The Hadoop Software Platform
- Hadoop Distributions and Versions
- The Hadoop Distributed File System (HDFS)
- Scheduling, Compute, and Processing
- Operating System Choices
- I/O and the Linux Kernel
- The Hadoop Hardware Platform
- CPU and Memory
- Network
- Disk
- Putting It All Together
- Chapter 4: Why an Appliance?
- Why Would Oracle Create a Big Data Appliance?
- What Is an Appliance?
- What Are the Goals of Oracle Big Data Appliance?
- Optimizing an Appliance
- Oracle Big Data Appliance Version 2 Software
- Oracle Big Data Appliance X3-2 Hardware
- Where Did Oracle Get Hadoop Expertise?
- Configuring a Hadoop Cluster
- Choosing the Core Cluster Components
- Assembling the Cluster
- What About a Do-It-Yourself Cluster?
- Total Costs of a Cluster
- Time to Value
- How to Build Out Larger Clusters
- Can I Add Other Software to Oracle Big Data Appliance?
- Drawbacks of an Appliance
- Chapter 5: BDA Configurations, Deployment Architectures, and Monitoring
- Introduction
- Big Data Appliance X3-2 Full Rack (Eighteen Nodes)
- Big Data Appliance X3-2 Starter Rack (Six Nodes)
- Big Data Appliance X3-2 In-Rack Expansion (Six Nodes)
- Hardware Modifications to BDA
- Software Supported on Big Data Appliance X3-2
- BDA Install and Configuration Process
- Critical and Noncritical Nodes
- Automatic Failover of the NameNode
- BDA Disk Storage Layout
- Adding Storage to a Hadoop Cluster
- Hadoop-Only Config and Hadoop+NoSQL DB
- Hadoop-Only Appliance
- Hadoop and NoSQL DB
- Memory Options
- Deployment Architectures
- Multitenancy and Hadoop in the Cloud
- Scalability
- Multirack BDA Considerations
- Installing Other Software on the BDA
- BDA in the Data Center
- Administrative Network
- Client Access Network
- InfiniBand Private Network
- Network Requirements
- Connecting to Data Center LAN
- Example Connectivity Architecture
- Oracle Big Data Appliance Restrictions on Use
- BDA Management and Monitoring
- Enterprise Manager
- Cloudera Manager
- Hadoop Monitoring Utilities: Web GUI
- Oracle ILOM
- Hue
- DCLI Utility
- Chapter 6: Integrating the Data Warehouse and Analytics Infrastructure to Big Data
- The Data Warehouse as a Historic Database of Record
- The Oracle Database as a Data Warehouse
- Why the Data Warehouse and Hadoop Are Deployed Together
- Completing the Footprint: Business Analyst Tools
- Building Out the Infrastructure
- Chapter 7: BDA Connectors
- Oracle Big Data Connectors
- Oracle Loader for Hadoop
- Online Mode
- Oracle OCI Direct Path Output
- JDBC Output
- Offline Mode
- Oracle Data Pump Output
- Delimited Text Output
- Installation of Oracle Loader for Hadoop
- Invoking Oracle Loader for Hadoop
- Input Formats
- DelimitedTextInputFormat
- RegexInputFormat
- AvroInputFormat
- HiveToAvroInputFormat
- KVAvroInputFormat
- Custom Input Formats
- Oracle Loader for Hadoop Configuration Files
- Loader Maps
- Additional Optimizations
- Leveraging InfiniBand
- Comparison to Apache Sqoop
- Oracle SQL Connector for HDFS
- Installation of Oracle SQL Connector for HDFS
- HIVE Installation
- Creating External Tables Using Oracle SQL Connector for HDFS
- ExternalTable Configuration Tool
- Data Source Types
- Configuration Tool Syntax
- Required Properties
- Optional Properties
- ExternalTable Tool for Delimited Text Files
- Testing DDL with --noexecute
- Adding a New HDFS File to the Location File
- Manual External Table Configuration
- Hive Sources
- ExternalTable Example
- Oracle Data Pump Sources
- Configuration Files
- Querying with Oracle SQL Connector for HDFS
- Oracle R Connector for Hadoop
- Oracle Data Integrator Application Adapter for Hadoop
- Chapter 8: Oracle NoSQL Database
- What Is a NoSQL Database System?
- NoSQL Applications
- Oracle NoSQL Database
- A Sample Use Case
- Architecture
- Client Driver
- Key-Value Pairs
- Storage Nodes
- Replication
- Smart Topology
- Online Elasticity
- No Single Point of Failure
- Data Management
- APIs
- CRUD Operations
- Multiple Update Operations
- Lookup Operations
- Transactions
- Predictable Performance
- Integration
- Installation and Administration
- Simple Installation
- Administration
- How Oracle NoSQL Database Stacks Up
- Useful Links
- Part III: Analyzing Information and Making Decisions
- Chapter 9: In-Database Analytics: Delivering Faster Time to Value
- Introduction
- Oracle's In-Database Analytics
- Why Running In-Database Is So Important
- Introduction to Oracle Data Mining and Statistical Analysis
- Oracle's In-Database Advanced Analytics
- Oracle Data Mining
- Introduction to R
- Text Mining
- In-Database Statistical Functions
- Making BI Tools Smarter
- Spatial Analytics
- Understanding the Spatial Data Model
- Querying the Spatial Data Model
- Using Spatial Analytics
- Making BI Tools Smarter
- Graph-Based Analytics
- Graph Data Model
- Querying Graph Data
- Multidimensional Analytics
- Making BI Tools Smarter and Faster
- In-Database Analytics: Bringing It All Together
- Integrating Analytics into Extract-Load-Transform Processing
- Delivering Guided Exploration
- Delivering Analytical Mash-ups
- Conclusion
- Chapter 10: Analyzing Data with R
- Introduction to Open Source R
- CRAN, Packages, and Task Views
- GUIs and IDEs
- Traditional R and Database Interaction vs. Oracle R Enterprise
- Oracle's Strategic R Offerings
- Oracle R Enterprise
- Oracle R Distribution
- ROracle
- Oracle R Connector for Hadoop
- Oracle R Enterprise: Next-Level View
- Oracle R Enterprise Installation and Configuration
- Using Oracle R Enterprise
- Transparency Layer
- Embedded R Execution
- Predictive Analytics
- Oracle R Connector for Hadoop
- Invoking MapReduce Jobs
- Testing ORCH R Scripts Without the Hadoop Cluster
- Interacting with HDFS from R
- HDFS Metadata Discovery
- Working with Hadoop Using the ORCH Framework
- ORCH Predictive Analytics on Hadoop
- ORCHhive
- Oracle R Connector for Hadoop and Oracle R Enterprise Interaction
- Summary
- Chapter 11: Endeca Information Discovery
- Why Did Oracle Select Endeca?
- Product Suites Overview
- Endeca Information Discovery Platform
- Major Functional Areas
- Key Features
- Endeca Information Discovery and Business Intelligence
- Difference in Roles and Functions
- BI Development Process vs. Information Discovery Approach
- Complementary But Not Exclusive
- Architecture
- Oracle Endeca Server
- Oracle Endeca Studio
- Oracle Endeca Integration Suite
- Endeca on Exalytics
- Scalability and Load Balancing
- Unifying Diverse Content Sets
- Endeca Differentiator
- Industry Use Cases
- Hands-On with Endeca
- Installation and Configuration
- Developing an Endeca Application
- Chapter 12: Big Data Governance
- Key Elements of Enterprise Data Governance
- Business Outcome
- Information Lifecycle Management
- Regulatory Compliance and Risk Management
- Metadata Management
- Data Quality Management
- Master and Reference Data Management
- Data Security and Privacy Management
- Business Process Alignment
- How Does Big Data Impact Enterprise Data Governance?
- Modeled Data vs. Raw Data
- Types of Big Data
- Applying Data Governance to Big Data
- Leveraging Big Data Governance
- Industry-Specific Use Cases
- Utilities
- Healthcare
- Financial Services
- Retail
- Consumer Packaged Goods (CPG)
- Telecommunications
- Oil and Gas
- How Does Big Data Impact Data Governance Roles?
- Governance Roles and Organization
- An Approach to Implementing Big Data Governance
- Chapter 13: Developing Architecture and Roadmap for Big Data
- Architecture Capabilities for Big Data
- New Characteristics of Big Data
- Conceptual Architecture Capabilities of Big Data
- Product Capabilities and Tools
- Making Big Data Architecture Decisions
- Architecture Development Process for Realizing Incremental Values
- Overview of Oracle Information Architecture Framework
- Overview of Applied OADP for Information Architecture
- Big Data Architecture Development Process
- Impact on Data Management and BI Processes
- Traditional BI Development Process
- Big Data and Analytics Development Process
- Big Data Governance
- Traditional Data Governance Focus
- New Focus for Governance in Big Data
- Developing Skills and Talent
- Data Scientist
- Big Data Developer
- Big Data Administrator
- Big Data Best Practices
- Align Big Data Initiative with Specific Business Goals
- Ensure a Centralized IT Strategy for Standards and Governance
- Use a Center of Excellence to Minimize Training and Risk
- Correlate Big Data with Structured Data
- Provide High-Performance and Scalable Analytical Sandboxes
- Reshape the IT Operating Model
- Index
System requirements
File format: ePUB
Copy protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (not Kindle).
The file format ePub works well for novels and non-fiction books – i.e., „flowing” text without complex layout. On an e-reader or smartphone, line and page breaks automatically adjust to fit the small displays.
This eBook uses Adobe-DRM, a „hard” copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise using your personal Adobe ID after installation of any reading software.
For more information, see our ebook Help page.
File format: PDF
Copy-Protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Adobe-DRM, a „hard” copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise using your personal Adobe ID after installation of any reading software.
For more information, see our eBook Help page.