
Architecting HBase Applications
Beschreibung
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
Weitere Details
Weitere Ausgaben
Andere Ausgaben

Inhalt
- Intro
- Copyright
- Table of Contents
- Foreword
- Preface
- Who Should Read This Book?
- How This Book Is Organized
- Additional Resources
- Conventions Used in This Book
- Using Code Examples
- Safari® Books Online
- How to Contact Us
- Acknowledgments
- From Kevin
- From Jean-Marc
- Part I. Introduction to HBase
- Chapter 1. What Is HBase?
- Column-Oriented Versus Row-Oriented
- Implementation and Use Cases
- Chapter 2. HBase Principles
- Table Format
- Table Layout
- Table Storage
- Internal Table Operations
- Compaction
- Splits (Auto-Sharding)
- Balancing
- Dependencies
- HBase Roles
- Master Server
- RegionServer
- Thrift Server
- REST Server
- Chapter 3. HBase Ecosystem
- Monitoring Tools
- Cloudera Manager
- Apache Ambari
- Hannibal
- SQL
- Apache Phoenix
- Apache Trafodion
- Splice Machine
- Honorable Mentions (Kylin, Themis, Tephra, Hive, and Impala)
- Frameworks
- OpenTSDB
- Kite
- HappyBase
- AsyncHBase
- Chapter 4. HBase Sizing and Tuning Overview
- Hardware
- Storage
- Networking
- OS Tuning
- Hadoop Tuning
- HBase Tuning
- Different Workload Tuning
- Chapter 5. Environment Setup
- System Requirements
- Operating System
- Virtual Machine
- Resources
- Java
- HBase Standalone Installation
- HBase in a VM
- Local Versus VM
- Local Mode
- Virtual Linux Environment
- QuickStart VM (or Equivalent)
- Troubleshooting
- IP/Name Configuration
- Access to the /tmp Folder
- Environment Variables
- Available Memory
- First Steps
- Basic Operations
- Import Code Examples
- Testing the Examples
- Pseudodistributed and Fully Distributed
- Part II. Use Cases
- Chapter 6. Use Case: HBase as a System of Record
- Ingest/Pre-Processing
- Processing/Serving
- User Experience
- Chapter 7. Implementation of an Underlying Storage Engine
- Table Design
- Table Schema
- Table Parameters
- Implementation
- Data conversion
- Generate Test Data
- Create Avro Schema
- Implement MapReduce Transformation
- HFile Validation
- Bulk Loading
- Data Validation
- Table Size
- File Content
- Data Indexing
- Data Retrieval
- Going Further
- Chapter 8. Use Case: Near Real-Time Event Processing
- Ingest/Pre-Processing
- Near Real-Time Event Processing
- Processing/Serving
- Chapter 9. Implementation of Near Real-Time Event Processing
- Application Flow
- Kafka
- Flume
- HBase
- Lily
- Solr
- Implementation
- Data Generation
- Kafka
- Flume
- Serializer
- HBase
- Lily
- Solr
- Testing
- Going Further
- Chapter 10. Use Case: HBase as a Master Data Management Tool
- Ingest
- Processing
- Chapter 11. Implementation of HBase as a Master Data Management Tool
- MapReduce Versus Spark
- Get Spark Interacting with HBase
- Run Spark over an HBase Table
- Calling HBase from Spark
- Implementing Spark with HBase
- Spark and HBase: Puts
- Spark on HBase: Bulk Load
- Spark Over HBase
- Going Further
- Chapter 12. Use Case: Document Store
- Serving
- Ingest
- Clean Up
- Chapter 13. Implementation of Document Store
- MOBs
- Storage
- Usage
- Too Big
- Consistency
- Going Further
- Part III. Troubleshooting
- Chapter 14. Too Many Regions
- Consequences
- Causes
- Misconfiguration
- Misoperation
- Solution
- Before 0.98
- Starting with 0.98
- Prevention
- Regions Size
- Key and Table Design
- Chapter 15. Too Many Column Families
- Consequences
- Memory
- Compactions
- Split
- Causes, Solution, and Prevention
- Delete a Column Family
- Merge a Column Family
- Separate a Column Family into a New Table
- Chapter 16. Hotspotting
- Consequences
- Causes
- Monotonically Incrementing Keys
- Poorly Distributed Keys
- Small Reference Tables
- Applications Issues
- Meta Region Hotspotting
- Prevention and Solution
- Chapter 17. Timeouts and Garbage Collection
- Consequences
- Causes
- Storage Failure
- Power-Saving Features
- Network Failure
- Solutions
- Prevention
- Reduce Heap Size
- Off-Heap BlockCache
- Using the G1GC Algorithm
- Configure Swappiness to 0 or 1
- Disable Environment-Friendly Features
- Hardware Duplication
- Chapter 18. HBCK and Inconsistencies
- HBase Filesystem Layout
- Reading META
- Reading HBase on HDFS
- General HBCK Overview
- Using HBCK
- Index
- About the Authors
- Colophon
Systemvoraussetzungen
Dateiformat: PDF
Kopierschutz: Adobe-DRM (Digital Rights Management)
Systemvoraussetzungen:
- Computer (Windows; MacOS X; Linux): Installieren Sie bereits vor dem Download die kostenlose Software Adobe Digital Editions (siehe E-Book Hilfe).
- Tablet/Smartphone (Android; iOS): Installieren Sie bereits vor dem Download die kostenlose App Adobe Digital Editions oder die App PocketBook (siehe E-Book Hilfe).
- E-Book-Reader: Bookeen, Kobo, Pocketbook, Sony, Tolino u.v.a.m. (nicht Kindle)
Das Dateiformat PDF zeigt auf jeder Hardware eine Buchseite stets identisch an. Daher ist eine PDF auch für ein komplexes Layout geeignet, wie es bei Lehr- und Fachbüchern verwendet wird (Bilder, Tabellen, Spalten, Fußnoten). Bei kleinen Displays von E-Readern oder Smartphones sind PDF leider eher nervig, weil zu viel Scrollen notwendig ist.
Mit Adobe-DRM wird hier ein „harter” Kopierschutz verwendet. Wenn die notwendigen Voraussetzungen nicht vorliegen, können Sie das E-Book leider nicht öffnen. Daher müssen Sie bereits vor dem Download Ihre Lese-Hardware vorbereiten.
Bitte beachten Sie: Wir empfehlen Ihnen unbedingt nach Installation der Lese-Software diese mit Ihrer persönlichen Adobe-ID zu autorisieren!
Weitere Informationen finden Sie in unserer E-Book Hilfe.