
Streaming Architecture
Description
More and more data-driven companies are looking to adopt stream processing and streaming analytics. With this concise ebook, you'll learn best practices for designing a reliable architecture that supports this emerging big-data paradigm.
Authors Ted Dunning and Ellen Friedman (Real World Hadoop) help you explore some of the best technologies to handle stream processing and analytics, with a focus on the upstream queuing or message-passing layer. To illustrate the effectiveness of these technologies, this book also includes specific use cases.
Ideal for developers and non-technical people alike, this book describes:
- Key elements in good design for streaming analytics, focusing on the essential characteristics of the messaging layer
- New messaging technologies, including Apache Kafka and MapR Streams, with links to sample code
- Technology choices for streaming analytics: Apache Spark Streaming, Apache Flink, Apache Storm, and Apache Apex
- How stream-based architectures are helpful to support microservices
- Specific use cases such as fraud detection and geo-distributed data streams
Ted Dunning is Chief Applications Architect at MapR Technologies and is active in the open source community. He currently serves as VP for Incubator at the Apache Software Foundation, as a champion and mentor for a large number of projects, and as a committer and PMC member of the Apache ZooKeeper and Drill projects. Ted is on Twitter as @ted_dunning.
Ellen Friedman, a committer for the Apache Drill and Apache Mahout projects, is a solutions consultant and well-known speaker and author, currently writing mainly about big data topics. With a PhD in Biochemistry, she has years of experience as a research scientist and has written about a variety of technical topics. Ellen is on Twitter as @Ellen_Friedman.

Content
- Cover
- Copyright
- Table of Contents
- Preface
- Who Should Use This Book
- What Is Covered
- Conventions Used in This Book
- Safari® Books Online
- How to Contact Us
- Chapter 1. Why Stream?
- Planes, Trains, and Automobiles: Connected Vehicles and the IoT
- Streaming Data: Life As It Happens
- Where Streaming Matters
- Beyond Real Time: More Benefits of Streaming Architecture
- Emerging Best Practices for Streaming Architectures
- Healthcare Example with Data Streams
- Streaming Data as a Central Aspect of Architectural Design
- Chapter 2. Stream-based Architecture
- A Limited View: Single Real-Time Application
- Key Aspects of a Universal Stream-based Architecture
- Importance of the Messaging Technology
- Choices for Real-Time Analytics
- Apache Storm
- Apache Spark Streaming
- Apache Flink
- Apache Apex
- Comparison of Capabilities for Streaming Analytics
- Summary
- Chapter 3. Streaming Architecture: Ideal Platform for Microservices
- Why Microservices Matter
- What Is Needed to Support Microservices
- Microservices in More Detail
- Designing a Streaming Architecture: Online Video Service Example
- A New Design: Infrastructure to Support Messaging
- Importance of a Universal Microarchitecture
- What's in a Name?
- Why Use Distributed Files and NoSQL Databases?
- New Design for the Video Service
- Summary: The Converged Platform View
- Chapter 4. Kafka as Streaming Transport
- Motivations for Kafka
- Kafka Innovations
- Kafka Basic Concepts
- Ordering
- Persistence
- The Kafka APIs
- KafkaProducer API
- KafkaConsumer API
- Legacy APIs
- Kafka Utility Programs
- Load Balancing
- Mirroring
- Kafka Gotchas
- Kafka in Production Settings
- Limited Number of Topics and Partitions
- Manual Balancing of Partitions and Load
- No Inherent Serialization Mechanism
- Mirroring Deficiencies
- Summary
- Chapter 5. MapR Streams
- Innovations in MapR Streams
- History and Context of MapR's Streaming System
- How MapR Streams Works
- How to Configure MapR Streams
- Geo-Distributed Replication
- MapR Streams Gotchas
- Chapter 6. Fraud Detection with Streaming Data
- Card Velocity
- Fast Response Decision to the Question: "Is It Fraud?"
- Multiuse Streaming Data
- Scaling Up the Fraud Detector
- Summary
- Chapter 7. Geo-Distributed Data Streams
- Stakeholders
- Design Goals
- Design Choices
- Our Design
- Follow the Data
- Control Who Has Access to Stream Data
- Advantages of Streams-based Geo-Replication
- Chapter 8. Putting It All Together
- Benefits of Stream-based Architectures
- Making the Transition to Streaming Architecture
- Conclusion
- Appendix A. Additional Resources
- Streaming Data Topics
- Selected O'Reilly Publications by the Authors
- About the Authors
System requirements
File format: PDF
Copy-Protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, PocketBook, Sony, Tolino and many more (Kindle: limited support only).
The PDF file format always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). On the small screens of e-readers or smartphones, however, PDFs are awkward to read because they require a lot of scrolling.
This eBook uses Adobe-DRM, a "hard" copy protection. If the necessary requirements are not met, you will not be able to open the eBook, so prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise any reading software with your personal Adobe ID after installing it.
For more information, see our eBook Help page.