Distributed Storage in Practice

Name: Distributed Storage in Practice
Brand: Wiley
Price: 116.99 EUR
Availability: OnlineOnly

Hui Li Hanxu Hou Hong Tan(Author)

Wiley (Publisher)

1st Edition

Published on 19. December 2025

400 pages

E-Book

ePUB with Adobe-DRM

System requirements

978-1-394-33476-6 (ISBN)

€116.99incl. 7% vat

System requirements

for ePUB with Adobe-DRM

E-Book Single Licence

Available for download

Description

More details

Other editions

Persons

Content

Foreword by Prof. Shuo-Yen Robert li xiii

Foreword by Prof. Weimin Zheng xv

About the Authors xvii

Preface xxi

Acknowledgments xxiii

1 Fundamentals of Distributed Storage 1

1.1 Introduction to Distributed Storage 1

1.1.1 Evolution of Storage Systems 2

1.1.2 Characteristics of Distributed Storage 4

1.2 Reliability of Distributed Storage 6

1.2.1 Availability and Reliability 6

1.2.2 Fault Tolerance 7

1.2.3 Reliability Metrics 10

1.3 Trends in Future Development 12

1.4 Chapter Summary 16

Discussion Questions 16

References 17

2 Codes for Distributed Storage 19

2.1 Preliminaries 20

2.1.1 MDS Codes and ECs 20

2.1.2 Classification of ECs 23

2.2 RS Codes 25

2.2.1 Original RS Code 25

2.2.2 CRS Code 26

2.2.2.1 Encoding Process of CRS Code 26

2.2.2.2 Decoding Process of CRS Code 29

2.3 Regenerating Codes 29

2.3.1 Introduction to RGCs 30

2.3.2 Constructions of RGCs 32

2.3.2.1 Interference Alignment 33

2.3.2.2 Repair by Transfer 35

2.3.2.3 Product-Matrix 37

2.3.3 Rack-aware Regenerating Codes 40

2.3.3.1 Cross-rack Repair Model 41

2.3.3.2 Optimal Trade-off in RRCs 42

2.3.3.3 Construction of Cross-rack RGCs 46

2.4 Array Codes 47

2.4.1 Introduction to Array Codes 47

2.4.2 A Coding Framework for Binary Array Codes 51

2.4.3 Binary Vandermonde Array Codes 53

2.4.3.1 Construction of BVACs 54

2.4.3.2 Decoding Methods of BVACs 56

2.4.4 Binary Cauchy Array Codes 59

2.4.4.1 Construction of BCACs 59

2.4.4.2 Decoding Methods of BCACs 61

2.4.5 Optimization of MDS Array Codes 65

2.4.5.1 Repair Bandwidth 65

2.4.5.2 Locally Repairable Property 67

2.4.5.3 Computational Complexity 68

2.5 Codes with Locality 70

2.5.1 Introduction to LRCs 71

2.5.1.1 Information Symbol Locality Locally Repairable Codes 71

2.5.1.2 All-symbol Locality Locally Repairable Codes 72

2.5.2 Locally Regenerating Codes 73

2.5.2.1 MSR-local Codes 74

2.5.2.2 MBR-local Codes 74

2.5.3 Integrated Interleaved codes 76

2.6 Chapter Summary 81

Discussion Questions 81

References 83

3 Optimization for Practicality 93

3.1 Practicality in System Level 93

3.1.1 Distributed Storage Based on MDS Codes 93

3.1.1.1 Storage Coding Based on MDS Codes 94

3.1.1.2 Data Storage Device 94

3.1.2 Trade-off of Replication and ECs 96

3.1.3 Coding Optimization for Realistic Scenarios 98

3.1.3.1 Optimization for Heterogeneous Systems 98

3.1.3.2 Optimization for Block Distribution 99

3.1.3.3 Optimization for System-level Coding 100

3.2 Practical ECs 102

3.2.1 BRS Codes 102

3.2.1.1 Encoding Process of BRS Code 102

3.2.1.2 Decoding Process of BRS Code 103

3.2.2 Binary Regenerating Codes 105

3.2.2.1 BMSR Codes 108

3.2.2.2 BMBR Codes 111

3.2.3 Butterfly Codes 114

3.2.3.1 Encoding Process of Butterfly Codes 115

3.2.3.2 Decoding Process of Butterfly Codes 116

3.2.4 Clay Codes 118

3.2.5 Practical LRCs 124

3.2.5.1 LRCs in WAS 124

3.2.5.2 LRCs in HDFS-RAID 127

3.3 Chapter Summary 129

Discussion Questions 129

References 130

4 Optimization for Fault Recovery 133

4.1 Optimization Based on ECs 133

4.1.1 Fractional Repetition Codes 134

4.1.2 Self-repairing Codes 139

4.1.3 RGCs for Multiple Failed Nodes 141

4.2 Optimization Based on Network Topology 143

4.2.1 Repair Tree: Fast Repair for Single Node 143

4.2.1.1 Decomposing Computation 146

4.2.1.2 Repair Model 147

4.2.1.3 Construction of Repair Tree 149

4.2.2 Tree-Structured Repair Scheme for Multiple Nodes 151

4.2.2.1 Preliminaries 153

4.2.2.2 Modeling and Optimization of Tree-Structure Repair 155

4.3 Chapter Summary 159

Discussion Questions 159

References 160

5 Distributed Storage Systems 165

5.1 Background and Introduction 165

5.2 Tectonic: Facebook's Scalable File Storage System 168

5.2.1 Overview 168

5.2.2 Architecture 169

5.2.3 Core Components and Functions 171

5.2.3.1 Chunk Store 171

5.2.3.2 Metadata Store 172

5.2.3.3 Client Library 173

5.2.3.4 Background Services 173

5.3 Ambry: LinkedIn's Scalable Geo-distributed Object Storage System 174

5.3.1 Overview 174

5.3.2 Architecture 175

5.3.3 Core Components and Functions 179

5.3.3.1 Cluster Manager 179

5.3.3.2 Frontend Layer 179

5.3.3.3 Data Layer 182

5.3.3.4 Load Balancing 183

5.3.3.5 Replication 184

5.4 Evolution of Alibaba's Cloud Block Storage 186

5.4.1 Elastic Block Storage I 186

5.4.2 Elastic Block Storage II 187

5.4.3 Elastic Block Storage III 190

5.5 Distributed Key-Value Storage Systems 192

5.5.1 Storage Engines 192

5.5.1.1 Hash Table 192

5.5.1.2 B+ Tree 194

5.5.1.3 LSM-Tree 195

5.5.2 Data Layout 197

5.5.3 Distributed Replication 198

5.5.4 Distributed Secondary Indexes 199

5.6 Chapter Summary 201

Discussion Questions 201

References 202

6 Integrating Blockchain in Distributed Storage Systems 207

6.1 Fundamentals of Blockchain 207

6.1.1 Introduction to Blockchain 207

6.1.2 Key Technologies 208

6.1.2.1 Smart Contracts 209

6.1.2.2 Consensus Mechanisms 209

6.2 Blockchain-based Decentralized Storage 217

6.2.1 InterPlanetary File System 217

6.2.1.1 System Architecture 218

6.2.1.2 Core Functions 219

6.2.1.3 Application Prospects 220

6.2.2 DDSs: Sia and Storj 221

6.2.2.1 Sia 221

6.2.2.2 Storj 226

6.3 DSS Based on Blockchain and Mimic Security 229

6.3.1 Background and Introduction 229

6.3.2 Mimic Distributed Secure Storage System 230

6.3.2.1 System Principle 230

6.3.2.2 System Architecture 232

6.3.2.3 Core Functions 234

6.3.3 Logging System Based on Consortium Blockchain 236

6.3.3.1 System Architecture 236

6.3.3.2 Core Function 237

6.4 Chapter Summary 240

Discussion Questions 240

References 241

7 Quantum Bit, Q-Computers, and Q-Networks 243

7.1 Fundamental Principles of Quantum Computing 243

7.1.1 The Concept and Properties of Quantum Bits 243

7.1.2 Quantum Gates and Quantum Circuits 252

7.1.2.1 The Bell Circuit 258

7.1.2.2 Reverse Bell Circuit 259

7.1.2.3 Greenberger-Horne-Zeilinger (GHZ) Circuit 259

7.2 Quantum Memory and Quantum Computer 263

7.2.1 Quantum Memory 263

7.2.1.1 Electromagnetically Induced Transparency Quantum Memory 264

7.2.1.2 DLCZ Memory Scheme 265

7.2.1.3 Quantum Memory Based on Faraday Interaction 266

7.2.1.4 Atomic Frequency Comb Quantum Memory 267

7.2.2 Quantum Computer 268

7.2.2.1 Ion Trap Quantum Computing 270

7.2.2.2 Superconducting Quantum Computing 271

7.2.2.3 Photonic Quantum Computing 271

7.2.2.4 Neutral Atom Quantum Computing 272

7.3 Quantum Secure Transmission 273

7.3.1 Quantum Key Distribution 274

7.3.2 Superdense Coding 276

7.3.3 Quantum Teleportation 277

7.3.4 Quantum Error Correction Codes 279

7.4 Quantum Computer Networks 282

7.4.1 Quantum Internet 282

7.4.2 Quantum Multi-identifier Network Architecture with Quantum Identifier 289

7.5 Chapter Summary 296

Discussion Questions 296

References 297

Index 301

1
Fundamentals of Distributed Storage

As the digital economy emerges as a new driver for economic and social development, accelerating the realization of data as a production factor has become an essential requirement to promote the digital economy and facilitate high-quality development. The advent of new technologies such as 5G, cloud computing, artificial intelligence (AI), high-performance computing (HPC), and big data has given rise to numerous emerging applications, leading to an explosive growth of data volume. The aggregation, circulation, and application of massive data have placed higher demands on data storage systems. Distributed storage, as a critical infrastructure, represents an optimal solution for substantial volumes of data. Therefore, it is imperative to expedite research and development efforts related to super-large-scale, highly reliable, highly available, and high-performance distributed storage in order to adapt to the expanding industrial spaces and evolving application scenarios.

This chapter first introduces the evolution and characteristics of distributed storage and then focuses on the concept of data reliability, as well as on the common fault-tolerant mechanisms. Finally, it discusses the future trends of distributed storage from a macro perspective.

1.1 Introduction to Distributed Storage

More than 60 years ago, the first-generation computer ENIAC (McCartney 1999, pp. 1-11) could execute over 5,000 additions or more than 400 multiplications per second with only 100 B of magnetic core storage capacity. However, a compressed digital photo with a resolution of 3,600 × 2,700 pixels is approximately 4 MB in size, while a photograph of the spectacular spiral galaxy M81 captured by the Hubble Space Telescope occupies 689 MB. The storage capacity required for high-definition movie playback is around 20 GB, and this necessitates an input/output (I/O) sustained data transfer rate of at least 3 MB/s. In addition to capacity requirements, many applications impose stringent demands on response time, throughput, transmission bandwidth, and so on. These escalating demands propel the evolution of storage systems from small to large, slow to fast, simple to complex, and single-machine to a distributed system; they also present higher-level requirements for robustness, availability, reliability, and performance.

1.1.1 Evolution of Storage Systems

Generalized storage devices include CPU registers, multi-level caches, memory, and the external storage system. The first three are also known as the memory system. In a narrow sense, storage systems usually refer to external storage systems. This book also mainly centers on the external storage systems.

The data units of a computer are byte (B), kilobyte (KB), megabyte (MB), gigabyte (GB), terabyte (TB), petabyte (PB), exabyte (EB), zettabyte (ZB), and so on. One byte represents one character of information and is composed of eight binary bits. The conversion relations between the units are as follows: 1 KB is equal to 1024 B, 1 MB is equal to 1024 KB, 1 GB is equal to 1024 MB, and 1 TB is equal to 1024 GB.

If users usually only create simple documents or spreadsheets, then most files may be a few hundred KBs or occasionally a few MBs in size. However, if the computer is used to store and process digital photos, then the average file size will be in the range of several MBs. When the computer is used to edit and store videos, the size of a single file may be tens of MB or even several GB. For example, one hour of DV format video footage consumes approximately 12 GB; one minute of uncompressed 1,920 × 1,080 high-definition video is approximately 9 GB. Therefore, adequate and scalable storage capacity is very important.

Apart from capacity, the update and access frequency of stored data is also a key factor to be considered in the selection of external storage devices. For example, digital photographers may have incremental storage capacity requirements. With each shoot, they will add several hundred MB or several GB of photos as backups to the storage system and require these photos to be stored permanently without being altered. In contrast, three-dimensional (3D) computer animation may require regularly repeated access to several GB of data and replace the previously stored data. In this case, not only rewritable storage media is needed, but higher requirements are also imposed on the response speed. It is these various requirements that have given rise to different storage systems. Next, the development of storage systems is introduced in chronological order.

Hard Disk:
Most desktop computers and laptops nowadays still rely on rotational hard disks as high-capacity storage devices for storing the operating system kernel, applications, user data, and so on. Traditional rotational hard disk drives consist of one or more disks that are stacked together and covered with surface magnetic media that can be driven by write heads and read heads. Hard disk drives can transfer data directly to other computer hardware through three interface types: SATA, IDE/UDMA, or SCSI, and their speed ranges from 4,200 to 15,000 rpm.
Redundant Array of Independent Disks:
Redundant array of independent disks (RAID) (Perumal and Kritzinger 2004, pp. 1-9) refers to the use of multiple hard disks connected together in servers and high-end personal computer workstations and to the operation of storing data through multiple physical drives. There are many available RAID configurations, of which the three basic ones are explained as follows:
1. RAID 0 divides a data file into blocks and stores them on two or more hard disks, with each hard disk storing the data blocks evenly. Parallel read operations can be used to improve system performance without increasing capacity.
2. RAID 1 employs data mirroring to symmetrically distribute data blocks across two or more disks, effectively reducing the risk of disk faults. This redundancy ensures that in the event of a disk fault, the complete dataset can still be retrieved from other disks.
3. RAID 5 requires at least three disks. Similar to RAID 0, it not only spans multiple disks to store different data blocks but also simultaneously stores parity blocks for these data blocks. This configuration ensures that in the event of a single disk fault, the failed data can be recovered using the parity blocks stored on other disk drives.
Other RAID configurations are generally further optimized or combined from the basic configurations mentioned earlier, such as increasing the fault-tolerant level of the disks and expanding to disk arrays across different regions.
Distributed Storage:
The exponential growth in data volume has necessitated more rigorous storage requirements, including scalable storage capacity, enhanced reliability, increased throughput, and reduced cost. Traditional single-machine storage such as RAID is no longer suitable for large-scale data. Distributed storage systems have emerged as a viable solution. Generic distributed storage involves the dispersion of data across multiple servers in a cluster of storage nodes, and the synchronization or access of data through a distributed network between the nodes. Data is spread across multiple nodes, and reliability is maintained through specific redundant mechanisms. Specifically speaking, a distributed storage system typically comprises the following components:
1. Client: Client encompasses the user interface and is responsible for engaging with users, receiving their requests, and delivering responses.
2. Network Link: The network link plays a crucial role in transmitting diverse data, including user requests, read and write operations, and responses.
3. Storage Node: Storage node, as a fundamental component of the distributed storage system, plays a pivotal role in both actual data storage and the processing of read and write operations.
4. Metadata Management Node: Metadata management node is tasked with overseeing metadata information of the distributed storage system, encompassing data file storage locations, copy counts, and access permissions. It also facilitates load balancing and data migration of storage nodes to optimize system performance.
Cloud Storage:
Cloud storage (Ghani et al. 2020, pp. 1-21) represents highly scalable distributed storage that disperses data across diverse geographical locations. It typically adopts a distributed storage architecture to cater to users' requirements for data storage. Unlike traditional distributed storage, which is limited by geography and network bandwidth, cloud storage is typically offered by global cloud service providers, allowing users to directly access data via the Internet. Therefore, cloud storage can be viewed as an extension of distributed storage on the Internet, delivering a more efficient, dependable, and adaptable solution.

1.1.2 Characteristics of Distributed Storage

A distributed storage system can provide storage services by interconnecting multiple storage nodes, which are composed of soft and hardware components. Compared with traditional centralized storage, distributed storage distributes data across multiple relatively independent storage nodes to achieve higher response speed and scales the capacity by increasing the number of storage nodes. It also...

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

Distributed Storage in Practice

Description

More details

Other editions

Additional editions

Persons

Content

1
Fundamentals of Distributed Storage

1.1 Introduction to Distributed Storage

1.1.1 Evolution of Storage Systems

1.1.2 Characteristics of Distributed Storage

System requirements