Six technical analysis of cloud storage systems

With the rapid development of the surveillance field, the birth of new technologies is also coming, cloud storage is the most high-tech products that people are most happy. It has the following major technologies.

The cloud storage system has the following characteristics: data security, superior scalability, charging according to usage, automatic switching failures across different applications, and easy management. Cloud storage is mainly used in four areas of backup, archive, distribution and sharing collaboration.

Cloud storage is a new concept extended and developed in the concept of cloud computing. It refers to the aggregation of a large number of different types of storage devices in the network through application software through functions such as cluster application, grid technology or distributed file system. Work together to provide a system for data storage and business access functions. When the core of computing and processing of cloud computing systems is the storage and management of large amounts of data, cloud computing systems need to be configured with a large number of storage devices, then the cloud computing system is transformed into a cloud storage system, so cloud storage is a data storage system. And management as the core of the cloud computing system.

Compared with cloud computing systems, cloud storage can be considered as a cloud computing system configured with large storage space. The cloud storage system has the following characteristics: data security, superior scalability, charging according to usage, automatic switching failures across different applications, and easy management. Cloud storage is mainly used in four areas of backup, archive, distribution and sharing collaboration.

A cloud storage system is a collection of multi-device, multi-application, and multi-service collaborative work. Its implementation is based on the development of multiple technologies. According to the characteristics of cloud storage and its application areas, the main cloud storage technologies involve storage virtualization, distributed file systems, cluster storage, centralized storage management, heterogeneous platform collaboration, automatic tiered storage, and of course, deduplication. , data compression and other technologies.

Storage virtualization

The most common understanding of storage virtualization (StorageVirtualizaTIon) is the abstraction of storage hardware resources. Provides a comprehensive, full-featured service by unifying one (or more) target services or features with other additional features. Typical virtualization includes situations such as masking system complexity, adding or integrating new features, emulating, integrating, or decomposing existing service functions. Virtualization is applied to one or more entities that are used to provide storage resources or services.

Storage virtualization is a technology that runs through the entire IT environment to simplify the underlying infrastructure that might otherwise be relatively complex. The idea of ​​storage virtualization is to separate the logical image of the resource from the physical storage, providing a simplified, seamless view of the resources for the system and administrators.

For users, virtualized storage resources are like a huge "storage pool". Users don't see specific disks or tapes, and they don't have to care which path their own data goes to which specific storage device. .

Distributed file system

The Hadoop Distributed File System (HDFS) is a distributed file system designed for use on common hardware devices. It has many similarities to existing distributed file systems, but it is significantly different from these file systems. HDFS is highly fault tolerant and is designed to be deployed on inexpensive hardware. HDFS provides high throughput for application data and is suitable for large dataset applications. HDFS opens up some of the necessary interfaces for POSIX, allowing streaming access to file system data.

HDFS is a master/slave architecture. A cluster has a name node, the main control server, which manages the file system's namespace and coordinates client access to files. There is also a bunch of data nodes, usually one on a physical node, responsible for the storage management on the physical node where they are located. The HDFS open file system namespace allows user data to be stored in files. Internally, a file is divided into one or more data blocks that are stored in a set of data nodes. Name nodes perform file system namespace operations, such as opening, closing, renaming files or directories, and also determining the mapping of data blocks from data nodes. The data node is responsible for providing the client's read and write requests. The data node also performs the creation, deletion, and copying of data blocks in accordance with the instructions of the name node.

Cluster storage

Cluster storage is to aggregate the storage space in multiple storage devices into a storage pool that provides a unified access interface and management interface for the application server. The application can transparently access and utilize the disks on all storage devices through the access interface. Take advantage of storage device performance and disk utilization. Data will be stored and read from multiple storage devices in accordance with certain rules for higher concurrent access performance.

The advantages of clustered storage are primarily in improving the overall performance of parallel or partitioned I/O, especially for workflow, read-intensive, and large file access, by lowering the overall cost by using lower cost servers. There are two ways to implement cluster storage: one is hardware infrastructure plus software; the other is dedicated cluster storage, which is built on top of the NAS infrastructure, but implements cluster storage through the operating system.

Centralized management of storage

The cloud storage management platform requires support for deployment and management across data centers, and supports user access scheduling, data migration, and offsite storage backup across data centers.

Supporting centralized management, the cloud storage management platform is deployed in the central computer room of the cloud computing system. The storage nodes can be deployed in the computer rooms of different locations. The management platform can manage and schedule the storage devices in each computer room.

Through the cloud storage management platform, users can easily understand the service status of each node of the cloud storage system, including the capacity and performance of each node (read IOPS, write IOPS, read traffic, write traffic) and other information, so that users can understand the resources in the domain in real time. Information, running state, so that you can manipulate these resources; at the same time, you can understand the abnormalities of resources in a timely manner, and take appropriate measures to ensure its normal operation when necessary.

Heterogeneous platform collaboration

At present, various storage solutions and technologies are very complicated and diverse. It is known from the situation that there may be different types of storage devices in one enterprise at the same time. Moreover, the storage environment between different storage device vendors has always had compatibility problems. Therefore, the storage consolidation that has been called for many years is still difficult to meet the needs of enterprises, which is the biggest obstacle to the expansion of storage virtualization and cloud storage.

While the storage cloud is easy to step in in some way (such as online storage and backup), on the other hand, it seems not easy to achieve comprehensive storage consolidation through private cloud storage. In this regard, in order to successfully complete the storage virtualization goal, it is necessary to improve the enterprise's existing IT storage environment, and its improvement focus is not only shared storage architecture, affinity environment, simple and simple operation interface, and efficient storage solutions. . Among them, whether it is a single operation interface or a unified standard API, it is one of the keys to solving the coordination problem between different storage devices.

CDMI is a new standard interface for cloud storage, developed by the International Storage Network Industry Association SNIA. For cloud computing, CDMI provides a common cloud computing management infrastructure, while the original focus of information management has gradually shifted from storage management to data management. The CDMI standard can help users mark special metadata (Metadata) on the data, which tells the endpoint storage provider what data service provides the data (such as backup, archive, encryption, etc.). Through the implementation of the CDMI standard interface, users can move data between different cloud providers without having to endure the pain of re-encoding in different interfaces.

Automatic tiered storage

Improving storage management efficiency has become the primary problem for many enterprises. Automatic tiered storage is the most effective basic technology. It refers to the function of migrating data blocks between different disk types and RAID levels, which can meet the performance and space usage. The right balance, quickly put the data in the right place and avoid the so-called hot spots.

As this technology has received wide attention, mobile data needs to implement a fully automated migration process between different levels of storage media, such as FC disks and SATA disks.

In a hierarchical data storage structure, a storage device generally has a tape library, a disk or a disk array, and the disk can be classified into a FC disk, a SCSI disk, a SATA disk, and the like according to its performance, and the flash storage medium (non-volatile random) Access memory (NVRAM) is also a higher level in the hierarchical data storage architecture because of its higher performance. Generally, high-cost, fast-speed devices such as disks or disk arrays are used to store important information that is frequently accessed, while lower-cost storage resources such as tape libraries are used to store less frequently accessed information.

Portable Inverter Generator

Wuxi Doton Power , https://www.dotonpower.com