Architecture of intelligent Disk subsystem •A disk subsystem is a hard disk server. •Servers are connected to the connection port of the subsystem using standard I/O techniques such as SCSI, fibre channel etc. •The internal structure of the disk subsystem is completely hidden from the server. The server sees on the hard disks. www.bookspar.com | VTU NOTES | QUESTION PAPERS | NEWS | VTU RESULTS | FORUM | BOOKSPAR ANDROID APP
48
Embed
Architecture of intelligent Disk subsystem A disk subsystem is a hard disk server. Servers are connected to the connection port of the subsystem using.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
• With regard to performance, It is better to use smaller hard disks, so that more hard disks are available in the disk subsystem. At the expense of max capacity.
• Here overall load is spread over more arms and read/write heads and over more I/O channels.
• Standard I/O techniques like SCSI and fibre channel are often used for the internal I/O channels between connection ports and controller and between controller and internal hard disks.
Disk subsystems are classified based on :i. No controllerii. RAID controlleriii. Intelligent controller.if the disk subsystem has no internal controller ,it is just an
enclosure with disks.(JBOD). Here connections for I/O channels and power supply are taken outward at a single point. JBOD is simpler to manage. A server treats these disks as independent disks. If there are 16 disk subsystems,16 device addresses are required.
• RAID ,initially was called as REDUNDANT ARRAY OF INEXPENSIVE DISKS
• Now its called as REDUNDANT ARRAY OF INDEPENDENT DISKS.
• Disk subsystems that support RAID are called as RAID arrays• RAID has two goals: i) to increase performance by striping ii) to increase fault tolerance by redundancy
• The bundle of physical hard disks brought together by the RAID controller is known as Virtual disk.
• A server connected to a RAID system only sees the virtual hard disk.
• The fact that RAID controller actually distributes the data over several physical disks is completely hidden to the server.
• A RAID controller can distribute the data that a server writes to the virtual hard disks among the individual physical hard disks in various manners. Theses different procedures are known as RAID levels.
RAID 0: Block by Block striping: RAID 0 distributes the data that the server writes to the virtual hard disk onto one physical hard disk after another block by block.
In the fig there are 4 physical hard disks. The server writes the Blocks A,B,C,D.. Onto the virtual hard disk one after the other .
The RAID controller distributes the sequence of blocks onto the individual physical hard disks (First block A to the first physical disk,B to second disk etc..) after D ,E is written to first disk,block F to second …
• RAID 0 increases the performance of the virtual hard disk as follows: the individual hard disks can exchange data with the RAID controller through I/O channel more quickly . When first block is written into first disk, second is sent into second disk etc..
• RAID 0 is the choice for applications for which the maximum write performance is more important than protection against failure. Examples are the storage of multimedia data for film and video production.
• RAID 0 is used as a fast store for segments in which intermediate results for complex requests are to be temporarily stored.
• In RAID 1 fault tolerance is given importance. Here two hard disks are brought together to form a virtual hard disk by mirroring the data on two physical hard disks.
• If the server writes a block to the virtual hard disk, the RAID controller writes this block to two physical hard disks.
• The individual copies are called as mirrors.• The performance increases only in read operation. while
reading load can be divided between two disks. But writing is slow as data has to be sent into disks.
• RAID 1 performance and capacity are limited . So it’s a good choice for storing small databases.
• In the example eight physical hard disks are used.• The RAID controller initially brings together each four physical
hard disks to form two virtual hard disks that are only visible within the RAID controller by means of RAID 0 (striping)
• In the second level, it consolidates these two virtual hard disks into a single virtual hard disk by means of RAID 1(mirroring). This virtual hard disk is visible to the server.
• When using RAID 0 the failure of a hard disk leads to the loss of the entire virtual hard disk. In RAID 0+1 the failure of a physical hard disk is thus equivalent to the effective failure of four physical hard disks.(fig 2.13). If one of the disk is lost, data is lost. It may be possible to reconstruct the data (but difficult).
• In RAID 10 after the failure of the individual hard disk, additional failure of a further hard disk does not arise. (fig 2.14)
• Even though RAID 10 provides excellent performance ,the problem is mirroring doubles the required storage capacity.
• In RAID 4 and RAID 5 all mirror disks are replaced with a single parity hard disk.(fig 2.15)
• The server writes the blocks A,B,C,D,E etc to the virtual hard disk sequentially. The RAID controller stripes the data blocks over the first four physical hard disks. instead of mirroring the RAID controller calculates a parity block for every four blocks and writes this onto the fifth physical hard disk.
• For ex the controller calculates the parity block• P ABCD for blocks A,B,C and D. If one of the block fails the controller can
reconstruct the data using the three other disks and parity disk.• RAID 4 saves three physical hard disks.• server sees only virtual disk.
• The parity block is calculated using XOR operation.• P ABCD =A XOR B XOR C XOR D (ex 2.15)• Changing a data block requires changing of value of parity block.ie each write operation to the virtual hard disk requires:i.the physical writing of the data block.ii.The recalculation of the parity blockiii.The physical writing of the newly calculated parity block
Write penalty: The extra cost for write operations in RAID4 and RAID 5 is called the write penalty.
• Ex: fig 2.16 shows a case where server changes block D on the virtual hard disk.
• The controller reads the data block and the parity block form the disk into its cache . then it uses XOR operation to calculate the difference between old and new parity blocks.
• RAID 4 and RAID 5 implementations are capable of reducing the write penalty .ex ,if large data are written sequentially, then the RAID controller can calculate parity blocks from the data flow without reading the old parity block from the disk.
• If the cache size is large it holds frequently changed parity blocks after writing to the disk.
• RAID 4 saves all parity blocks onto a single physical disk. Thus the parity disk becomes the performance bottleneck of RAID if there are high write operations.
• RAID 5 distributes the parity blocks over all hard disks. (fig 2.17)• For ex:P ABCD goes to the 5th disk(H) while P EFGH goes to the disk D.• RAID4 and RAID5 suffer from write penalty. (RAID4 not used in general)• RAID4 and RAID5 can withstand the failure of a physical hard disk. Parity
blocks help to restore.• RAID4 is not used in practice.• Some RAID5 implementations have second parity disk to protect data.
RAID 0 is suitable for which the maximum write performance is more important than protection against the failure of a disk. ( multimedia,or physical experiments where large time is required)
RAID 1 Performance and capacity are limited because only two physical hard disks are used . useful for small databases.
RAID 10 is used in situations where high write performance and high fault tolerance is needed. (Used for database log files)
RAID4 and RAID 5 save disk space at the expense of poorer write performance.