HP AutoRAID 12H White Paper

hp vir tual array: double your operating ef ficiency
executive summary In today’ s market envir onment, you are constantly challenged to find ways to both grow your business and save costs. And with the current market growth uncertainties, saving costs is getting a lot of attention by ever yone from Main Street to W a ll Street to inside the Beltway . HP has invented a solution that will get you on the road to saving money and set you up to grow revenue streams once the economy turns around. That solution is HP’ s V i r tual Arrays. It may sound complicated at first, but HP’ s V i r t ual Arrays are simply just another automated advancement. They free your organization from the many manual functions r equired by the current vintage of arrays on the market today . Just as automated printing machines revolutionized the publishing industr y in the last centur y , HP revolutionized the personal and corporate printing industr y with its industr y-leading LaserJet printers. Now , HP’ s V irtual Arrays are revolutionizing the storage industr y , cr eating industr y-leading storage solutions that save your organization 2x in operating costs over the current set of arrays on today’ s market. HP’ s V i r tual Arrays are automated to maximize your operating efficiencies and maximize your retur n on investment. They a re automated to save you money . The following customer quotes and two articles explain how HP’ s V irt ual Arrays deliver on this money-saving promise. Read them for the details or have your trusted IT professionals do this for you. Just remember that HP’ s V ir t ual Arrays are the only automated arrays on the market today that offer you these cost-saving features at a mid-range price. Can you think of even one profitable publishing house that has not fully adopted automated publishing equipment? Can you think of any successful organization that has not adopted laser printing? The same will be said for HP’ s V ir t ual Arrays. Capture these savings today .
2 1 table of contents • Amazon.com i • Room & Board i ” • State of New Mexico ii i • V iaSat • Santa Clara ii section 1: hp storage white paper November 2001 • virtualization, simplification and storage 1.0 • array virtualization defined 1.0 • the reasons you need more virtualization in the array 1.1 • key features of hp’ s patented virtual technology 1.1 • why now is the time for array virtualization 1.2 • managing traditional storage 1.2 • configuring an array for a database 1.3 • the system administrator’ s dilemma 1.4 • managing the hp virtual array 1.4 • adding capacity with hp virtual arrays 1.4 • time to implementation: formatting the array 1.5 • automating the cache parameters 1.5 • performance 1.5 • faqs 1.6 • summar y 1.8 • for more information 1.8 section 2: “hp extends vir tualization to the array” —by D.H. Brown Associates, Inc. November 15, 2001 • virtual value in hp’ s va7000 series 2.0 • super redundancy enhances RAID 2.1 • reliability characteristics of hp’ s va7000 series 2.2 • software extends virtual functions 2.3 customer quotes
vir tualization new . Many companies have optimized their performance with HP , including: ” Amazon.com “W e look at three things when selecting data warehouse disks. Price per terabyte, throughput capabilities, and availability features. With the HP V i r tual Array 7100, the price per terabyte was extremely competitive and the throughput w a s phenomenal. Our current Superdome V A7100 configuration is capable of driving three to four gigabytes of I/O per second, and the autoraid features of the V A7100 exceed our availability requirements.” Mark Dunlap Director of Data W a rehousing Amazon.com Room & Board “After reviewing various competitive scenarios for our SAP environment, we selected HP as our single vendor with rp5450 ser vers and a V A7100 for our SAN. W e found the set-up to be fast and easy , and the perfor mance excellent. It was an easy decision to add a second virtual array , a V A7400, to accommodate our growth and position us for the future.” John Focht Systems Administrator Room & Board State of New Mexico “In consolidating from eight controllers to the HP Surestore V A7100 we expected to see some perfor mance degradation in running our Children, Y outh, and Family Department databases. W e haven't seen any degradation and are pleased with the perfor mance. W e've found it ver y easy to allocate space with the V A7100 and have saved time with reduced administrative overhead. W ith the AutoRAID feature, we've eliminated wo rryi ng about mirroring. W e let AutoRAID go to work and don't even have to think about it.” T om Elder DBA 3 Children, Y outh and Family Department State of New Mexico i customer quotes After seven years and 20,000 sales, we hardly consider
” Santa Clara “W e needed to move to the next generation of technology as we merged data centers. W e had outgrown the large ser ver farm scenario and wanted to focus on consolidation. In discussing our current and future needs for our NT environment, the V A7400 was the best solution. W e have transformed to an enterprise environment with great expandability for the future. And as a hospital environment with needs for an always-on, 24x7 operation, we've found a reliable solution with HP .” Gary Davis Hospital Information Systems Manager II Santa Clara V iaSat “W e selected the HP Surestore V ir tual Array 7400 storage based on the number of Oracle ERP instances that would be needed. W e found the V irtual Array has many more features and capabilities than the Sun solution that was being proposed. W e felt it would scale the way we needed, provided strong price/perfor mance, and offered excellent operating efficiencies. It has saved many hours and long nights for our DBA ’ s and system administrators. Plus, we're using the HP Surestore tape librar y for efficient tape backup data protection.” Mike Johnson IT Manager V iaSat ii customer quotes cont’d
1 in storage capacity they will only become more so. HP’ s Array V i r tualization is evolutionar y , not revolutionar y . It is a logical progression in array technology . It is proven. It is necessar y . It is the perfect storage area network (SAN) technology . And it is offered only by Hewlett-Packard! section 1 virtualization, simplification and storage In 2001, HP announced two new disk arrays— t he HP Surestore V irtual Array 7400 and the HP Surestore V i r tual Array 7100. This paper will focus entirely on the virtualization technology that makes these the easiest arrays to manage and the most intelligently simple arrays on the market. This paper answers the question, “What is the value of HP’ s Array V i r tualization?” For those familiar with data center environments, virtualization is not a new concept. V irtualization already exists to some extent at ever y point in the solution stack. For example, ser vers, operating systems, databases, file systems, volume managers, drivers, switches, and storage devices all require virtualization to achieve their purposes. At the array level, HP’ s V i r tual Architecture simply expands on already familiar ground, and it cleanly fulfills the promise of virtualization— it hides complexity from the administrator and can have a dramatic and positive effect on real- world perfor mance. This reduction in complexity greatly simplifies and streamlines the data center environment. hp’ s array virtualization: • reduces the time spent managing individual arrays • allows storage administrators to manage more storage with less effor t • reduces the opportunities for human error • frees up precious IT resources to work on revenue-generating projects • self-manages the RAID configuration for optimum perfor mance array virtualization defined The purpose of virtualization in any technology is to hide complexity from the user , or in the case of disk arrays, to hide complexity from the storage administrator and provide a standard environment for application development and increased price/perfor mance. Disk arrays are complex devices designed for complex tasks. A disk array with 50 disk drives is more complex to manage than a disk array with one drive. Armed with vir tualization, an array could potentially allow the 50 drives to be perceived and managed as one big drive or as one big pool of storage. The power of virtualization is the power of simplification. Now for the definition: V irtualization in arrays is about creating and managing virtual storage devices. It is about taking blocks of storage on the disk drives and presenting them as LUNs (logical units of storage). What system administrators see, then, are not the actual physical disk drives but rather a created, simplified “virtual” view of the actual physical storage, i.e., the LUNs. HP’ s V i r tual Array works with LUNs just as traditional arrays do. However , the vir tual array doesn’ t stop there. The virtual array actually manages the disks down to the level of the smallest available “cluster” or “chunk.” Further , while traditional arrays utilize static address-translation algorithms for managing the chunks, the virtual array uses a dynamic mapping system. 1.0 hp storage white paper These are not trivial benefits. Data centers are already complex, and with the continued explosion
hp storage white paper This allows the array to dy namically allocate and de-allocate clusters of any RAID type without affecting the logical view as seen from the ser ver . In other words, clusters can be moved and new RAID 5 and RAID 1 0 stripes can be created or extended dynamically without the host view being cha nge d and without system administrator inter vention. Just as a file system on the host presents a virtualized view of the storage to the application or user , the virtual array presents a vir tualized view of the storage to the file system. For example, just as you can create and delete files in a file system, you can create and delete LUNs within the virtual array . This degree of virtualization is what gives the virtual array its unique ability to relieve the system administrator of many of his/her mundane storage management responsibilities. the reasons you need more virtualization in the array • capacity requirements will always grow • IT depar tments will always have a limited budget and limited human resources for managing storage • vir tualization improves the efficiency of storage administrators • vir tualization reduces the training required for people to manage storage • data always grows, but human head count doesn’t—virtualization reduces the number of people required to manage storage key features of hp’ s patented virtual technology • automatically stripes ever y LUN across all disks in a ver y large redundancy group Benefit: simplifies overall management; reduces the number of LUNs required to “hot spots” 1 • automatically adds new disk drives into existing RAID groups Benefit: capacity upgrades require no IT training, are faster , and hot spots are virtually eliminated • dynamically , and without any human inter vention, optimizes the RAID level to the application workload Benefit: perfor mance tunes the array 24 hours per day and eliminates downtime for reconfigurations • allows LUNs to be created in seconds Benefit: reduces setup times • allows immediate use of the array after LUN configuration Benefit: speeds implementation time • allows hundreds of LUNs to be created of virtually any size—from a single megabyte to over 2TB Benefit: simplifies configurations, improves application perfor mance, and greatly decreases management complexity and potential for security errors • allows any combination of disk drive capacities and speeds within the array Benefit: simplifies disk drive inventor y control and capacity management • allows disk drives and controllers to be moved to any slot in the array Benefit: reduces the chance for human error 1.1 array virtualization defined achieve a balanced workload and eliminates
hp storage white paper 1 why now is the time for array virtualization Data centers have become enormously complex. Interestingly , new technologies that initially held the promise of simplifying the data center have, in fact, added to the complexity—at least for now . Fibre Channel, SANs, and the low cost and ease of adding storage have enabled data centers to expand both in size and complexity . The world is moving from SCSI to Fibre Channel. Fibre Channel and SANs allow for more devices to be connected together , with greater per formance, and at far greater distances. All of these are wonderful benefits, but along with those benefits comes an ability to now create configurations that are larger , more complex, fur ther apar t, and harder to manage than anything envisioned even a few years ago. Couple this new reality with the exponential increase in storage, fueled by its low cost, and you have a recipe for losing control of the data center . Consider this: while human resources remain static, environments grow larger and more complex. Simplification is the only realistic answer . Adding newer and slicker management software oftentimes can help, but in the long term, adding software tends to have the opposite effect of reducing complexity . Software may give you a central place from which to manage your hardware and it may simplify processes, but it doesn’t necessarily eliminate those processes. Armed with better management software, environments are encouraged to add more hardware and eventually the environment becomes as complex as it was before, but for different reasons. And when the human resources are already stretched to the breaking point, this is a recipe for complexity , stress, long hours, and human error . V i r tualization is the answer . It solves the basic underlying problem. It permanently simplifies the environment for the system administrator . — Chessmaster Savielly Grigorievitch T art akower on the game’ s opening position Configuring and managing traditional storage is time-consuming and challenging even for experienced system administrators. When configuring storage, a wide variety of factors must be accounted for , judgments must be made, and steps repeated over and over again. The potential for error is high. The cost of configuring an array improperly is also high. The administrator must consider the following factors: • capacity , cost, per formance and availability requirements • requirements for future additional capacity and perfor mance • number of disk drives and their capacity • per formance characteristics of the disks • which RAID level will meet desired capacity , cost, and availability needs • number and type of RAID groups • number of LUNs based on application, perfor mance, and array configuration needs • size of LUNs • configuration of the ser ver volume manager • cache configuration options • stripe depth configuration • implementation plan: who, what, and when • time to bind LUNs 1.2 reasons for more virtualization in the array managing traditional storage “The mistakes are all there waiting to be made.”
section 1 In addition, the same processes must be followed whenever disk drives are added or the environment changes. Plus, these additional factors must be considered: • current configuration • desired additional capacity , per formance, and availability • whether the new disks will be stripe extensions of existing disks or be independent groups here is a typical process for setting up a traditional array: 1. Deter mine number of disks, number of RAID groups, disks and disk type per RAID group, RAID level of each group, total LUNs, LUNs per RAID group, stripe depth. 2. Deter mine volume manager configuration, stripe size and depth, LUNs per logical volume. 3. Using the command station, set up the LUNs and their RAID levels and assign them to particular disks. 4. Set up the cache page size depending on the size of the I/Os coming in from the host. 5. Finally , before the new LUNs can be used, disks must be formatted, which can take many hours per array . configuring an array for a database Properly configuring an array for a database typically involves a large problem set with many variables. Many database administrators have been taught to isolate different pieces of the database in an attempt to optimize perfor mance, availability , and recover y . This process, although based on sound objectives, is far too error-prone. This typically involves a large problem set with many , many variables. Unfortunately , database 1 perfor mance requirements for each of the pieces, and they don’t know the performance behavior of the array in its multitude of configurations. In these real-world environments, it is typically far too time-consuming to tr y a number of different storage configurations, so database administrators typically apply rules from previous installations. The changing characteristics of newer versions of the database typically result in an unbalanced configuration that has “hot spots” that limit the perfor mance of the system. This entire process can take from a few hours to several days, depending on the skill of the administrator and the number and size of the LUNs. During much of this time the array is either unusable or must operate in a degraded perfor mance mode. In other words, LUNs cannot be utilized until they have been formatted. This formatting takes up a lot of the array’ s internal resources and bandwidth. After a LUN has been formatted, it can be used; but as long as other LUNs in the array are also going through their format process, the entire array will suffer from degraded perfor mance. Now , a shor t word about human error . Every step of this process has the potential for human error . Except in the grossest cases, errors would probably not result in data loss, but ever y miscalculation in this process would easily result in a decline in perfor mance. Some of these declines could be huge. For example, miscalculating the RAID levels or the cache page size could severely degrade the array’ s performance. 1.3 hp storage white paper administrators oftentimes don’t have all the critical information. They don’t know the precise database
section 1 the system administrator ’ s dilemma Change is the issue. Many environments change over time and this makes their initial configurations progressively less and less optimal. The dilemma is that an optimum configuration today eventually becomes out-of-date, and typically over time, perfor mance degrades in traditional arrays, but the administrator usually has neither the time nor the window of opportunity to bring the system down and perfor m the necessar y reconfiguration. Access patterns change. Different areas of the database become more highly used. Sometimes at the end of the month certain data that nor mally lies unused now becomes highly used and requires higher perfor mance. All of this poses problems for the conscientious system administrator . At first, the administrator notices a change in perfor mance, but it is not enough to justify bringing the system down. However , over time, the performance continues to degrade until eventually the problem is so severe that the administrator is forced to bring the system down over the protests of the users. He must then go through all 13 configuration steps, including binding all the LUNs and waiting for the reformat to complete. After that the cycle begins again: Perfor mance initially is great, but over time it degrades until finally the pain increases to the point that a reconfiguration again becomes justifiable. managing the hp virtual array Configuring an HP V i r tual Array is much simpler than the process for configuring a traditional array . Remember the complex steps involved in configuring a traditional array? The steps involved in configuring the HP V i r tual Array consist of: • deter mining the total capacity and per formance r equirements • f or each application, determining the number and size of LUNs for the necessar y per formance 1 Step away from the array; the configuration is now complete. Ever y other step is automatic. RAID levels are automatic. The different capacity , number , and speeds of disk drives are automatically accounted for . The cache page size is automatically set. Even the disk formatting is per formed automatically . Moreover , after the LUNs are set, the array is immediately available to accept data. The array does the work, not the administrator . Not only is this the initial configuration process, but the process is just as simple for any subsequent reconfigurations. In other words, if LUNs have to be deleted and new ones created, the process is just as simple. Note: As with any array , if you wanted to delete the LUNs but save the data, you would have to do a backup and restore. adding capacity with hp virtual arrays HP’ s V irtual Array Architecture also simplifies the process of adding capacity to an array . T oday , many traditional arrays allow the administrator to add disks on-the-fly—in other words, to add a disk drive when the array is up and running and accepting I/Os. However , when the disk drive is added to a traditional array , it is not par t of any LUN. It is not formatted, and it is not able to accept data. An administrator must go in and manually perfor m those functions. If the disk drive is to be added to an already existing RAID group, then the data in that group must first be backed-up and later restored to the newly created LUN that now contains the new disk drive. And except for the backup, the steps are the same as when an array is first configured. 1.4 hp storage white paper • deter mining the number of disks required • creating the LUNs—note: creating LUNs takes only a few seconds per LUN
section 1 The HP V irtual Array accepts new disks while the array is up and running and accepting I/Os as with some higher-end traditional arrays. However , the HP V irtual Array takes it one step further . Once the disk is inserted, the array automatically includes that disk into the existing disk space and stripes all LUNs across that disk. This means that even without the creation of any additional LUNs, the array perfor mance will improve because of the additional available spindle. Only the HP V irtual Array automatically adds the new disks to existing LUNs. Further , any newly created LUNs are also automatically spread across all the disks in the array , including the additional disk. time to implementation: formatting the array As mentioned earlier , after new disks are added to a traditional array , it then takes several hours to complete the formatting of the RAID group. During this format phase, no data can be written to the new LUNs. With some implementations, the array is offline until all the LUNs have been formatted. In other implementations, I/Os can be written to already formatted LUNs even while other LUNs are going through the format process, although perfor mance is ver y slow . Because executing the disk format command uses up so much of the array’ s internal bandwidth, array perfor mance is greatly reduced until all of the disk formatting has been completed. W ith HP’ s V irtual Array T echnology , the array is immediately available as soon as the LUNs have been configured. The disk formatting is done as the writes are done. In other words, as writes are sent to disk, the formatting is accomplished for only those blocks being written to. This means that while there is a small hit to perfor mance for that individual write, there is ver y , ver y little impact on overall array perfor mance. automating the cache parameters 1 Configuring a traditional array typically requires setting the cache parameters such as the percentage of read and write cache, the size of the cache pages, and, in some cases, the allocation of cache to specific LUNs. In making these determinations, there is ample opportunity for error . W ith HP’ s V irtual Arrays, all of this is preset and automatic. And this means that all the parameters within the array are tuned to work in unison with the stripe size and the array hardware. First, the cache is set at 80% read and 20% write, is shared between controllers, and is treated as a “pool.” Second, the cache page size is set at 64K and is set to automatically destage to disk ever y 4 seconds whether the page is full or not. The 64K size minimizes the number of I/Os to the back-end in sequential environments and provides a carefully calculated balance within the array between the number of cache pages and the speed of the back-end in random environments. performance T raditional arrays are susceptible to “hot spots” and to changes in the environment that make the initial configuration obsolete. The HP V i r tual Array virtually eliminates these critical perfor mance issues. First, the HP V irtual Array is far less likely to experience a hot spot—in other words, it will almost never experience a condition where a few disk drives become a perfor mance bottleneck in the array . Here’ s why: the virtual array always (and automatically) stripes all of the LUNs across all of the disks in the RAID group. For example: assume a virtual array loaded with a total of 60 disks had 30 disks in each of its two RAID/redundancy groups. Ever y LUN in that group would be spread across all 30 disks. 1.5 hp storage white paper
section 1 Ever y LUN would have 30 spindles at its disposal. And don’t forget, in the virtual array the spare disk capacity is also spread across all of the spindles, i.e., there are no unused spindles in an HP V irtual Array . Second, the virtual array automatically per formance tunes the array 24 hours per day , 365 days per year . The RAID level is matched with the workload. Data with access patterns that would benefit from RAID 1 0 storage are automatically directed to a RAID 1 0 section of the array . Infrequently used data, or data whose access patterns match RAID 5DP perfor mance characteristics, are directed to the RAID 5DP section of the array . These are the same rules that a storage exper t would use to optimize an application’ s performance. In addition, like an expert storage administrator , any changes to the configuration are made only during low-usage periods or when disks are added. Note: When disks are added, the existing LUNs are automatically extended across the new disks. This normally requires data movement within the array . However , administrators have the option to postpone this data movement by simply turning off the array’ s “Auto-Include” feature. faqs Question: Isn’t this virtualization technology new and doesn’t that make it risky and untried? Answer: It would be new for HP’ s competitors, but HP has been virtualizing arrays since 1995, and since then HP has sold over 20,000 virtualized arrays. The technology is neither new nor risky and it offers a simple and compelling value proposition, i.e., great ease of management a n d great data protection. For arrays, management costs are far more costly than the initial purchase price. The HP virtual array is the easiest array in the world to configure, add capacity , and manage— the administration savings are significant. It also has the best availability of any mid-range array on the market. See the white paper titled “V A7100 Hardware High A vailability Features” 1 Short answer: Both the HP Surestore V irtual Array 7100 and the HP Surestore V irtual Array 7400 can be configured to run in either fixed RAID 1 0 or AutoRAID modes. And to help with perfor mance analysis, the controller can provide data on the actual usage of each RAID level. Long answer: W ith the HP Vir tual Array , if your data is frequently used, it WILL be in RAID 1 0 and will have the best perfor mance. The virtual array RAID level policies were developed after researching the decision patterns of experienced system and database administrators. Almost always, technology progresses from highly manual t o highly automated operations. In almost ever y case, the developers of automation simply replicate the best of the already developed manual processes. This is exactly what HP did with the HP V irtual Array . Also please note: The V irtual Array policies are improved over those of the Model 12H. The virtual array really does strive i n a lm o st ever y instance to do any background data movement during periods of low array activity . Question: On which disks is my data kept? I suppose the virtual array can be trusted to handle the RAID level decisions, but I also need to know on which disks my indexes and redo logs are kept. The way the virtual array moves data and stripes across all the disks means I have no idea where my data is kept. Answer: First, if you are worried about the integrity of your data, the HP V irtual Array does a combination of things to protect your data that no other array does. First, the HP V irtual Array offers end-to-end checksum, ECC protected memor y , parity coherence, disk scrubbing, and RAID 5DP which, in a typical configuration, 1.6 hp storage white paper for more details. Question: Shouldn’t a good system administrator know which RAID level the data is in? With the virtual array , I have no idea if it is in RAID 5 or RAID 1 0.
section 1 gives 100X the data protection of traditional RAID 5 and 10X the data protection of RAID 1. Second: The HP V irtual Array stripes across all the disks in the redundancy group. This involves more disks but is not so different from any other RAID 1 0 implementation. Third: array striping is the way of the future. It’ s faster and safer . Question: Doesn’t all this “behind the scenes” movement of data require a huge perfor mance hit? Surely , the trade-off for virtual technology is slow perfor mance. Answer: Not true. For years, HP’ s original virtual array , the AutoRAID Model 12H, was used for HP’ s V -class TPC-C benchmark tests. And today , HP V irtual Array products have been used for the new rp8400 benchmark testing and are scheduled to be used for the Superdome TPC-C tests. Remember , the HP V i r tual Ar ray mimics the policies of experienced system administrators. W ould an experienced system administrator do a reconfiguration of the array during a period of high workloads? No! And neither would the HP V i r tual Array . The HP V i r tual Array policy is to NOT perfor m the background tuning operations when the array is under a greater than 60% workload. Question: Isn’t manual always better than automatic? In cars, manual transmissions give better perfor mance than automatic transmissions because they give the driver more control over perfor mance; likewise wouldn’t manual RAID configurations be better than HP’ s Vi r tual Array T echnology and its automatic RAID configurations? Answer: Manual transmissions in cars would not give better perfor mance if drivers were only allowed to shift gears once and could never change them after that. In essence, this is what you have with traditional disk arrays. Y ou are stuck with the initial configuration unless you bring the array down and go through a time-consuming and complex reconfiguration ever y time the 1 hours per day . It is the hands-down winner in real-world perfor mance. A better analogy would be to compare the multiple manual processes required to set the type, load the paper , and actually print books on the old-style printing presses with those of the automated printing systems of our generation. Obviously , automation in printing presses adds to greater perfor mance. It works the same for arrays. Question: W ouldn't striping the data across such large RAID groups make the disk rebuild times ver y long and take up a high percentage of the array resources? Answer: Because of HP’ s RAID 5DP this is practically a non-issue for HP V irtual Array products. The HP V irtual Array’ s RAID 5DP gives each redundancy group 10x the protection of RAID 1. RAID 5DP requires that 3 drives would have to fail before there would be data loss. Thus, even AFTER a drive fails, the data is STILL protected with the same degree of protection as standard RAID 5. Finally , this means that a single drive failure does not put the data at risk and therefore does not require an emergency rebuild. The HP V irtual Array can take the time and do the rebuild in the background without impacting incoming I/Os. Also, the HP V irtual Array will rebuild the RAID 1 0 data first since that is the most vulnerable after a failure. In all cases, data integrity is ensured and perfor mance is preser ved. No other array can make this claim. Question: If your environment is totally stable, wouldn’t a manual configuration by an experienced system administrator result in better perfor mance than o ne de rived from the vir tual array’ s poli cies? Answer: Let’ s first admit it: T otally stable environments are rare. But the answer is that if the environment were totally stable, and if the administrator configured the array absolutely 1.7 hp storage white paper environment changes. HP’ s V ir t ual Array Architecture tunes the array automatically , 24
1 things. At first, bits were grouped into bytes. When there were too many bytes to keep track of, they had to be grouped i nt o b l ocks. When there were too many blocks to manage, someone invented volumes and volume managers. What do you think the next logical step is when there are too many volumes? section 1 accurately , the traditional array and the HP V i r tual Array would have similar perfor mance. However , the HP V irtual Array would still have the perfor mance advantage because of its ability to efficiently load balance across all the disks in the array both at the initial configuration and after capacity growth. In both stable and changing environments, the HP V irtual Array is the right choice. Now , if the environment is not totally stable, then the HP virtual array is the hands-down winner . When both arrays are initially configured, the traditional array should be faster at least for a day or so, but after that the HP V irtual Array will catch up and continue to operate at peak perfor mance for as long as the array is plugged in, while the traditional array will get slower over time. This is how it works. When the HP V i r tual Array is first turned on, it doesn’t know which data needs to be in RAID 1 0 and which data needs to be in RAID 5. After a day or so of reading the access patterns, it will figure out which RAID level is best suited for which data. Once that happens, the perfor mance will be as good as in any manually configured array . And, don’t forget, if the environment should change after that, even slightly , the HP V irtual Array will adjust while a manually configured array can do nothing but keep plugging along in a degraded mode. Question: Are you saying that this technology is for ever yone? I’m not looking for anything new . Answer: If it’ s new to you, then it is understandable that HP’ s virtual technology might seem unnecessar y . After all, you’ve succeeded without it up till now . But sometimes improvements do come along that really do hold out the promise of a widespread advantage. The dilemma of our rapidly advancing technological age is to know which new technologies should be ignored and which are the ones that need to be grasped. hp storage white paper Consider: It is always easier to manage a smaller number of things than a larger number of Answer: HP’ s V irtual Array . summar y HP’ s V irtual Array with HP’ s patented V irtual T echnology is the industry’ s most intelligent disk array . Because of its unique ability to greatly simplify storage management, it significantly reduces overall IT management costs and practically eliminates mistakes caused by human error . HP’ s Vir tual T echnology also has a significant positive impact on real-world perfor mance by automatically eliminating “hot spots,” and by perfor mance tuning the array 24 hours per day , 365 days per year . for more information For additional information on HP V irtual Arrays and other HP storage products and solutions, please call your local HP sales representative or visit our W eb site at www .hp.com/go/storage. 1.8
section 2 virtual value in hp’ s va7000 series 2 Vir tualization is a key to solving complexities associated with storage configurations. As Information T echnology continues its non-stop evolution, tools and devices that improve manageability while increasing business efficiency w ill command a premium in the marketplace. HP has taken a significant step for ward in delivering its V irtual Array (V A) 7000 series with a full c om plement of vir tualization functions. Enterprises that value a return on investment in areas such as data-sharing, optimized self-tuning per formance, and data-access flexibility must evaluate HP’ s V i r tual Array of ferings. SAN virtualization is in effect the aggregation of multiple storage devices in a centralized configuration, a storage pool. This simplifies the storage administrator’ s role by enabling all resources to be seen as a single entity . SAN virtualization b y it s e lf, however , offers little assistance in efficiently managing requirements of single applications. It pays scant attention to the capacity and perfor mance dynamics of individual, heterogeneous devices within the storage pool. HP , in its Federated Storage Area Management (FSAM) strategy , acknowledges the importance of SAN vir tualization. In its V A7000 Series, HP extends these concepts and benefits to the device-array l evel. T ogether , they demonstrate the full advantage of interdependencies between managing SAN v i r tual storage pools and managing vir tual storage at the de vice level. HP’ s V A7000 Series Characteristics Feature V A7100 V A7400 Capacity 1 TB (raw) in 3U enclosure; 14 TB in 2M rack 7.6 TB (raw) using the Disk System (DS) 2400 (up to six DS2400s chained); 14 TB in a 2M rack of 73 GB disks LUNs (max) 128 1,024 Disk Drives Supported 15 per 3U enclosure; same disk support as 7400 Up to 105 drives with DS2400; 18 GB 15 K rpm; 36 GB 10 K and 15 K rpm and 73 GB 10 K rpm I/O ports 1 Gb/sec. FC host and disk por ts T wo 2 Gb/sec. FC host ports; two 1 Gb/sec. FC disk ports Cache 256/512/1024/ MB mirrored 512/1024 MB mirrored per controller T ransfer Rate 90 MB/sec. 12,000 I/Os per sec. from cache; 3000 I/Os per sec. back-end 160 MB/sec.; 28,000 I/Os per sec. from cache; 7500 I/Os per sec. back-end In the HP Surestore V A7000 family , “vir tual array” r efers to treating the disks within the array as a pool of storage blocks rather than as physical disks. With this technology , the V A7000 family emerges as a second tier of storage virtualization. Extending virtualization to the device level, the SAN’ s view of storage becomes independent of physical disk attributes. This characteristic reduces the cost associated with SAN virtualization by simplifying storage management, enabling more efficient use of available device capacity , and d e c r easing the amount of required data movement at the de vice level. D.H. Brown Associates, Inc. http://www .dhbrown.com Our research program in Enterprise Storage Solutions (ESS) makes this T echnology T rends available to all our subscribers. Those interested in this program should contact marketing@dhbrown.com or 914-937-4302, ext. 281 2.0 “hp extends virtualization to the array” —by D.H. Brown Associates, Inc.
section 2 V A7000 virtualization, managed at the array level, offers the advantage of providing greater granularity when managing a pool of logical storage. Logical Unit Name (LUN) capacity is definable as a logical set of storage blocks rather than as a specific set of predefined physical disks. Each LUN becomes scalable from a few megabytes to several terabytes by adding capacity without operator inter vention (the array automatically controls the addition of new capacity to the storage pool). This “self-tuning” capability automatically restripes data across new disks added to the array , minimizing any potential imbalances of data distribution within the V A7000 series. Redistribution occurs in the background with no impact on SAN bandwidth or ser ver per formance. This function significantly reduces the efforts required by the system administrator to keep storage resources in balance. Also with V A7000 virtualization, the number of LUNs is no longer restricted by the amount of physical storage. Administrators can configure LUNs up to the limit permitted by the architecture without concern for physical disk groupings within arrays. Disk arrays without virtualization suppor t LUNs only from contiguous free space, requiring allocation of large storage chunks even if only a small portion is necessar y . As changes occur over time, data must be moved or storage reconfigured to recover unused space. W ithin the V A7000, fragmentation of physical space is managed at the RAID (Redundant Arrays of Inexpensive Disks) block level rather than in disk-sized chunks. The impact of free-space management and “garbage collection” is absorbed at the array level, without consuming SAN or ser ver resources. super redundancy enhances RAID 2 As alternatives to the “self-tuning” LUN-management capability described above, the V A7000 series also enhances LUN perfor mance through its use of RAID configurations “redundancy groups.” Many variations to RAID configurations have surfaced over the past 10 years. Most provide moderate variations on the standard RAID 1 (mirroring) and RAID 5 (striping) techniques. In its V A7000 Series, HP delivers a variation that substantially adds to its availability characteristics. RAID 5DP (Double Parity) approaches a RAID 6 solution in that it enables recover y from simultaneous failures in two disks without loss of any data. While this might seem superfluous in RAID arrays of only a few disks, it becomes critical in configurations supporting a large number of disks in a virtual array . The potential for disruption due to disk failure grows with the number of disks involved in an array . The goal of virtualization is to remove the physical constraints, which enables dozens of disks to be used in a single virtual array . HP has addressed this concer n with its RAID 5DP solution, increasing data availability by two orders of magnitude over traditional RAID 5 implementations. Additionally , use of storage is made more efficient as ver y large RAID groups (up to dozens of disks) are now manageable using only two parity disks rather than a parity disk for each group of five disks. T echnology T rends Copyright © 2001 by D.H. Brown Associates, Inc. 2.1 “hp extends virtualization to the array” —by D.H. Brown Associates, Inc.
section 2 reliability characteristics of hp’ s va7000 series • redundant, hot swappable controllers, fans, power supplies, and an internal fibre-channel hub eliminates planned downtime • online fir mware and capacity upgrade using mixed capacity disk drives • Checksum algorithm used for end-to-end data and data-path protection against data corruption • double parity suppor ts recover y from two simultaneous disk failures (RAID 5DP) • error Correction Code (ECC) protects mirrored cache from corruption before writing to disk • three-day batter y backup protects user data in cache from power outages and disasters • phone-home capability is provided through hp’ s Predictive Support programs and Event Monitoring Ser vices (EMS) “Redundancy groups” are physical disks that contain redundant (copies of) data, defined by the RAID type assigned to the data. Each redundancy group is divided into LUNs, addressable by a host. The V A7100 supports only one “redundancy group” with each of the 15 disk drives in the enclosure addressable by both controllers with no physical restrictions. The two fibre-channel loops (one to each controller) include a transmit line and a receive line for the transfer of data to and from the controller . Controller-to-controller communication and loop failover occur via an internal N-way bus. The V A7400 supports two redundancy groups in which one controller manages access to one redundancy group (consisting of all the odd-numbered disk slots) and the other controller to t h e second group 2 as in the V A 7100. The variation in data management at the array level provides great flexibility in matching perfor mance to application need. Self-tuning enables immediate response to data and application variables ensuring more work completed per transaction and maintaining more consistent ser vice-level perfor mance. RAID 1 0 uses a large number of disks in a single configuration to deliver strong perfor m ance to selected LUNs. It does so, however , at the expense of additional space for mirroring the data. RAID 5DP incurs a perfor mance penalty for write operations and can affect system perfor mance if frequent updates to large volumes of data are required. Its advantage is in the data redundancy provided and the improved cost efficiency vs. mirroring. Dual-redundancy groups in the V A7400 enhance the efficiency of data movement through the controllers, improving system perfor mance. T echnology T rends Copyright © 2001 by D.H. Brown Associates, Inc. 2.2 “hp extends virtualization to the array” —by D.H. Brown Associates, Inc. (consisting of all the even-numbered disk slots). Data movement and failover occur in the same manner
section 2 software extends virtual functions 2 While hardware design and functionality are critical to enabling performance, software function can maximize the efficiency and flexibility of the device. HP’ s software packages extend the management and flexibility of the V A7000 family w hile exploiting the virtualization capability . HP’ s Software for V A7000 Series Feature V A7100 V A7400 CommandView SDM HP-UX with all P A-RISC computers; Windows and Red Hat Linux with Intel Pentium III/500 MHz computers or above HP-UX with all P A-RISC computers; Windows and Red Hat Linux with Intel Pentium III/500 MHz computers or above Enterprise Management Smart Plug-ins HP-UX with all P A-RISC computers; Windows and Red Hat Linux with Intel Pentium III/500 MHz computers or above HP-UX with all P A-RISC computers; Windows and Red Hat Linux with Intel Pentium III/500 MHz computers or above Secure Manager V A HP-UX with all P A-RISC computers; Windows and Linux with Intel Pentium III/500 MHz computers or above; supports 128 W orld-Wide Names and 128 secure LUNs HP-UX with all P A-RISC computers; Windows and Linux with Intel Pentium III/500 MHz computers or above; Managed by Command View SDM; supports 128 W orld- Wide Names and 1,024 secure LUNs Business Copy V A HP-UX with all P A-RISC computers; Windows and Linux with Intel Pentium III/500 MHz computers or above; supports 127 Business Copy LUNs HP-UX with all P A-RISC computers; Windows and Linux with Intel Pentium III/500 MHz computers or above; Managed by Command View SDM; supports 1,023 Business Copy LUNs Auto Path V A Windows 2000 with Intel Pentium III/500 MHz computers or above HP-UX with all P A-RISC computers; Windows and Red Hat Linux Network Management T ools Not Applicable HP OpenView NNM, HP T optolls, CA Unicenter , Tivoli NetV iew , BMC Patrol Operating Systems HP-UX 11.0; Windows NT/2000; Red Hat Linux 6.2 HP-UX 10.2, 11.0, 11i; Windows NT/2000; Solaris 2.6, 7.0, 8.0; AIX 4.3.3; NetW are 5.0/5.1; MPE/iX 6.5, 7.0; Red Hat Linux 6.2, 7.0 T echnology T rends Copyright © 2001 by D.H. Brown Associates, Inc. 2.3 “hp extends virtualization to the array” —by D.H. Brown Associates, Inc.
section 2 The major HP Surestore software packages 1 (identified on page 2.2) include, 2 • Command View SDM speeds and eases local and remote management, including install, configure, monitor , and control via a web browser , direct host attach, or SAN. HP Predictive Support programs and Event Monitoring Ser vice (EMS) provide phone-home capability . • Smar t Plug-ins allow HP Command V iew SDM to integrate and process storage events via SNMP with HP OpenV iew Network Node Manager (NNM) . • Secure Manager Virtual Array (V A) controls ser ver access to data on a LUN-by-LUN basis, even in simultaneous heterogeneous array sharing environments. • Business Copy Virtual Array (V A) suppor ts LUN replication for development, testing, or backup situations. Data replication takes place at a specified point in time while the system is operating and entirely within the storage array , reducing the potential for I/O bottlenecks. • Auto Path Virtual Array (V A) routes data around any path’ s failure to maximize data availability . It provides the ability to self-configure and to automatically manage multiple I/O paths. It also provides dynamic load balancing to ensure peak per formance. This document is copyrighted © by D.H. Brown Associates, Inc. (DHBA) and is protected by U.S. and international copyright laws and conventions. This document may not be copied, reproduced, stored in a retrieval system, transmitted in any form, posted on a public or private website or bulletin board, or sublicensed to a third party without the written consent of DHBA. NO copyright may be obscured or removed from the paper . All trademarks and registered marks of products and companies r eferred to in this paper are protected. This document was developed on the basis of information and sources believed to be reliable. This document is to be used “as is.” DHBA makes no guarantees or representations regarding, and shall have no liability for the accuracy of, data, subject matter , quality , or timeliness of the content. 1 CA Unicenter TNG and BMC Patrol are supported in addition to the HP OpenView Storage Node Manager , OpenView Allocator , Builder , Optimizer , and Accountant. T echnology T rends Copyright © 2001 by D.H. Brown Associates, Inc. 2.4 “hp extends virtualization to the array”
06/02 5981-2067ENUC