Jim O'Reilly



Is RAID Fading Into The Sunset?

Many of us IT pros have been using RAID to protect our data for the entirety of our professional lifetimes. RAID has withstood the pressures of technical evolution well, in part because the fundamentals of disk drives and storage didn’t change much in that time.

However, much larger drives, faster networks and SSD storage have now combined to create a fork in the road, and alternatives are needed. The first crack in the edifice was the realization that if a drive failed in an array of multi-terabyte drives, the rebuild took so long that the risk of a second, terminal failure was too high. This led to the much more complex RAID 6, which creates two parity records for each stripe so that an array can survive two drive failures.
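To put rough numbers on the rebuild problem, here is a minimal Python sketch. Every input is a hypothetical assumption chosen for illustration (drive capacity, sustained rebuild rate, array size, and a per-drive MTBF figure), and the failure model assumes independent drives with a constant failure rate.

```python
import math

def rebuild_hours(capacity_tb: float, rebuild_mb_per_s: float) -> float:
    """Hours needed to rewrite one whole drive at a sustained rebuild rate."""
    capacity_mb = capacity_tb * 1_000_000  # decimal terabytes to megabytes
    return capacity_mb / rebuild_mb_per_s / 3600

def second_failure_probability(surviving_drives: int, hours: float,
                               mtbf_hours: float) -> float:
    """Chance that at least one surviving drive fails during the rebuild,
    assuming independent failures at a constant rate (exponential model)."""
    return 1 - math.exp(-surviving_drives * hours / mtbf_hours)

# Hypothetical inputs, not measurements: an 8-drive array with one drive
# lost, a 50 MB/s rebuild rate, and a 1,000,000-hour MTBF per drive.
for capacity_tb in (0.5, 2, 4):
    hours = rebuild_hours(capacity_tb, rebuild_mb_per_s=50)
    risk = second_failure_probability(7, hours, mtbf_hours=1_000_000)
    print(f"{capacity_tb} TB drive: rebuild ~{hours:.1f} h, "
          f"second-failure risk {risk:.4%}")
```

The exposure window grows in direct proportion to drive capacity. This simple model also leaves out correlated failures and unreadable sectors encountered during the rebuild, both of which would push the real risk higher.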

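RAID 6 has a major drawback, however: it requires a lot of compute power to generate parity. An alternative is to keep single parity but replicate the data on another set of disks, which uses too much space. (This mirrored arrangement is usually called RAID 51; RAID 50 proper stripes data across RAID 5 sets rather than replicating it.)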
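For reference, single parity is cheap to compute because it is just a byte-wise XOR across the blocks in a stripe. The sketch below illustrates the idea; the block contents and the helper name `xor_blocks` are made up for the example, and it is not any vendor's implementation.

```python
from functools import reduce

def xor_blocks(*blocks: bytes) -> bytes:
    """Byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

# Three hypothetical data blocks that make up one stripe.
stripe = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_blocks(*stripe)

# Simulate losing the middle block, then rebuild it from the two
# surviving data blocks plus the parity block.
rebuilt = xor_blocks(stripe[0], stripe[2], parity)
assert rebuilt == stripe[1]
print("rebuilt block:", rebuilt)
```

Typical RAID 6 implementations add a second, independent parity computed with Galois-field (Reed-Solomon style) arithmetic rather than plain XOR, which is where most of the extra compute cost comes from.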

The advent of solid-state drives made both of these options untenable. SSDs are somewhere between fast and light-speed compared to hard drives, and parity calculations struggle to keep pace with them. In addition, the cost of SSD was so high that a configuration often wouldn’t have the minimum of six drives required to make RAID 5 feasible. Many just needed one or two drives to act as caches and tier 0 storage for critical files.

The result is that SSDs are often replicated or mirrored (RAID 1). These two approaches are very similar, with a second copy of the data on another drive, but replication goes a bit further and stores the data on a separate storage appliance, removing single points of failure.

The other “big event” in storage affecting RAID is the emergence of cloud services. The need to scale out put enormous pressure on storage approaches, and protecting data on hard disk drives (HDDs) by replication made economic sense. Cloud service providers (CSPs) can buy HDDs at the lowest OEM prices, making it cheaper to add drives than to buy high-speed RAID heads to protect data. The CSPs also addressed a pressing need for data dispersion for disaster recovery by placing a third replica geographically distant from the other two.

The CSP model makes sense with HDDs costing around $60 for a 2TB drive. The cost of a typical (proprietary) RAID head node pays for a lot of drives! Replication also has the benefit of not slowing down when a drive is lost, since data doesn’t need to be recreated from parity, and it maintains integrity even if a second drive fails, since there are three copies.


Historically, replication has been tied to an object storage model, somewhat like a file server on steroids. This model uses its own access method, typically a REST interface over HTTP, to get to data across the network. Still, block I/O operations to update data are possible, and this need has even created universal storage appliances that can manage file, block and object access to the same object store. An example that's rapidly gaining popularity is the open source Linux storage application, Ceph.

Replication’s major drawback is the need for three or more full copies of data. As an alternative, Cleversafe has pioneered an extension of the RAID concept called erasure coding. This involves adding redundant information, somewhat like parity, to the data and then distributing it over multiple appliances. Typically 10 data blocks become 16 total blocks (10+6 coding), and the rule is that any 10 of these 16 blocks are sufficient to reconstruct the data.
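The "any 10 of 16" property can be demonstrated with a toy code based on polynomial interpolation. This is only a sketch of the principle, not Cleversafe's algorithm: it works over the small prime field of integers mod 257 for readability, whereas production erasure codes generally use Reed-Solomon arithmetic over GF(2^8). All function names here are hypothetical.

```python
import random

P = 257  # smallest prime above 255, so every byte value is a field element

def interpolate_at(points, x):
    """Evaluate at x the unique polynomial through `points` (Lagrange, mod P)."""
    total = 0
    for xi, yi in points:
        num, den = 1, 1
        for xj, _ in points:
            if xj != xi:
                num = num * (x - xj) % P
                den = den * (xi - xj) % P
        # pow(den, -1, P) is the modular inverse (Python 3.8+)
        total = (total + yi * num * pow(den, -1, P)) % P
    return total

def encode(data, n):
    """Turn k data symbols into n shares; shares 0..k-1 are the data itself."""
    points = list(enumerate(data))
    extra = [(x, interpolate_at(points, x)) for x in range(len(data), n)]
    return points + extra

def decode(shares, k):
    """Rebuild the k data symbols from any k surviving (index, value) shares."""
    subset = shares[:k]
    return [interpolate_at(subset, x) for x in range(k)]

data = list(b"RAID fades")               # 10 symbols, so k = 10
shares = encode(data, 16)                # 10 + 6 coding
survivors = random.sample(shares, 10)    # lose any 6 of the 16 shares
assert decode(survivors, 10) == data
print(bytes(decode(survivors, 10)))
```

In this scheme 16 blocks are stored for every 10 blocks of data, a 1.6x footprint compared with 3x for triple replication, while still tolerating the loss of any six blocks.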

However, erasure code calculation is compute-intensive, slowing both writes and reads, especially when blocks are missing. The number of drives involved also tends to be high. This makes it useful for scale-out archival data, but problematic for SSDs in Tier 0 or 1. This will likely remain an issue unless hardware assist logic becomes available.

With SSDs straining performance limits and cloud storage using very inexpensive drives to protect data, it looks like replication will take the lead from RAID, if it has not already done so. RAID arrays won’t disappear overnight, but faster object stores, open source enterprise-grade software and cheap drives all mean that the playing field is tilted towards universal storage boxes and the replication approach.
