I was having a discussion with one of our Windows administrators a few weeks ago about Exchange 2010, which makes some pretty substantial departures from how Exchange did things in the past. Since I’m mostly a Linux and VMware guy I don’t want to get too much into the product itself (I’m sure the MS Exchange Team has plenty of that), but the biggest change is that instead of four clustering modes, Exchange 2010 has one, and it doesn’t require any shared storage. The gist of it is that databases are organized into Database Availability Groups, and the databases that form them are replicated around the DAG in an arrangement rather like a big distributed RAID-10 array, with the difference that databases and not logical blocks are being striped across the replication group. When a server goes down, each database pops back up on another node in the cluster. Because of some back-end database improvements (namely, the elimination of single-instance storage, which stores only a single copy of a message or attachment sent to multiple users in the same database), Exchange 2010 cuts down random disk I/O by a huge amount, making it much simpler to run on commodity direct-attached disk with little to no penalty. Combine this with the removal of the shared storage requirement, and you no longer need a SAN to run clustered Exchange.
(Before any Exchange people chime in: yes, I’m aware that continuous copy replication/log shipping has been available since Exchange 2007. It just wasn’t viable for larger environments because you couldn’t easily distribute where the databases got replicated, meaning you either ran a replication slave for each Exchange server or seriously overspecified/overcommitted your hardware.)
Microsoft minces words pretty frequently to save face with customers (as most corporations do), and they’re still pushing the opinion that high-end SAN storage is a good idea so as not to rock the boat and upset anyone who already shelled out for high-end SAN hardware to run Exchange. However, the truth of it as far as I can surmise is that Microsoft specifically redesigned Exchange to work on commodity hardware in order to cut operating expenses on their own hosted Exchange offering.
Many applications are seeing a major paradigm shift towards distributed processing like Hadoop and schemaless NoSQL distributed data stores like, MongoDB and HBase, and proprietary software vendors are starting to take notice and move towards better use of commodity hardware. When there’s a lot of engineering effort involved, though, sometimes the best incentive for a company to improve the efficiency of their products is to try to make money on it themselves, and the results can benefit everybody.
El Reg: Cloud storage: It’s strictly for airheads
The Register has an interesting take on the Sidekick/Microsoft-but-really-Sun-and-Oracle-but-really-really-Microsoft debacle. The good bits:
Now, in this regard, I don’t really think that a standards group for cloud storage in particular is necessarily the right approach, versus something more generalized that could apply to all outsourced cloud IT vendors. I don’t believe that storage, in this regard, is any different than any other IT service. After all, while your data can certainly be lost by a storage incident, whether local, remote or somewhere in between, it can also be lost by a logical failure of the IT software infrastructure leveraging that storage. That’s what happened in this Danger business with T-Mobile: the logical failure apparently occurred (if The Register’s writeup is to be believed) when a server in the cluster threw up and trashed the data.
Danger didn’t have appropriate backups. (Oops. More on online vs. offline backups, and physical vs. logical failures, in another post.) In my experience, there’s a certain threshold of reliability and competence where once your physical infrastructure is robust enough, your potential for logical failures far outweighs your potential for physical failures. (Most backup system failures are really logical failures.) It’s this mismatch that makes business continuity planning so difficult.
But they’re right that the industry needs accountability, and it’s probably going to take a major shakeup for that to happen. Right now, there’s just too many vendors, and they’re all too young for us as IT practitioners to figure out which ones are reliable enough for business needs, and which ones aren’t. Like with any fledgling industry, there will be new technologies, there will be acquisitions, and then there will be vendors who produce an enterprise-ready product.
For the time being, we refer to the ages-old axiom: if you want something done right, you’ve got to do it yourself.