EMC’s Chad Sakac goes into SRM failback (essentially running SRM back from the Recovery Site to your original site), but in the midst of this talks about the value proposition of SRM when, after all, many of the required steps can just be scripted, right?
Link: Virtual Geek: A Few Technical Threads – Part 3: SRM Failback.
- SRM exactly automates those steps. Automation in a DR
situation is everything. Buildings will be burning or sprinklers
running, and cellphones will be ringing. It’s not the time for complex manual operations.- Could
it be manually scripted? Sure. Who will maintain that script?
Traditionally – DR was reserved generally for mainframes and other
things deemed "mission critical" enough for expensive Disaster
Recovery. In those cases, the environments are VERY static – so the
idea of creating a DR plan, refreshing it and testing it once a year at
a multi-million dollar cost was reasonable. VMware is different, and
SRM brings DR to a whole new use case. This same week, I talked to a
customer who is adding 100 VMs a week on their infrastructure. Heck,
even if you’re doing 1 a week, will you update that script constantly?- They
tested a single VM booting. Yeah! They have 400 VMs today. First
of all, who’s going to manually register all those VMs. More
importantly – what is are the DEPENDENCIES between the VMs? There is
a specific start sequence needed, or your entire DR plan will not work.
I’m always interested in IT how projects needing cross-domain expertise
are hard, because everyone trivializes everyone else’s work or
complexity. AD and DNS, then Exchange/SQL Server, then Sharepoint –
and somewhere in their, your hundreds of other VMs – in a specific
start sequence. Who will figure out the specific start
dependencies the first time, and how will that be maintained in this
uber-script? SRM help, and come to AD3500 at VMworld to find out what
EMC is doing to make this easy.- The tested booting the VM on an isolated vswitch. The IP addressing scheme at the remote site was totally different. What will update all the IPs? Update DNS? Do any hosts hardcode IPs rather than use DNS names… anywhere?
- The
test (including the one they did) is a useless test unless it is an
END-TO-END test. Otherwise, you have told your management that you
are ready, and the unthinkable happens and you have failed them, you’ve
failed your business, and you’ve failed yourself. In other words, a successful "pseudo test" which leads to "we have it figured out" unless you REALLY test – GUARANTEES FAILURE.