Yesterday I was alerted to the fact that there was a change in the VMware 5.5 U2 heartbeat method. In U2 and vSphere 6 it now uses ATS on VAAI enabled arrays to do heartbeats. Some arrays are experiencing outages due to this change. It’s not clear to me what array are exactly effected other than IBM has posted an article here. It seems to cause one of the following symptoms : Host disconnects from vCenter or storage disconnects from host. As you can see one of these (storage) is a critical problem creating an all paths down situation potentially.
The fix suggested by IBM disabled the ATS lock method and returns it to pre U2 methods. It’s my understanding that this is an advanced setting that can be applied without a reboot. I have also been told that if you create this advanced setting it will be applied via host profile or powercli.
It is very early in the process in all accounts you should open a VMware ticket to get their advice on how to deal with this issue. They are working on the problem and should produce a KB when possible with more information. I personally would not apply this setting unless you are experiencing the issue as identified by VMware. I wish I had more information but it has not happened in my environment.
Post comments if you are experiencing this issue with more information. I will update the article once the KB is posted.