Hi all,
I had numerous filesystem failures two nights ago with several drives on vm servers giving "The VBDA named "C:" on host vmxx.xxx.net
reached its inactivity timeout of 7200 seconds.
The agent on host will be shutdown.
Followed some hours later by:
[Critical] From: BMA@xxxxx.net "(x)" Time: 23/07/2013 02:16:13
Bad message format
and:
Cannot connect to Media Agent on system xxxxx.net, port (xx) (IPC Cannot Connect) => aborting.
[Critical] From: VBDA@xxxxxx "C:" Time: 23/07/2013 02:16:51
Unexpected close reading NET message => aborting.
and:
IPC failure reading NETIO message (IPC Write Error
System error: [10053] Software caused connection abort
) => aborting
It all looks very serious and as I say I had numerous failures. I did an L&TT test on the drive which came back clean. I checked events on the cell manager and no issues were reported and also there were no network issues at the time.
The re-run of failed backups the morning after or the next nights had no such problems.
Could anyone point me in the right direction of what I should be looking at here?
any contributions much appreciated as usual,
thanks,
Steven