I'm pulling my hair out here.
I'm running Opsview-core and am calling check_by_ssh which executes a script on the remote linux server which check if a database process is running or not.
It all works fine for a few minutes and returns the correct result, then I suddenly start getting "CRITICAL - Plugin timed out after 60 seconds" and "(Return code of 255 is out of bounds)" messages for no apparent reason.
I was oringally using check_nrpe but had similar timeout issues which i though might have been caused due to too many calls on nrpe, but that seems not to be the case.
I have between 300 and 350 of these checks taking place per server, I have increased the sshd_config MaxSessions value to 1000 but I'm still getting these errors at random.
Any assistance will really be appreciated.