Hello,
I came across one issue in VCS where application is configured using Application agent. We found VCS iniated failover of application after Application agent found Application went offline outside cluster ( not initiated by vcs ), however from Application logs we found everything was fine on application.
Then we tested Monitor Program ( script ) provided by Application. It was well defined to monitor a PID and provide exit code based on PID available to VCS agent. i.e exit 110 if running, 110 if not running.
To test if Monitor script is giving correct exit code to VCS agent, we introduced echo messages in script directed to a log file locally. And we found some strange input in our created log file.
Note : MonitorInverval = 60
MonitorTimeout = 60
We noticed, when Monitor script run after exacly 60 seconds, it gives exit code 110 which is fine. However, we found Monitor Scripts executes by VCS before 60 seconds sometimes, i.e arund 7-8 seconds, and at that time it gives exist code 100 which is problem.
I want to know why VCS agent is calling Monigor Programm before 60 second ( within 7-8 seconds ) after last run.
# cat is_server_running.log | egrep 'exit'
[Tue Apr 1 13:12:34 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:23:52 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:27:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:27:32 BST 2014] exit 100
[Tue Apr 1 13:28:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:29:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:30:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:31:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:32:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:32:31 BST 2014] exit 100
[Tue Apr 1 13:33:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:34:01 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:34:26 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:35:26 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:36:26 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:37:26 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:37:32 BST 2014] exit 100
[Tue Apr 1 13:38:26 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:39:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:39:30 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:40:03 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:40:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:41:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:42:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:42:32 BST 2014] exit 100
[Tue Apr 1 13:43:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:44:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:45:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:46:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:47:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:47:32 BST 2014] exit 100
[Tue Apr 1 13:48:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:49:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:50:25 BST 2014] Server is already running, exit 110
[Tue Apr 1 13:51:25 BST 2014] Server is already running, exit 110