Discussion:
shepherd exited with exit status = 15
llikethat
2010-12-22 10:02:11 UTC
Permalink
Hi,
I have a set of compute nodes running SGE6.2u5 in windows. When the daemon (sge_execd) is started it is fine. Sometimes when a job is run on these nodes, it exists with the following error and the node goes into an unknown state.
I'm not able to figure out what could be the possible reason for this, any pointers would be great.
shepherd of job 3518.17 exited with exit status = 15
The node's hit this situation randomly, it is not bound to any specific machine.
Thanks,

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=308229

To unsubscribe from this discussion, e-mail: [users-***@gridengine.sunsource.net].
fx
2010-12-22 22:25:01 UTC
Permalink
Post by llikethat
Hi,
I have a set of compute nodes running SGE6.2u5 in windows. When the daemon (sge_execd) is started it is fine. Sometimes when a job is run on these nodes, it exists with the following error and the node goes into an unknown state.
I'm not able to figure out what could be the possible reason for this, any pointers would be great.
shepherd of job 3518.17 exited with exit status = 15
I know nothing about the MS Windows clients, but you want to check for
error messages in the equivalent of syslog for the host and the SGE host
spool file.
--
Dave Love
Advanced Research Computing, Computing Services, University of Liverpool
AKA ***@gnu.org

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=310558

To unsubscribe from this discussion, e-mail: [users-***@gridengine.sunsource.net].
llikethat
2011-01-03 04:08:52 UTC
Permalink
Hi,
I got this information from the SGE host spool file. I couldn't find any other information  pertaining to this error.
Thanks,

--- On Thu, 23/12/10, fx <***@liverpool.ac.uk> wrote:

From: fx <***@liverpool.ac.uk>
Subject: Re: [GE users] shepherd exited with exit status = 15
To: ***@gridengine.sunsource.net
Date: Thursday, 23 December, 2010, 3:55 AM
Post by llikethat
Hi,
I have a set of compute nodes running SGE6.2u5 in windows. When the daemon (sge_execd) is started it is fine. Sometimes when a job is run on these nodes, it exists with the following error and the node goes into an unknown state.
I'm not able to figure out what could be the possible reason for this, any pointers would be great.
shepherd of job 3518.17 exited with exit status = 15
I know nothing about the MS Windows clients, but you want to check for
error messages in the equivalent of syslog for the host and the SGE host
spool file.
--
Dave Love
Advanced Research Computing, Computing Services, University of Liverpool
AKA ***@gnu.org

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=310558

To unsubscribe from this discussion, e-mail: [users-***@gridengine.sunsource.net].

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=312240

To unsubscribe from this discussion, e-mail: [users-***@gridengine.sunsource.net].
Continue reading on narkive:
Loading...