[Issue 2397] New - Irersponsible machnes in master-slave configuration

classic Classic list List threaded Threaded
7 messages Options
Reply | Threaded
Open this post in threaded view
|

[Issue 2397] New - Irersponsible machnes in master-slave configuration

musilt2
https://hudson.dev.java.net/issues/show_bug.cgi?id=2397
                 Issue #|2397
                 Summary|Irersponsible machnes in master-slave configuration
               Component|hudson
                 Version|current
                Platform|All
              OS/Version|All
                     URL|
                  Status|NEW
       Status whiteboard|
                Keywords|
              Resolution|
              Issue type|DEFECT
                Priority|P1
            Subcomponent|master-slave
             Assigned to|issues@hudson
             Reported by|musilt2






------- Additional comments from [hidden email] Mon Sep 22 14:57:26 +0000 2008 -------
From time to time, execution on some slave machines fails. We are using matrix
job to run our test on slave machines (connected by ssh). However, sometimes
console output on some slave machines shows only:

started
Building remotely on Lin-Ubuntu-1-stable

and nothing happens, even buildtimeout plugin does not handle this and build has
to be killed. Next scheduled build for such a machine behaves the same. The only
way how to get this work is to disconnect and reconnect the slave. Seems to me
like some problems in master-slave channel,because even when i try to create
simple job, that just should execute some shell command (e.g. ls), build is
still not started and output is showing message above. So probably jobtype
independent bug.

(Detailed info can be found at
http://markmail.org/message/o6zfqp3kpqcjo4jl#query:Build%20on%20slave%20machine%20is%20not%20started+page:1+mid:y7kyokagweyjllho+state:results)


we have been asked at mailing list to capture:

1) master threaddump http://server/hudson/threadDump 
2) slave system info http://server/hudson/computer/SLAVENAME/systemInfo
3) slave log http://server/hudson/computer/SLAVENAME/log

we are not able to cature 2) - it is unavailable, probably because of broken
master-slave channel.

Attaching 1) and 3)

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[Issue 2397] Irersponsible machnes in master-slave configuration

musilt2
https://hudson.dev.java.net/issues/show_bug.cgi?id=2397






------- Additional comments from [hidden email] Mon Sep 22 14:58:02 +0000 2008 -------
Created an attachment (id=387)
thread dump


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[Issue 2397] Irersponsible machnes in master-slave configuration

musilt2
In reply to this post by musilt2
https://hudson.dev.java.net/issues/show_bug.cgi?id=2397






------- Additional comments from [hidden email] Mon Sep 22 14:58:25 +0000 2008 -------
Created an attachment (id=388)
log


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[Issue 2397] Irersponsible machines in master-slave configuration

mirilovic
In reply to this post by musilt2
https://hudson.dev.java.net/issues/show_bug.cgi?id=2397



User mirilovic changed the following:

                What    |Old value                 |New value
================================================================================
                      CC|''                        |'mirilovic'
--------------------------------------------------------------------------------
                 Summary|Irersponsible machnes in m|Irersponsible machines in
                        |aster-slave configuration |master-slave configuration
--------------------------------------------------------------------------------




------- Additional comments from [hidden email] Mon Sep 22 15:04:48 +0000 2008 -------
... or any advices how to capture 2) ?

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[Issue 2397] Unresponsive slave machines

mdonohue
In reply to this post by musilt2
https://hudson.dev.java.net/issues/show_bug.cgi?id=2397



User mdonohue changed the following:

                What    |Old value                 |New value
================================================================================
                 Summary|Irersponsible machines in |Unresponsive slave machine
                        |master-slave configuration|s
--------------------------------------------------------------------------------




------- Additional comments from [hidden email] Sun Apr  5 08:15:28 +0000 2009 -------
Fix summary

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[Issue 2397] Unresponsive slave machines

mdonohue
In reply to this post by musilt2
https://hudson.dev.java.net/issues/show_bug.cgi?id=2397






------- Additional comments from [hidden email] Sun Apr  5 08:43:56 +0000 2009 -------
*** Issue 3217 has been marked as a duplicate of this issue. ***

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[Issue 2397] Unresponsive slave machines

Kohsuke Kawaguchi
Administrator
In reply to this post by musilt2
https://hudson.dev.java.net/issues/show_bug.cgi?id=2397



User kohsuke changed the following:

                What    |Old value                 |New value
================================================================================
                  Status|NEW                       |STARTED
--------------------------------------------------------------------------------
                Priority|P1                        |P2
--------------------------------------------------------------------------------




------- Additional comments from [hidden email] Thu Jul  9 19:23:10 +0000 2009 -------
Master asked slave to create a directory, but the slave isn't responding for
some reason.

I really need to see the stack dump of the slave JVM --- can you get it via
jstack by logging into the slave in question?


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]