-
Bug
-
Resolution: Incomplete
-
Critical
-
None
-
Master is on Debian Lenny 64bit, Slaves are : Debian Lenny x64, Ubuntu 9.04 x64, Ubuntu 9.10 x64. All are Xen Virtual Hosts
We installed hudson 3 month ago and have successfully been running it with two Ubuntu Slaves (9.04 and 9.10). Recently we added Debian Lenny slave.
About a week ago slaves started disconnecting for no apparent reason. Only noticed one at the time. And its a different node all the time. Manually disconnecting and connecting it again puts them back online.
All slaves are launched via SSH. I tried downgrading it to 1.348 but that did not solve the problem.
I am not too sure where to look as I am very new to Hudson.
When looking at the log it says "Ping Failed. Terminating". I went into Manage Hudson -> Nodes -> Configure and unticked "Preventive Node Monitoring | Response Time"
Here is the log for a slave when it goes offline.
##############################################################################
04/08/10 13:13:10] [SSH] Opening SSH connection to 192.168.0.211:22.
[04/08/10 13:13:10] [SSH] Authenticating as root/******.
[04/08/10 13:13:10] [SSH] Authentication successful.
[04/08/10 13:13:10] [SSH] The remote users environment is:
BASH=/bin/bash
BASH_ARGC=()
BASH_ARGV=()
BASH_EXECUTION_STRING=set
BASH_LINENO=()
BASH_SOURCE=()
BASH_VERSINFO=([0]="3" [1]="2" [2]="48" [3]="1" [4]="release" [5]="x86_64-pc-linux-gnu")
BASH_VERSION='3.2.48(1)-release'
DIRSTACK=()
EUID=0
GROUPS=()
HOME=/root
HOSTNAME=build-ubuntu-904-amd64
HOSTTYPE=x86_64
IFS=$' \t\n'
LANG=en_GB.UTF-8
LOGNAME=root
MACHTYPE=x86_64-pc-linux-gnu
MAIL=/var/mail/root
OPTERR=1
OPTIND=1
OSTYPE=linux-gnu
PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games
PIPESTATUS=([0]="0")
PPID=11101
PS4='+ '
PWD=/root
SHELL=/bin/bash
SHELLOPTS=braceexpand:hashall:interactive-comments
SHLVL=1
SSH_CLIENT='192.168.0.210 55147 22'
SSH_CONNECTION='192.168.0.210 55147 192.168.0.211 22'
TERM=dumb
UID=0
USER=root
XDG_SESSION_COOKIE=b108e1bf01f477ea58d142124b47304f-1270732390.165528-775798503
_=']'
[04/08/10 13:13:10] [SSH] Checking java version of java
[04/08/10 13:13:10] [SSH] java -version returned 1.6.0_0.
[04/08/10 13:13:10] [SSH] Starting sftp client.
[04/08/10 13:13:10] [SSH] Copying latest slave.jar...
[04/08/10 13:13:10] [SSH] Copied 214,582 bytes.
[04/08/10 13:13:10] [SSH] Starting slave process: cd '/var/hudson' && java -jar slave.jar
<===[HUDSON REMOTING CAPACITY]===>���channel started
Slave.jar version: 1.353
This is a Unix slave
Copied maven-agent.jar
Copied maven-interceptor.jar
Copied maven2.1-interceptor.jar
Slave successfully connected and online
Ping failed. Terminating
Would be great if someone could point in the right direction.
Many thanks