Uploaded image for project: 'Jenkins'
  1. Jenkins
  2. JENKINS-50243

windows slave node went offline when build in progress

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Cannot Reproduce
    • Icon: Major Major
    • remoting
    • None

      HI,

      Windows slave node went offline during the build executing. It cause the build failure. 3 kinds of log collected as below.

      Our windows slave use this command to launch -

      /proj/selijenkins/oge/latest/ogejenkins-master.sh --debug  --node selilsw687 --fetch_slave_jar_using_ssl --noexportenv "-l jenkinsq,arch=lx24-amd64,OS=SLES12.1"

      The slave log when the issue appeared :(Checking from jenkins web server side,the issue appeared from 2014 to 2018))

      ================================

      [03/13/18 06:40:22] Launching slave agent
      $ /proj/selijenkins/oge/latest/ogejenkins-master.sh --node selilsw687 --fetch_slave_jar_using_ssl --noexportenv "-l jenkinsq,arch=lx24-amd64,OS=SLES12.1"
      Entering ogejenkins-master.sh at Tue Mar 13 06:40:22 CET 2018
      INFO: Executing garbage collection
      rm /home/bscauto/.selijenkins/log/jk4-40231.out
      rm /home/bscauto/.selijenkins/log/jk4-40231.err
      rm /home/bscauto/.selijenkins/tmp/jk4-40999.qsub
      INFO: Using workspace: /proj/eiffel005_config/jenkins_home/workspaces/fem102-selilsw687
      INFO: Preparing to schedule job in Grid Engine...
      INFO: Slave will log to: /home/bscauto/.selijenkins/log/jk4-40436.out
      INFO: Job submitted to Grid Engine
      Waiting max 120 minutes for job to be scheduled...
      INFO: Beginning to listen on port 40436 at Tue Mar 13 06:40:23 CET 2018
      Your job 598391 ("jk4-40436") has been submitted
      <===[JENKINS REMOTING CAPACITY]===>Slave.jar version: 2.59
      This is a Unix slave
      Slave successfully connected and online
      ERROR: Connection terminated
      java.io.IOException: Unexpected EOF
      at hudson.remoting.ChunkedInputStream.readUntilBreak(ChunkedInputStream.java:99)
      at hudson.remoting.ChunkedCommandTransport.readBlock(ChunkedCommandTransport.java:39)
      at hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
      at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:48)

      ===============================================

      The console log on master side as below  when the issue appear:

      ==============================

      08:09:48 2018-03-16 08:09:48,335 [INFO] ltssetup: TG 0 TRXC 7 System Information reconfigured
      08:09:48 2018-03-16 08:09:48,335 [INFO] FMStopReject - expected if FAS is configured for the first time
      08:09:48 2018-03-16 08:09:48,336 [INFO] ltssetup: FAS for 16 ARFCN started on TG 0 TRXC 0
      08:09:49 2018-03-16 08:09:49,346 [INFO] LTS setup OK
      08:09:49 2018-03-16 08:09:49,350 [INFO] Stopping Airmosaic (if already running)
      08:09:49 2018-03-16 08:09:49,371 [INFO] Checking TSTM selimssim002-tstm for running scenarios
      08:30:27 Slave went offline during the build
      08:30:27 ERROR: Connection was broken: java.io.IOException: Unexpected EOF
      08:30:27 at hudson.remoting.ChunkedInputStream.readUntilBreak(ChunkedInputStream.java:99)
      08:30:27 at hudson.remoting.ChunkedCommandTransport.readBlock(ChunkedCommandTransport.java:39)
      08:30:27 at hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
      08:30:27 at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:48)
      08:30:27
      08:30:27 Build step 'Execute shell' marked build as failure
      08:30:27 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:27 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:27 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:27 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:27 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:27 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:27 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:27 CreateLinkToUfteLogs.groovy: Log directory found: /proj/bscautodata/public/html/bts_ci/ufte/selilsw686/20180316_065339
      08:30:28 Email was triggered for: Always
      08:30:28 Sending email for trigger: Always
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: Error: No workspace found!
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 selilsw1125_CII_Load_And_Stability_PL4_1_mdevg5_b801_#50205 is still in progress; ignoring for purposes of comparison
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 ERROR: selilsw686 is offline; cannot locate Java 8
      08:30:28 node = selilsw686
      08:30:28 Email sending was cancelled by user script.
      08:30:28 Finished: FAILURE

      =================================

      Windows slave log at path C:\jenkins\slaveroot\, please see the attachment

      =================================

      Jan 24, 2018 10:48:42 AM hudson.remoting.SynchronousCommandTransport$ReaderThread run SEVERE: I/O error in channel channel java.net.SocketException: Connection reset at java.net.SocketInputStream.read(Unknown Source) at java.net.SocketInputStream.read(Unknown Source) at java.io.FilterInputStream.read(Unknown Source) at java.io.BufferedInputStream.fill(Unknown Source) at java.io.BufferedInputStream.read(Unknown Source) at hudson.remoting.FlightRecorderInputStream.read(FlightRecorderInputStream.java:82) at hudson.remoting.ChunkedInputStream.readHeader(ChunkedInputStream.java:72) at hudson.remoting.ChunkedInputStream.readUntilBreak(ChunkedInputStream.java:103) at hudson.remoting.ChunkedCommandTransport.readBlock(ChunkedCommandTransport.java:33) at hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34) at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:48) channel stopped channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started channel started

      =======================================
       

            oleg_nenashev Oleg Nenashev
            ezshuji Jane SHU
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: