Uploaded image for project: 'Jenkins'
  1. Jenkins
  2. JENKINS-57555

Remote children jobs should be restartable in case the connection drops

    Details

    • Type: New Feature
    • Status: Open (View Workflow)
    • Priority: Minor
    • Resolution: Unresolved
    • Component/s: core
    • Labels:
    • Environment:
      Jenkins ver. 2.164.3
    • Similar Issues:

      Description

      When a job running on a remote node "fails" because the connection drops, it should be possible to restart it automatically. Currently it's cumbersome to get a list of the failed children jobs and checking how they failed by grepping for java.io.EOFException in their log files.

      https://groups.google.com/forum/#!topic/jenkinsci-users/sjnF6zlPd3Q

      This is how the failure looks in the children job console output:

      Building remotely on instance-1 (tag1) in workspace /var/lib/jenkins/workspace/eval
      [vmu-eval-single] $ /bin/sh -xe /tmp/jenkins4616924287086740166.sh
      + /path/to/evaluation-tool
      FATAL: command execution failed
      java.io.EOFException
      	at java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2681)
      	at java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:3156)
      	at java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:862)
      	at java.io.ObjectInputStream.<init>(ObjectInputStream.java:358)
      	at hudson.remoting.ObjectInputStreamEx.<init>(ObjectInputStreamEx.java:49)
      	at hudson.remoting.Command.readFrom(Command.java:140)
      	at hudson.remoting.Command.readFrom(Command.java:126)
      	at hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:36)
      	at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:63)
      Caused: java.io.IOException: Unexpected termination of the channel
      	at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:77)
      Caused: java.io.IOException: Backing channel 'instance-1' is disconnected.
      	at hudson.remoting.RemoteInvocationHandler.channelOrFail(RemoteInvocationHandler.java:214)
      	at hudson.remoting.RemoteInvocationHandler.invoke(RemoteInvocationHandler.java:283)
      	at com.sun.proxy.$Proxy78.isAlive(Unknown Source)
      	at hudson.Launcher$RemoteLauncher$ProcImpl.isAlive(Launcher.java:1144)
      	at hudson.Launcher$RemoteLauncher$ProcImpl.join(Launcher.java:1136)
      	at hudson.tasks.CommandInterpreter.join(CommandInterpreter.java:155)
      	at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:109)
      	at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:66)
      	at hudson.tasks.BuildStepMonitor$1.perform(BuildStepMonitor.java:20)
      	at hudson.model.AbstractBuild$AbstractBuildExecution.perform(AbstractBuild.java:744)
      	at hudson.model.Build$BuildExecution.build(Build.java:206)
      	at hudson.model.Build$BuildExecution.doRun(Build.java:163)
      	at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:504)
      	at hudson.model.Run.execute(Run.java:1816)
      	at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
      	at hudson.model.ResourceController.execute(ResourceController.java:97)
      	at hudson.model.Executor.run(Executor.java:429)
      

        Attachments

          Activity

          There are no comments yet on this issue.

            People

            • Assignee:
              Unassigned
              Reporter:
              aleb Alexandru Băluț
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated: