Uploaded image for project: 'Jenkins'
  1. Jenkins
  2. JENKINS-36013

Automatically abort ExecutorPickle rehydration from an ephemeral node

    Details

    • Similar Issues:
    • Sprint:
      Pipeline - July/August

      Description

      ExecutorPickle.rehydrate ought to be able to detect that it has been spinning in circles because the agent node it was supposed to run on is not in the Jenkins node list, and automatically abort, causing the build to fail with a comprehensible message rather than just hanging indefinitely. (As opposed to being registered but offline, which is normal enough for a JNLP agent etc.—in such cases we just want to wait for the agent to come back online.)

      This would provide a better experience for the case of a build which was running on an EphemeralNode (such as from a Cloud without durable-task integration) when Jenkins was restarted. An agent using an inappropriate RetentionStrategy is trickier since it might still be defined after a restart, but will soon be terminated. Similarly, there may be cases where the agent is actually going to be redefined (with the same name) when it is attached after the restart—not sure about the Swarm plugin, but CloudBees DEV@cloud OPEs work this way. To prevent the build from being killed too aggressively, the cleanup should be delayed until some time has elapsed since rehydration began (or, ideally, since Jenkins completed initialization)—say, five minutes.

        Attachments

          Issue Links

            Activity

            jglick Jesse Glick created issue -
            jglick Jesse Glick made changes -
            Field Original Value New Value
            Epic Link JENKINS-35399 [ 171192 ]
            jglick Jesse Glick made changes -
            Link This issue depends on JENKINS-26130 [ JENKINS-26130 ]
            rtyler R. Tyler Croy made changes -
            Workflow JNJira [ 172654 ] JNJira + In-Review [ 184708 ]
            abayer Andrew Bayer made changes -
            Component/s pipeline-general [ 21692 ]
            abayer Andrew Bayer made changes -
            Component/s workflow-plugin [ 18820 ]
            jglick Jesse Glick made changes -
            Component/s workflow-durable-task-step-plugin [ 21715 ]
            Component/s pipeline [ 21692 ]
            jglick Jesse Glick made changes -
            Link This issue relates to JENKINS-41569 [ JENKINS-41569 ]
            hrmpw Patrick Wolf made changes -
            Link This issue is related to JENKINS-41569 [ JENKINS-41569 ]
            jglick Jesse Glick made changes -
            Link This issue relates to JENKINS-43607 [ JENKINS-43607 ]
            jglick Jesse Glick made changes -
            Link This issue relates to JENKINS-33761 [ JENKINS-33761 ]
            jamesdumay James Dumay made changes -
            Labels robustness cloudbees-internal-pipeline robustness
            jamesdumay James Dumay made changes -
            Priority Major [ 3 ] Critical [ 2 ]
            jamesdumay James Dumay made changes -
            Sprint Pipeline - July/August [ 371 ]
            jamesdumay James Dumay made changes -
            Assignee Jesse Glick [ jglick ] Sam Van Oort [ svanoort ]
            svanoort Sam Van Oort made changes -
            Status Open [ 1 ] In Progress [ 3 ]
            oleg_nenashev Oleg Nenashev made changes -
            Link This issue is related to JENKINS-45917 [ JENKINS-45917 ]
            svanoort Sam Van Oort made changes -
            Status In Progress [ 3 ] In Review [ 10005 ]
            svanoort Sam Van Oort made changes -
            Status In Review [ 10005 ] Closed [ 6 ]
            Resolution Fixed [ 1 ]
            svanoort Sam Van Oort made changes -
            Link This issue is duplicated by JENKINS-45917 [ JENKINS-45917 ]
            cloudbees CloudBees Inc. made changes -
            Remote Link This issue links to "CloudBees Internal CD-179 (Web Link)" [ 18944 ]
            cloudbees CloudBees Inc. made changes -
            Remote Link This issue links to "CloudBees Internal CLTS-2226 (Web Link)" [ 18978 ]
            jglick Jesse Glick made changes -
            Link This issue relates to JENKINS-49707 [ JENKINS-49707 ]
            jglick Jesse Glick made changes -
            Remote Link This issue links to "workflow-durable-task-step #47 (Web Link)" [ 22735 ]
            jglick Jesse Glick made changes -
            Remote Link This issue links to "workflow-durable-task-step #48 (Web Link)" [ 22736 ]

              People

              • Assignee:
                svanoort Sam Van Oort
                Reporter:
                jglick Jesse Glick
              • Votes:
                6 Vote for this issue
                Watchers:
                11 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: