Job in RECOVERING_JOBS group does not appear to respect DisallowConcurrentExecution annotation #1097

luke-marrs · 2024-01-31T18:44:30Z

Quartz version: 2.3.2

I'm not sure if this is a bug or not: I have a job that is is annotated with @DisallowConcurrentExecution, but during a particular scenario, I found that execution of that job was attempted concurrently.

Here's my scenario:

We have quartz set up on three nodes with a thread pool size of two worker threads per node. At this time I don't think the multi-node setup contributed to this issue.
During a particular event, over 200 triggers fired for job - all had the same job group, but all had different job names. Since each job instance takes multiple minutes, the vast majority of the jobs started misfiring every minute.
For some reason (not yet sure why), one of the job instances (name: 17) needed to be recovered. However, it also fired "normally" after misfiring for a few minutes on a different node.
So, job 17 started in "myGroup", then also started in the "RECOVERING_JOBS" group during the same time period. From taking a peek at the code that checks isConcurrentExectionDisallowed in the JobStoreSupport class, it appears that a second instance of a job won't be started only if the job key matches - that is, the job must have the same group and name. But in this case, the triggered job instance is in the "RECOVERING_JOBS" group, while the executing job instance is in "myGroup". Here's what the logs look like for this scenario. Notice the overlap for job name 17 starting at 16:56:57.

16:43:49 [3_Worker-2] Trigger [group: myGroup, name: 17] fired for job [group: myGroup, name: 17]
16:56:57 [2_Worker-2] Trigger [group: RECOVERING_JOBS, name: recover_***] fired for job [group: myGroup, name: 17]
16:58:14 [3_Worker-2] Trigger [group: myGroup, name: 17] completed for job [group: myGroup, name: 17]
16:58:34 [2_Worker-2] Trigger [group: RECOVERING_JOBS, name: recover_***] completed for job [group: myGroup, name: 17]

Is this intentional behavior? Should the original job group & name be checked instead when a job is in the recovery group and isConcurrentExectionDisallowed is true?

The text was updated successfully, but these errors were encountered:

adarshvijayaraghavan · 2024-08-09T01:23:00Z

This could be related to the following issues, which all seem to arise from the same cause.

jhouserizer · 2024-10-14T23:32:13Z

This is a known limitation of job recovery, it could certainly be improved.

luke-marrs · 2024-10-15T13:26:17Z

This is a known limitation of job recovery, it could certainly be improved.

Thank you for the confirmation. We have worked around this for now by disabling recovery for jobs that can't have concurrent execution.

rkorpu01 · 2024-10-16T18:22:56Z

@jhouserizer Is there any plans to fix this issue.

jhouserizer · 2024-10-16T19:04:47Z

No immediate plan/priority for it, no - this limitation has existed for about 2 decades now. PRs welcome from anyone confident about a quality solution.

jhouserizer added is:enhancement Enhancement to an existing feature needs:review Needs review / investigation labels Oct 14, 2024

jhouserizer mentioned this issue Oct 16, 2024

quartz fires duplicated job when the job running for more than 12 hours #1070

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Job in RECOVERING_JOBS group does not appear to respect DisallowConcurrentExecution annotation #1097

Job in RECOVERING_JOBS group does not appear to respect DisallowConcurrentExecution annotation #1097

luke-marrs commented Jan 31, 2024

adarshvijayaraghavan commented Aug 9, 2024

jhouserizer commented Oct 14, 2024

luke-marrs commented Oct 15, 2024

rkorpu01 commented Oct 16, 2024

jhouserizer commented Oct 16, 2024

Job in RECOVERING_JOBS group does not appear to respect DisallowConcurrentExecution annotation #1097

Job in RECOVERING_JOBS group does not appear to respect DisallowConcurrentExecution annotation #1097

Comments

luke-marrs commented Jan 31, 2024

adarshvijayaraghavan commented Aug 9, 2024

jhouserizer commented Oct 14, 2024

luke-marrs commented Oct 15, 2024

rkorpu01 commented Oct 16, 2024

jhouserizer commented Oct 16, 2024