Describe the issue
Batch2 processing of job instances can load a significant number of workchunks to be processed in parallel. In some testing this has exceeded 100k workchunks, which means high availability and saturation of the processing nodes is paramount for efficient processing and maximum parallelization.
The current JobInstanceProcessor class appears to use a paging iterator to manage this rather than a Java stream or something similar. When testing at scale, it has been observed that nodes process their batch of tasks and then sit idle waiting for the iterator to release the next iteration of chunks.
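For illustration only, here is a minimal sketch of that idle-wait pattern (all names are hypothetical stand-ins; this is not the actual JobInstanceProcessor code). Dispatch stalls at every page boundary until the slowest chunk in the page completes:

```java
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;

class PagedDispatchSketch {
    interface PageFetcher {
        List<String> fetchPage(int pageIndex, int pageSize);
    }

    void process(PageFetcher fetcher, ExecutorService consumers) throws InterruptedException {
        int page = 0;
        List<String> chunkIds;
        while (!(chunkIds = fetcher.fetchPage(page++, 100)).isEmpty()) {
            CountDownLatch latch = new CountDownLatch(chunkIds.size());
            for (String chunkId : chunkIds) {
                consumers.submit(() -> {
                    // processChunk(chunkId); // hypothetical per-chunk work
                    latch.countDown();
                });
            }
            // Every consumer that finishes early idles here until the slowest
            // chunk in the page completes; this page-boundary stall produces
            // the observed trickle down to a single busy consumer.
            latch.await();
        }
    }
}
```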
To Reproduce
Run a Batch2 job that processes a large quantity of workchunks (20k workchunks is a good example).
This could be a job where the first step creates the workchunks to process and the second step processes them across the available consumers.
Observe the behavior of step 2 of the job. In a single-node deployment, Batch2 processing should allow all consumers to process simultaneously. This can be observed per instance in the workchunk table by counting chunks with status 'in-progress' (see the polling sketch below).
As the job runs, you will observe on several occasions that the 'in-progress' queue initially saturates fully across all consumers and then trickles down to a single consumer, instead of remaining at full capacity across all available consumers.
This pattern repeats over and over as records are processed, which significantly impacts performance.
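One way to watch this is a small polling loop against the workchunk table. The table and column names here (BT2_WORK_CHUNK, INSTANCE_ID, STAT, 'IN_PROGRESS') are assumptions based on the default batch2 schema and should be verified against your version:

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.time.Instant;

class InProgressMonitor {
    public static void main(String[] args) throws Exception {
        String url = args[0];        // JDBC URL for the HAPI FHIR database
        String instanceId = args[1]; // the Batch2 job instance under test
        // Assumed default batch2 schema names; verify against your version.
        String sql = "SELECT COUNT(*) FROM BT2_WORK_CHUNK"
                + " WHERE INSTANCE_ID = ? AND STAT = 'IN_PROGRESS'";
        try (Connection conn = DriverManager.getConnection(url)) {
            while (true) {
                try (PreparedStatement ps = conn.prepareStatement(sql)) {
                    ps.setString(1, instanceId);
                    try (ResultSet rs = ps.executeQuery()) {
                        rs.next();
                        System.out.println(Instant.now() + " in-progress=" + rs.getLong(1));
                    }
                }
                Thread.sleep(5_000); // poll every 5 seconds
            }
        }
    }
}
```

Plotting these counts over time makes the saturate-then-trickle pattern described above easy to see.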
Expected behavior
JobInstanceProcessor should allow all available consumers to process available tasks until no tasks remain. The only time 'in-progress' tasks should fall below available capacity is when the number of remaining tasks is less than the available capacity.
Suggested fix
Convert the usage of the paging iterator to a stream instead, to avoid the need for manual iteration and take a more best-practices approach to managing large batched workloads.
Another option would be to eliminate the use of PagingIterator and find an alternative approach to queueing workchunks that neither bloats memory nor underutilizes the available processors; a rough sketch of this option follows.
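As an illustration of that second option, here is a minimal producer/consumer sketch using a bounded queue. All names (PageFetcher, processChunk, etc.) are hypothetical stand-ins, not HAPI FHIR APIs; it only shows the shape of the idea, with the producer paging through the database while consumers drain chunks continuously:

```java
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

class StreamingDispatchSketch {
    interface PageFetcher {
        List<String> fetchPage(int pageIndex, int pageSize);
    }

    static final String POISON = "__no_more_chunks__";

    void process(PageFetcher fetcher, int consumerCount) throws InterruptedException {
        // A small bounded queue keeps memory flat even with 100k+ workchunks,
        // while letting the producer stay ahead of the consumers so none of
        // them stalls on a page boundary.
        BlockingQueue<String> queue = new ArrayBlockingQueue<>(1_000);

        ExecutorService consumers = Executors.newFixedThreadPool(consumerCount);
        for (int i = 0; i < consumerCount; i++) {
            consumers.submit(() -> {
                try {
                    while (true) {
                        String chunkId = queue.take();
                        if (POISON.equals(chunkId)) {
                            break;
                        }
                        // processChunk(chunkId); // hypothetical per-chunk work
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }

        // The producer still pages through the database, but consumers drain
        // the queue continuously instead of waiting for each page to finish.
        int page = 0;
        List<String> chunkIds;
        while (!(chunkIds = fetcher.fetchPage(page++, 100)).isEmpty()) {
            for (String chunkId : chunkIds) {
                queue.put(chunkId); // blocks only when the queue is full
            }
        }
        for (int i = 0; i < consumerCount; i++) {
            queue.put(POISON); // one shutdown marker per consumer
        }
        consumers.shutdown();
        consumers.awaitTermination(1, TimeUnit.HOURS);
    }
}
```

With this shape, a slow chunk delays only its own worker rather than the whole page, and the bounded queue caps memory regardless of the job size.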