Ensure triggering of callback in chord #397

s-bessing · 2023-07-03T14:13:37Z

If a task is, for whatever reason, executed twice, the callback is never triggered because the chord is deleted before the last task is executed. The last task then raises a warning that the chord can't be found.
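To make the failure mode concrete, here is a toy in-memory model of the counter behaviour described above. It is only a sketch: `ToyChord`, `part_return`, and `run` are hypothetical names, not the backend's code, and it assumes the callback fires only when the count reaches zero and every header task has stored a result.

```python
class ToyChord:
    """Hypothetical in-memory stand-in for a chord counter row."""

    def __init__(self, task_ids):
        self.expected = set(task_ids)
        self.count = len(self.expected)  # header parts still outstanding
        self.finished = set()            # task ids that reported a result
        self.deleted = False
        self.callback_fired = False
        self.warnings = []

    def part_return(self, task_id):
        """Record one finished header part (possibly a duplicate)."""
        if self.deleted:
            # The counter is already gone, so all we can do is warn.
            self.warnings.append(f"chord not found for task {task_id}")
            return
        self.finished.add(task_id)
        self.count -= 1
        if self.count == 0:
            # Assumed rule: fire only if every header task has a result.
            if self.finished == self.expected:
                self.callback_fired = True
            self.deleted = True


def run(deliveries):
    """Feed a sequence of task completions into a fresh three-task chord."""
    chord = ToyChord({"t1", "t2", "t3"})
    for task_id in deliveries:
        chord.part_return(task_id)
    return chord


normal = run(["t1", "t2", "t3"])
duplicated = run(["t1", "t1", "t2", "t3"])  # t1 is executed twice
```

In this model the `normal` run fires the callback, while the `duplicated` run deletes the counter when `t2` returns, never fires the callback, and records a single warning when `t3` arrives.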

django_celery_results/backends/database.py

AllexVeldman

This makes sense. You could consider rephrasing the warning so it indicates why it's emitted instead of what broke. Something like "Chord %s executed more tasks than expected".

auvipy

beside the review comments, we need to make sure the CI is green

s-bessing · 2024-04-30T13:43:49Z

> beside the review comments, we need to make sure the CI is green

The tests seem to be wrong. The counter goes to 0 but the group was not done.

fixing unit test

AllexVeldman · 2024-05-01T13:04:51Z

t/unit/backends/test_database.py

+        chord_counter.refresh_from_db()
+        assert chord_counter.count == 1


I don't think this is correct: we have a header of 2, one task is done and the other fails.
If a task in the header fails, the chord is done and the callback is not called.

If this is the case, a ChordCounter object will remain in the db for every failed chord.

@AllexVeldman but if you have retry logic (at the code or worker level), all tasks are eventually successful, yet the callback is never triggered.

https://docs.celeryq.dev/en/latest/reference/celery.result.html#celery.result.AsyncResult.ready would suggest deps.ready() would be False in case of a retry, True in case of a failure like in this test.

So this test should call the path all the way to trigger_callback() and fail the Chord, otherwise a Chord can never fail.
It surprises me that this is not the case with your proposed changes, since trigger_callback should be True and deps.ready() should be True, so both trigger_callback() and chord_counter.delete() should have been called.
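The distinction between a retry and a failure can be sketched without importing Celery. The state names below are local string stand-ins, and the grouping is an assumption mirroring how `celery.states` defines its ready states (SUCCESS, FAILURE, REVOKED); `is_ready` and `group_ready` are hypothetical helpers, not Celery APIs.

```python
# Local stand-ins; assumed to mirror the ready states in celery.states.
READY_STATES = frozenset({"SUCCESS", "FAILURE", "REVOKED"})


def is_ready(state: str) -> bool:
    """A single result is ready once it reached a final state."""
    return state in READY_STATES


def group_ready(states) -> bool:
    """A group is ready only when every member result is ready."""
    return all(is_ready(state) for state in states)


failed_header = group_ready(["SUCCESS", "FAILURE"])   # one task failed
retrying_header = group_ready(["SUCCESS", "RETRY"])   # one task will retry
```

Under this assumption a header with a failed task counts as ready, so the chord can be failed and cleaned up, while a header with a task awaiting retry does not.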

OK, it took some time but I figured out why this change makes the test pass (which is still not correct).

It's because the MagicMock used for the request parameter to mark_as_done() and mark_as_failure() does not have ignore_result = False set. This in turn would make request.ignore_result evaluate to True, skipping saving the result, which is needed to have deps.ready() work properly. Since we did not rely on deps.ready() in the previous state this never surfaced as an issue.

Adding ignore_result = False to the MagicMock and reverting the test changes will fix the test.
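The MagicMock behaviour behind this can be reproduced in isolation. This is a minimal sketch using only `unittest.mock`; the variable names are illustrative and nothing here touches the backend.

```python
from unittest import mock

request = mock.MagicMock()

# An attribute that was never set is auto-created as another MagicMock,
# and MagicMock objects are truthy by default.
unset_is_truthy = bool(request.ignore_result)

# Setting the attribute explicitly gives the value the code under test expects.
request.ignore_result = False
explicit_is_falsy = not request.ignore_result
```

Any code that branches on `request.ignore_result` therefore takes the "ignore" path unless the test sets the attribute explicitly.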

AllexVeldman · 2025-02-28T15:53:44Z

@s-bessing I started to dig a little into how retries work, and on_chord_part_return is not called when a Task is marked for retry.

This works both on your branch and on main:

"""
Smoke test some Chord setups, including retries and failures.
"""

import os

from celery import Celery, chord
import django
from django.core.management import call_command

os.environ.setdefault('DJANGO_SETTINGS_MODULE', 'smoke.settings')

app = Celery(broker='redis://localhost:6379/2')
app.config_from_object('django.conf:settings', namespace='CELERY')


tries = 0

@app.task(name="add")
def add(x, y):
    return x + y

@app.task(name="fail", bind=True, max_retries=3)
def fail(self):
    """Retry 2x, return 5 on the third try"""
    global tries
    tries += 1
    if tries < 3:
        self.retry(countdown=5)
    return 5

@app.task(name="callback")
def callback(numbers):
    return sum(numbers)

if __name__ == "__main__":

    django.setup()
    call_command("migrate")
    from django_celery_results.models import ChordCounter

    header = [
        add.s(1,2),
        add.s(3,4),
        fail.s(),
    ]
    result = chord(header)(callback.s())
    assert ChordCounter.objects.count() == 1
    assert result.get(propagate=False) == 15
    assert ChordCounter.objects.count() == 0

So this PR adds nothing in the case of retries.

Could you elaborate a bit more on how you expect a task (with the same ID) to be executed twice, causing the callback to never be executed?

AllexVeldman · 2025-02-28T15:56:51Z

For completeness, this is the unit test I added to prove retries work on main:

    def test_on_chord_part_return_retry(self):
        """Test if the callback is executed if a task is retried"""
        gid = uuid()
        tid1 = uuid()
        tid2 = uuid()
        subtasks = [AsyncResult(tid1), AsyncResult(tid2)]
        group = GroupResult(id=gid, results=subtasks)
        self.b.apply_chord(group, self.add.s())

        chord_counter = ChordCounter.objects.get(group_id=gid)
        assert chord_counter.count == 2

        request = mock.MagicMock()
        request.id = subtasks[0].id
        request.group = gid
        request.task = "my_task"
        request.args = ["a", 1, "password"]
        request.kwargs = {"c": 3, "d": "e", "password": "password"}
        request.argsrepr = "argsrepr"
        request.kwargsrepr = "kwargsrepr"
        request.hostname = "celery@ip-0-0-0-0"
        request.periodic_task_name = "my_periodic_task"
        request.ignore_result = False
        result = {"foo": "baz"}

        self.b.mark_as_done(tid1, result, request=request)

        chord_counter.refresh_from_db()
        assert chord_counter.count == 1

        self.b.mark_as_retry(tid2, result, request=request)

        chord_counter.refresh_from_db()
        assert chord_counter.count == 1

        self.b.mark_as_done(tid2, result, request=request)

        with pytest.raises(ChordCounter.DoesNotExist):
            ChordCounter.objects.get(group_id=gid)

        request.chord.delay.assert_called_once()

s-bessing added 2 commits July 3, 2023 16:13

Ensure triggering of callback in chord

6e0aa80

If a task is, for whatever reason, executed twice, the callback is never triggered because the chord is deleted before the last task is executed. The last task then raises a warning that the chord can't be found.

Fixing line length

90344aa

auvipy self-requested a review July 6, 2023 05:19

auvipy requested changes Jul 6, 2023

django_celery_results/backends/database.py

auvipy closed this Nov 8, 2023

auvipy reopened this Nov 8, 2023

AllexVeldman approved these changes Apr 17, 2024

auvipy requested changes Apr 27, 2024

auvipy closed this Apr 27, 2024

auvipy reopened this Apr 27, 2024

s-bessing and others added 2 commits April 30, 2024 16:03

Merge branch 'celery:main' into patch-1

5c6b5df

update message

1b04c88

fixing unit test

s-bessing requested a review from auvipy April 30, 2024 15:45

AllexVeldman suggested changes May 1, 2024


s-bessing commented Jul 3, 2023 (edited)

AllexVeldman left a comment

auvipy left a comment

s-bessing commented Apr 30, 2024

AllexVeldman May 1, 2024

s-bessing Feb 13, 2025

AllexVeldman Feb 28, 2025

AllexVeldman Feb 28, 2025 (edited)

AllexVeldman commented Feb 28, 2025 (edited)

AllexVeldman commented Feb 28, 2025
