Issue Summary
Sometimes queries may get stuck "running" forever (until Redis expiration) after worker failures in Redash 10. Running the query again with no changes has no effect, as Redash thinks the job is already running.
Steps to Reproduce
I'm not sure if these steps are deterministic, but I was able to reproduce this consistently in our pre-production environment.
1. Refresh a dashboard with several queries that take more than a few seconds to run.
2. Forcibly kill the worker process.
3. Some of the queries on the dashboard are now in this stuck state.
Technical details:
In the network tab, you can see Redash polling the job endpoint and getting the query in the "Started" state. I double-checked that the "remove_ghost_locks" task was running, and from the logs it didn't remove these queries. It seems the main issue is in rq; looking at the recent changelogs, there are a bunch of improvements to error handling. Upgrading rq to rq==1.10.1 seemed to fix this issue.
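To confirm a job is orphaned rather than genuinely running, you can inspect rq's started-job registry directly in Redis. This is a minimal sketch, not part of Redash itself; the Redis URL and the "queries" queue name are assumptions that may need adjusting for your deployment:

```python
# Sketch: list jobs rq still considers "started" on the queries queue.
# Assumes the Redash worker queue is named "queries" and Redis runs locally.
from redis import Redis
from rq.job import Job
from rq.registry import StartedJobRegistry

redis_conn = Redis.from_url("redis://localhost:6379/0")  # adjust to your setup

registry = StartedJobRegistry("queries", connection=redis_conn)
for job_id in registry.get_job_ids():
    job = Job.fetch(job_id, connection=redis_conn)
    # A job whose worker was killed stays "started" until its Redis key expires
    print(job_id, job.get_status(), job.enqueued_at)
```

If a job still shows up here long after its worker died, the frontend will keep polling it in the "Started" state.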
Redash Version: 10.1
Browser/OS: Firefox/Linux
How did you install Redash: docker on kubernetes
We plan to update rq to pick up some of these improvements. Thank you for reporting that updating to 1.10.1 seemed to fix the issue for you 👌
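In the meantime, for anyone pinning the upgrade themselves, a quick way to confirm the worker image actually picked up the new version is to check the installed package metadata. A hedged sketch, not the official upgrade procedure:

```python
# Sketch: verify the rq version installed in the worker container.
from importlib.metadata import version

print("rq version:", version("rq"))  # expect 1.10.1 after the upgrade
```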