enh: add zero parameter download cli #96

fedorov · 2024-07-02T15:05:42Z

This should address most common usage scenario.

Follow up on #33.

This should address most common usage scenario.

vkt1414 · 2024-07-03T10:19:13Z

idc_index/cli.py

+        else:
+            item_ids = [generic_argument]
+        # this is a streamlined command, we will only check the first item, and will assume all other items are of the same kind
+        if client.index["collection_id"].str.contains(item_ids[0]).any():


I do not see a reason why we should not validate every value passed on. This may go back to the philosophical debate of whether we inform the user for passing some invalid values. In my opinion, we must. So I would change this to..

else: # Split the input string and filter out any empty values item_ids = [item for item in generic_argument.split(",") if item] if not item_ids: logger_cli.error("No valid IDs provided.") raise ValueError("No valid IDs provided.") index_df = client.index def check_and_download(column_name, item_ids, download_dir, kwarg_name): matches = index_df[column_name].isin(item_ids) matched_ids = index_df[column_name][matches].tolist() if not matched_ids: return False unmatched_ids = list(set(item_ids) - set(matched_ids)) if unmatched_ids: raise ValueError(f"Partial match for {column_name}: matched {matched_ids}, unmatched {unmatched_ids}") logger_cli.debug(f"Downloading from {column_name}") client.download_from_selection(**{kwarg_name: matched_ids, 'downloadDir': download_dir}) return True # Check for matches in each column and download if matches found if not ( check_and_download("collection_id", item_ids, download_dir, 'collection_id') or check_and_download("PatientID", item_ids, download_dir, 'patientId') or check_and_download("StudyInstanceUID", item_ids, download_dir, 'studyInstanceUID') or check_and_download("SeriesInstanceUID", item_ids, download_dir, 'seriesInstanceUID') ): logger_cli.error("None of the values passed matched any of the four UUIDs: collection_id, PatientID, StudyInstanceUID, SeriesInstanceUID.") raise ValueError("None of the values passed matched any of the four UUIDs: collection_id, PatientID, StudyInstanceUID, SeriesInstanceUID.")

If you think it's an overkill, I would at least add a raise exception at the end to catch if the first value passed does not match any of the four UUIDs

That's a good idea. Can you add this in a commit?

I do not like the exception though, I don't think it is helpful. I think console error is sufficient.

I do not know what a console error is. Also what do you not like about the exception message? Isn't it exactly what we trying to do and failing if it reaches the last else condition?

If I understand correctly, you simply wanted to remove raising exceptions but warn the user from logger? If so, I made the change now.

I do not know what a console error is.

I mean logger error.

Also what do you not like about the exception message? Isn't it exactly what we trying to do and failing if it reaches the last else condition?

Raising exception aborts the command line execution and prints execution stack. I see zero benefit in showing execution stack to the user. It is sufficient to let them know none of the identifiers was matched.

since we call IDCClient first, logging level will be set to info by default, as it is the level in index.py To get around it, we can set the desired log level in cli.py

- invoke download in sequence, to allow for mixing different kinds of identifiers - revisit logging output types to reduce console clutter - change default logging level to info

fedorov force-pushed the add-smart-download branch 5 times, most recently from c5c88c6 to 5f791c9 Compare July 2, 2024 21:47

enh: add zero parameter download cli

4370ebb

This should address most common usage scenario.

fedorov force-pushed the add-smart-download branch from 5f791c9 to 4370ebb Compare July 2, 2024 21:50

fedorov requested a review from vkt1414 July 2, 2024 21:53

vkt1414 reviewed Jul 3, 2024

View reviewed changes

vkt1414 and others added 5 commits July 3, 2024 20:38

enh: validate all values passed on to the cli

f224e33

enh: remove duplicates while displaying matches and unmatches

7ad96f0

enh: add logging level argument

c83501a

since we call IDCClient first, logging level will be set to info by default, as it is the level in index.py To get around it, we can set the desired log level in cli.py

enh: set the default log level to warning for download endpoint

1fed366

enh: minor adjustments of logic and cleanup of logging output

3338aa9

- invoke download in sequence, to allow for mixing different kinds of identifiers - revisit logging output types to reduce console clutter - change default logging level to info

fedorov merged commit fa56355 into main Jul 4, 2024
10 checks passed

fedorov deleted the add-smart-download branch July 4, 2024 16:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

enh: add zero parameter download cli #96

enh: add zero parameter download cli #96

fedorov commented Jul 2, 2024

vkt1414 Jul 3, 2024 •

edited

Loading

fedorov Jul 3, 2024

vkt1414 Jul 3, 2024

vkt1414 Jul 4, 2024

fedorov Jul 4, 2024

enh: add zero parameter download cli #96

enh: add zero parameter download cli #96

Conversation

fedorov commented Jul 2, 2024

vkt1414 Jul 3, 2024 • edited Loading

Choose a reason for hiding this comment

fedorov Jul 3, 2024

Choose a reason for hiding this comment

vkt1414 Jul 3, 2024

Choose a reason for hiding this comment

vkt1414 Jul 4, 2024

Choose a reason for hiding this comment

fedorov Jul 4, 2024

Choose a reason for hiding this comment

vkt1414 Jul 3, 2024 •

edited

Loading