Added tests for format module #7

mattheww95 · 2024-04-04T21:04:21Z

I have added initial tests for the format module verifying outputs, it appeared that the outputs that were passed as examples to the pipeline differed from those generated. It seemed some column headers were mangled and there may have been differences in fields generated. I have updated the test data to reflect the more recent version of locidex

kbessonov1984 · 2024-04-09T15:56:02Z

locidex/example/format_db_mlst_out/results.json

-        "input": "/Users/jrobertson/PycharmProjects/locidex/locidex/example/format_db_mlst_in",
-        "outdir": "/Users/jrobertson/PycharmProjects/locidex/locidex/example/format_db_mlst_out",
+        "input": "locidex/example/format_db_mlst_in",
+        "outdir": "/tmp/pytest-of-mwells/pytest-115/build0",


I was wondering if you can use a global variables for paths instead of absolute paths for better portability such as TEST_ROOT = os.path.dirname(__file__). Take a look at https://github.com/phac-nml/mob-suite/blob/master/mob_suite/tests/test_mobtyper_vs_biomarker_report.py

kbessonov1984 · 2024-04-09T15:57:41Z

tests/test_format.py

+from dataclasses import dataclass
+
+EXPECTED_DATA_OUT = "locidex/example/format_db_mlst_out"
+TEST_DATA = "locidex/example/format_db_mlst_in"


You can also even make it more general via __file__ such as PACKAGE_ROOT = os.path.join(os.path.dirname(mob_suite.__file__),/example/...

kbessonov1984 · 2024-04-09T15:59:39Z

tests/test_format.py

+def test_format(cmd_args):
+    format.run(cmd_args)
+
+def test_outputs(output_directory):


Not sure what is the purpose of this function? I suspect it is to make sure the listed files are in the same order. Also on Mac or other OS there might be hidden files such as .DS_Store that could break this assertion

kbessonov1984 · 2024-04-09T16:01:06Z

tests/test_format.py

+        "force"
+    ]
+    result_json = "results.json"
+    expected = os.path.join(EXPECTED_DATA_OUT, result_json)


I think the actual vs expected absolute paths need an assertion statement here so the error is more cleaner if they happen do differ

kbessonov1984 · 2024-04-09T16:03:27Z

tests/test_format.py

+    expected = os.path.join(EXPECTED_DATA_OUT, result_json)
+    actual = os.path.join(output_directory, result_json)
+    with open(actual, 'r', encoding='utf8') as act, open(expected, 'r', encoding='utf8') as expc:
+        act_js = json.load(act)[primary_key]


I would better first test if the key is found in the JSON loaded object and display more cleaner error message instead of key not found error. Also it would not be clear if tis not found in actual vs expected paths.

if key_to_check in json.load(act).keys(): print(f"Key '{key_to_check}' exists.") else: print(f"Key '{key_to_check}' does not exist.")

mattheww95 added 2 commits April 4, 2024 16:00

mattheww95 requested a review from kbessonov1984 April 4, 2024 21:04

kbessonov1984 reviewed Apr 9, 2024

View reviewed changes

mattheww95 merged commit 4e38102 into tests Apr 23, 2024

mattheww95 deleted the test/format branch April 25, 2024 16:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added tests for format module #7

Added tests for format module #7

mattheww95 commented Apr 4, 2024

kbessonov1984 Apr 9, 2024

kbessonov1984 Apr 9, 2024

kbessonov1984 Apr 9, 2024

kbessonov1984 Apr 9, 2024

kbessonov1984 Apr 9, 2024

Added tests for format module #7

Added tests for format module #7

Conversation

mattheww95 commented Apr 4, 2024

kbessonov1984 Apr 9, 2024

Choose a reason for hiding this comment

kbessonov1984 Apr 9, 2024

Choose a reason for hiding this comment

kbessonov1984 Apr 9, 2024

Choose a reason for hiding this comment

kbessonov1984 Apr 9, 2024

Choose a reason for hiding this comment

kbessonov1984 Apr 9, 2024

Choose a reason for hiding this comment