Record message_ix version in scenario data & GDX #765

khaeru · 2023-12-11T21:29:23Z

Closes #747.

Housekeeping:

Close Split tutorial tests to separate CI job #739. This appears to bring the job time down to:
- 4:40–10 minutes for the tests —so the first indication of success/failure comes in under 5 minutes, which is more fit for purpose.
- 8–12 minutes for the tutorials.
Update older tests. The above change caused some of these to begin to fail, which makes me suspect they rely on state leaking from the tutorial tests or are otherwise fragile. The updates should ensure the tests are complete.
- test_equation_NEW_CAPACITY_CONSTRAINT.py::test_new_capacity_up: fix typo in the file name. Clone the subject scenario to a unique URL, so that GDX I/O files do not collide even if the two test cases are run in parallel.
- test_integration.py, test_macro.py: similarly clone to unique URLs.
- Expand some flaky marks from Handling flaky tests #731 that were previously scoped to macOS only, to now apply to all platforms:
  - test_feature_bound_activity_shares.py::test_add_bound_activity_up_all_modes
  - test_feature_price_emission.py::test_custom_type_variable_periodlength
  - test_integration::test_multi_db_run.
Advertise and test Python 3.12 support.
Update copyright year to 2024.

How to review

Read the diff and note that the CI checks all pass.

PR checklist

Merge required changes upstream: Record Python package versions in GDX files ixmp#502.
Continuous integration checks all ✅
Add or expand tests; coverage checks both ✅
Add, expand, or update documentation.
Update release notes.

codecov · 2023-12-11T21:37:56Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (f7be336) 95.2% compared to head (134b1cb) 95.2%.

Additional details and impacted files

@@          Coverage Diff          @@
##            main    #765   +/-   ##
=====================================
  Coverage   95.2%   95.2%           
=====================================
  Files         46      46           
  Lines       4312    4335   +23     
=====================================
+ Hits        4107    4130   +23     
  Misses       205     205

Files	Coverage Δ
message_ix/macro.py	`96.7% <100.0%> (ø)`
message_ix/models.py	`99.0% <100.0%> (ø)`
message_ix/report/__init__.py	`100.0% <ø> (ø)`
message_ix/testing/__init__.py	`99.6% <100.0%> (+<0.1%)`	⬆️
message_ix/tests/conftest.py	`100.0% <100.0%> (ø)`
message_ix/tests/test_cli.py	`100.0% <100.0%> (ø)`
message_ix/tests/test_core.py	`100.0% <100.0%> (ø)`
..._ix/tests/test_equation_NEW_CAPACITY_CONSTRAINT.py	`100.0% <100.0%> (ø)`
...age_ix/tests/test_feature_bound_activity_shares.py	`100.0% <100.0%> (ø)`
message_ix/tests/test_feature_price_emission.py	`100.0% <ø> (ø)`
... and 5 more

khaeru · 2023-12-12T13:03:14Z

@glatterf42 FYI, looking at the recent commits on the branch, I am finding that for some apparently flaky tests, it helps to .clone(scenario=request.node.name) first.

I think this is because we are using pytest-xdist: where this results in 2+ tests with identical model name and scenario names being run in parallel, then strange effects can occur; for instance, one test reads and sees the result from the GDX file of the other test. (This occurs because message_ix tries to heed users' desire to keep GDX I/O files all in the same directory, rather than separate, temporary directories per-run as the base ixmp GAMSModel class does.)

request.node.name is from the Pytest request fixture, and gives a string that is unique to each test—including individual cases of parametrized tests. It's the easiest way to get such a unique label.

We could later (no urgency) try this approach to address other apparently flaky tests.

glatterf42 · 2023-12-12T13:13:57Z

Very useful catch, would be great to remove some flaky markers :)

glatterf42 · 2024-01-10T07:16:40Z

Just rebasing onto current main.

- Clone the subject scenario to a unique scenario name, so that GDX I/O cannot collide even when tests are run in parallel. - Rename file: fix typo in file name. - Use snake_case for function name. - Expand parametrized args with distinct names. - Use make_df(). - Remove redundant .set_index(). - Reflow docstring.

glatterf42

LGTM, thanks :)

glatterf42 · 2024-01-11T07:14:54Z

Rebased after dependabot update to main: are we just seeing flakiness in these tests?

glatterf42 · 2024-01-11T07:52:44Z

Looks like flakiness; are we merging regardless and clean up/apply markers later if needed?

khaeru · 2024-01-11T07:57:27Z

are we merging regardless and clean up/apply markers later if needed?

I would say the latter.

I have adapted several tests in this PR to ensure they use unique model/scenario names; this means there is no chance that another scenario being run in another pytest-xdist worker thread will try to access the same scenario in the database or in a GDX file. However, this is different from doing a full sweep through the test suite or adapting fixtures to be sure that no test has this fragility. We can do that in a subsequent PR.

khaeru added the enh New features & functionality label Dec 11, 2023

khaeru added this to the 3.8 milestone Dec 11, 2023

khaeru self-assigned this Dec 11, 2023

khaeru force-pushed the issue/747 branch 2 times, most recently from f930098 to 3f3419d Compare December 11, 2023 23:21

khaeru force-pushed the issue/747 branch from 260b1eb to af5e25d Compare December 12, 2023 13:05

glatterf42 force-pushed the issue/747 branch from af5e25d to 5410828 Compare January 10, 2024 07:10

khaeru force-pushed the issue/747 branch 4 times, most recently from f56ca4e to 3f378b2 Compare January 10, 2024 12:07

khaeru requested a review from glatterf42 January 10, 2024 16:09

khaeru added 14 commits January 11, 2024 07:37

Set GAMSModel.record_version_packages

8e7251c

Pass ixmp_version through GAMS to output GDX

9eeee8e

Address Sphinx nitpick

19e4e72

Test ixmp_version is present in GDX I/O files

f9af772

Avoid leaking config state from test_copy_model()

4929d13

Add "tutorial" job in "pytest" CI workflow

523726c

Quiet traitlets logging from test_tutorial()

b27af9a

Set PYDEVD_DISABLE_FILE_VALIDATION=1 in tutorial tests

64eb377

Update deprecated genno.Key usage in .util.tutorial

77378e3

Make some flaky marks cross-platform

0e4d6a0

Ensure unique scenario names in test_feature_bound_activity_shares

d3fdbe6

Isolate test_{integration,macro} tests from parallel runs

4ff7258

Support and run CI on Python 3.12

014dcfb

khaeru added 9 commits January 11, 2024 07:37

Support unique scenario name from make_westeros()

867d2f7

Use distinct name in test_soft_constraint

5bd90ee

Use distinct name in test_add_model_data()

f8a6952

Use distinct names in test_feature_bound_activity_shares

33e95fd

Use distinct name in dantzig_reporter fixture

309424f

Mark test_multi_db_run as flaky on all platforms

4e10bef

Don't re-use scenario name in .macro.calibrate()

e677377

Add #767 to release notes, docs

134b1cb

glatterf42 force-pushed the issue/747 branch from 5343277 to 134b1cb Compare January 11, 2024 06:37

glatterf42 approved these changes Jan 11, 2024

View reviewed changes

glatterf42 merged commit 0f78fe8 into main Jan 11, 2024

glatterf42 deleted the issue/747 branch January 11, 2024 08:46

This was referenced Jan 12, 2024

Flaky tests and pytest-xdist #776

Closed

Make pytest use one process for CI iiasa/ixmp#510

Merged

Address tests that are flaky with pytest-xdist iiasa/ixmp#512

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Record message_ix version in scenario data & GDX #765

Record message_ix version in scenario data & GDX #765

khaeru commented Dec 11, 2023 •

edited

Loading

codecov bot commented Dec 11, 2023 •

edited

Loading

khaeru commented Dec 12, 2023

glatterf42 commented Dec 12, 2023

glatterf42 commented Jan 10, 2024

glatterf42 left a comment

glatterf42 commented Jan 11, 2024

glatterf42 commented Jan 11, 2024

khaeru commented Jan 11, 2024

Record message_ix version in scenario data & GDX #765

Record message_ix version in scenario data & GDX #765

Conversation

khaeru commented Dec 11, 2023 • edited Loading

How to review

PR checklist

codecov bot commented Dec 11, 2023 • edited Loading

Codecov Report

khaeru commented Dec 12, 2023

glatterf42 commented Dec 12, 2023

glatterf42 commented Jan 10, 2024

glatterf42 left a comment

Choose a reason for hiding this comment

glatterf42 commented Jan 11, 2024

glatterf42 commented Jan 11, 2024

khaeru commented Jan 11, 2024

khaeru commented Dec 11, 2023 •

edited

Loading

codecov bot commented Dec 11, 2023 •

edited

Loading