-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Zarr commit date is AFTER the last-modified of new/pending Zarr entry #66
Comments
@yarikoptic The commit timestamp for Zarr backups is the latest I'm not sure how this situation happened, but your theories don't seem right.
Not true; no metadata about local files is sent to S3 when uploading a Zarr entry.
The actual commit date (not the author date, which we fake) of the |
@yarikoptic Looking into this further, I think the original backup of the Zarr errored out somehow partway through. For one thing, the actual Zarr has a bunch of files that aren't in the backup, and the |
Note that |
why do you think so? my counter-argument is that if it errorred out, there would be no commit, right?
good observation, FTR
but wouldn't it be just indicative of some bug instead of backup "errored out"? FTR, in my mailbox I have indicators of finding that dandiset dirty only starting Nov 07
and continues until Nov 18 (2024-11-18T21:02:54-0500) while the most recent commit
so somehow we (I) addressed it that day, likely through FWIW, I do not see anything suspicious among admin emails from drogon for Oct 29 (like out of space etc). For Oct 29 we have the single log with matching that zarr
and in that log we run into issues with annex starting with
but that is long after that suspicious commit at Date: |
@yarikoptic I don't know how that "empty" commit with an inaccurate commit message happened, but the "AssertionError: feed_data after feed_eof" is an |
Long story with conclusion summarized in subject line:
In attempt to provide temporary remedy for
I reran our tool with
--mode force
which ignored differences in metadata which happened without modifications to time stamp. Then I reran with--mode verify
which still failed withthe single error for a single dandiset 001246 zarr
And that zarr is marked as "Complete"
The timestamp for the draft version of the dandiset is 2024-10-29T19:39:37.231865Z
and for that .zattrs
and we have
and in the zarr
and that .zattrs has
So, we have a commit time of 28:10 minutes, but the last-modified of the .zattrs (which was not yet added to that commit) is 27:41 minutes. How could that happen? How exactly do we decide on commit date for zarrs - is it based on last-modified?
I could hypothesize/propose:
wdyt @jwodder?
The text was updated successfully, but these errors were encountered: