-
Notifications
You must be signed in to change notification settings - Fork 131
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DistributeFilesDataset _num_shards issue #1678
Comments
As you told me, before, you used a RETURNN version from 2024-07, where it was working fine. |
What is the dataset config? What is the training config (distributed setting)? |
the dataset config is like this
|
The issue was likely introduced in #1630, which added the |
For the latest RETURNN, when I use DistributeFilesDataset, I have this error.
The DistributeFilesDataset is inherited from CachedDataset2, which is again inherited from Dataset, the the _num_shards should be set to 1 in the init function. I am not sure how self._num_shards is changed to num of gpus in my case.
(cc @NeoLegends, @michelwi)
The text was updated successfully, but these errors were encountered: