-
Notifications
You must be signed in to change notification settings - Fork 359
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HTTP input file names not respected in execution VM #4184
Comments
The hash failures are expected with http inputs and should not be the cause of your workflow failure. Also we don't currently support |
Thanks much for the helping with debugging on this. Beyond the hash failure from Cromwell the other errors I get are all from the workflow itself due to not preserving the original file names. The numerical hashes for files get passed directly into the downstream tools, stripping off any extensions or other identifying information. This results in tool confusion, like tabix can't tell a file wasn't already gzipped:
or bwa can't find all the other associated indices:
Is it expected to lose the original input file names when passing through the pipeline. A lot of tools are sensitive to these and this might be the underlying issue. Regarding the configuration, without
I was trying to lift off how things were done with the Google/gcp resolution so added it in there to fix this issue. Is there a different configuration approach I should be using? Thanks again for helping with this. |
Hi Brad In addition to the |
Thanks for this help with the configuration. I'm not intentionally overwriting the global https://gist.github.com/chapmanb/72c6bf2d8282412b252f6192968b17cf I appreciate all the help debugging this. |
Within your |
@cjllanwarne Did you try that or are you making an educated guess? I haven't yet seen anyone need to do that, although AFAIK (educated guess myself) the |
Chris, thanks for the idea. I tried this and unfortunately had the same issue. From the behavior it looks like http is working in that the files get downloaded, but they don't get proper naming with numerical names instead of the expected file names. This disconnect seems to be what causes issues when passing these on to the CWL tools. |
Hi @chapmanb - sorry for the delay in responding here. I was able to get http inputs to work in CWL against a default (ie no custom config specified) instance of Cromwell in server mode. The test case I used is in the linked PR (#4392) I wonder whether you could confirm:
|
Chris; Here's the example output I get from running this that demonstrates the file naming issue:
This input should be called In terms of the test cases, it would be great if it also checked that the file extension and name get preserved. Thanks again for looking at this. |
Is there a fix in the latest development for this issue? I'm still stuck on this so not sure if I missed something. |
I was wondering the same. @rebrrown1395 it’s not obvious why this issue was closed and as it’s from an external user it should be explained prior to closing |
@geoffjentry and @chapmanb, my apologies this wasn't supposed to be closed! |
Hello! Is there an update or workaround on this? |
Hi all;
In testing release 35 with CWL inputs I've also been looking at supporting remote URL references. This is working correctly for GS URLs but not for http URLs. I've put together a test case that demonstrates the problem:
https://github.com/bcbio/test_bcbio_cwl/tree/master/gcp
The
somatic-workflow-http
CWL workflow uses http URLs and doesn't work, while the comparablesomatic-workflow
CWL workflow uses GS URLs referencing the same data and does work.The workflow fails with:
when running tasks. The files get downloaded to the input directories but get numerical values instead of the original file names so never seem to sync over and get translated correctly to the workflow
My configuration is:
Am I doing anything wrong with my configuration or setup that I could tweak? Thanks so much for any pointers/suggestions.
The text was updated successfully, but these errors were encountered: