Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Warning: Expected one CSV file... #59

Open
vxnuaj opened this issue Feb 19, 2025 · 1 comment
Open

Warning: Expected one CSV file... #59

vxnuaj opened this issue Feb 19, 2025 · 1 comment

Comments

@vxnuaj
Copy link

vxnuaj commented Feb 19, 2025

Hey! Consistently getting the following error when running python3 get_spider2snow_submission_data.py --experiment_suffix gpt-4o-mini-test1 --results_folder_name ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1

Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_bq416, found 0
Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_bq096, found 0
Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_local039, found 0
Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_bq444, found 0

...

Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_ga014, found 0
Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_ga013, found 0
Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_bq081, found 0
Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_bq203, found 0
Warning: Expected one CSV file in ../../spider2-snow/evaluation_suite/gpt-4o-mini-test1/sf_bq211, found 0

Wondering where these CSV files were supposed to come from?

Pretty sure this is causing erroneous eval of 4o-mini, as

Final score: 0.0, Correct examples: 0, Total examples: 281
Real score: 0.0, Correct examples: 0, Total examples: 547

doesn't seem right.

@vxnuaj
Copy link
Author

vxnuaj commented Feb 19, 2025

Update -- Ran it on the example submissions folder, went all smooth -- it seems that the final eval seems to be valid and that 4o-mini just sucks... lmao

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant