Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ingest extra sources #35

Closed
wants to merge 6 commits into from
Closed

Conversation

jameshadfield
Copy link
Member

@jameshadfield jameshadfield commented May 16, 2024

See commit messages for more. This PR was used to update the live cattle-flu genome build.

I'm not thrilled with the complexity added here, and we're scheduled to discuss this tomorrow (internally) so let's wait until after that to merge.

The default line endings for `csv.DictWriter` are CRLF (amazingly)

<https://docs.python.org/3/library/csv.html#csv.Dialect.lineterminator>
in preparation for the subsequent commit which will add another ingest
source
This adds a new ingest "source", namely hardcoded source-data which
is not yet available from the existing two sources. The point of this is
to produce up-to-date whole genome builds, so we tailor the ingest
for this purpose. From internal discussion¹² we are aiming to keep
this data separate to the fauna-derived (and S3 uploaded) dataset.

¹ <https://bedfordlab.slack.com/archives/CD84ELG0N/p1715729994271819>
² <https://bedfordlab.slack.com/archives/CD84ELG0N/p1715731530506199>
@jameshadfield jameshadfield force-pushed the james/ingest-extra-sources branch from 1bfefea to bb27e01 Compare May 16, 2024 02:32
@jameshadfield jameshadfield mentioned this pull request May 16, 2024
@trvrb
Copy link
Member

trvrb commented Jun 3, 2024

With addition of NCBI ingest in PR #40, we now grab this milk sequence directly from GenBank. We can close this PR unmerged. Thank you so much @jameshadfield for the putting in the speedy work on this when it was needed.

@trvrb trvrb closed this Jun 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants