Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FASTA database download not accessible as FAIR data #16

Open
peterjc opened this issue Jul 6, 2022 · 2 comments
Open

FASTA database download not accessible as FAIR data #16

peterjc opened this issue Jul 6, 2022 · 2 comments

Comments

@peterjc
Copy link

peterjc commented Jul 6, 2022

I am not finding the FASTA download to be very accessible in the meaning of FAIR data, see e.g.https://en.wikipedia.org/wiki/FAIR_data

I would like to have a simple easily discoverable URL to download specific releases of the OomyceteDB FASTA file, such as could be used with curl or wget or any other programmatic approach.

As far as I can tell, the current website requires a human to click things to download a version of the database as a FASTA file:

  1. Goto http://oomycetedb.cgrb.oregonstate.edu/search.html or directly to the Shiny page at http://oomy.cgrb.oregonstate.edu:3838/grunwald/OomyceteDB_dev/search
  2. Click on a release e.g. 1, dated 2021-03-01, with 885 sequences, comment "First release."
  3. Click on "Download database"

Cross reference #15 for the problem of variable filenames from this procedure.

Options include keeping the FASTA file directly in git on GitHub (perhaps on a separate repository to the database website if you are worried about the history size over time), using public archives which assign a DOI like Zenodo or Data Dryad, or perhaps allowing direct access via URL redirection to the FASTA files on the server as data/releases/*.fa instead?

For example, there appears to be an accidentally committed pre-version 1 of the database at https://github.com/grunwaldlab/OomyceteDB/blob/master/website/2020-10-22_release_1_rps10.fasta which makes the following possible in a script etc:

echo "Downloading rps10 reference FASTA file"
wget "https://github.com/grunwaldlab/OomyceteDB/raw/master/website/2020-10-22_release_1_rps10.fasta"
@zachary-foster
Copy link
Contributor

This is a good point, thanks! We are working on a new version of the website for hosting the database and will make sure that it can support scripted downloaded with static links.

@peterjc
Copy link
Author

peterjc commented Jul 20, 2022

Thank you - this would be a great addition as part of any reworking of the website.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants