Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add busco-prok with bacteria_odb #854

Merged
merged 4 commits into from
Jan 25, 2024
Merged

Conversation

Kincekara
Copy link
Collaborator

Closes #848
I left prokaryotes only tools to make a smaller image.
Busco uses the working directory to search and download lineage sets. So I added bacteria linage into the data folder.

Pull Request (PR) checklist:

  • Include a description of what is in this pull request in this message.
  • The dockerfile successfully builds to a test target for the user creating the PR. (i.e. docker build --tag samtools:1.15test --target test docker-builds/samtools/1.15 )
  • Directory structure as name of the tool in lower case with special characters removed with a subdirectory of the version number (i.e. spades/3.12.0/Dockerfile)
    • (optional) All test files are located in same directory as the Dockerfile (i.e. shigatyper/2.0.1/test.sh)
  • Create a simple container-specific README.md in the same directory as the Dockerfile (i.e. spades/3.12.0/README.md)
    • If this README is longer than 30 lines, there is an explanation as to why more detail was needed
  • Dockerfile includes the recommended LABELS
  • Main README.md has been updated to include the tool and/or version of the dockerfile(s) in this PR
  • Program_Licenses.md contains the tool(s) used in this PR and has been updated for any missing

@Kincekara
Copy link
Collaborator Author

@erinyoung, do you want any more changes to this image? I left this as a draft for now.
I think we shouldn't write this to the latest tag.

# offline test
RUN wget -q https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/010/941/835/GCA_010941835.1_PDT000052640.3/GCA_010941835.1_PDT000052640.3_genomic.fna.gz && \
gzip -d GCA_010941835.1_PDT000052640.3_genomic.fna.gz && \
busco --offline -l bacteria_odb10 -m genome -i GCA_010941835.1_PDT000052640.3_genomic.fna -o offline --cpu 4 && \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like that you include an offline test to make sure that it's working

@erinyoung
Copy link
Contributor

I think this looks great! I don't have any changes to recommend, except for adding it to the main readme.

I think it's okay if the other, smaller version being the default 'latest' tag.

@Kincekara Kincekara marked this pull request as ready for review January 25, 2024 20:18
Copy link
Contributor

@erinyoung erinyoung left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this looks good.

I'm going to merge this PR.

I think we can use tag '5.6.1-prok-bacteria_odb10_2024-01-08', but it's kind of long.

Is that the tag you had in mind?

@erinyoung erinyoung merged commit fcbcf6c into StaPH-B:master Jan 25, 2024
2 checks passed
@Kincekara
Copy link
Collaborator Author

I couldn't find a name that is short and definitive, unfortunately. If you have something in your mind, we can change it.

@erinyoung
Copy link
Contributor

Alright. I'm going to use that tag then.

You can check the status of the deploy at https://github.com/StaPH-B/docker-builds/actions/runs/7660600410

Thank you for putting this together!

@Kincekara Kincekara deleted the busco-slim branch January 26, 2024 13:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Container Request]: BUSCO lite with bacteria_odb
2 participants