-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error annotating pangenome #285
Comments
Hello, It seems like the error you're encountering might be related to how we're handling metadata on our end. Starting from version v2.1.0 (PR #227), some information about contigs/genomes is extracted from annotation files and included as metadata in the pangenome file. In your case, it looks like some of the extracted metadata isn't being written correctly to the pangenome file, which is causing the error. The intended behavior is that if any metadata can't be written, it should be ignored, and a log is printed instead. Would you be able to share the annotation files you used with us, assuming they're not sensitive? It would help us investigate further. Thanks for bringing this to our attention! |
All of the files are publically released so was collecting the information to send to you, but thought I might give ppanggolin a try with the public genbank files as they are all PGAP annotated and the ones we generated were annotated by RAST. The PGAP annotation files are working fine, the RAST annotation ones are failing. The annotation is just slightly different enough to be causing the error. I think this also might be the root of the issue I wrote with the translation table default (#226). The single file that was failing the entire process at that time was RAST annotated. I also tried with fasta files and those work well, but I do need the genes annotated with product names. As an aside, is there gene product annotation that occurs with this feature, the result doesn't have this and I don't know if I am missing something. |
When using ppanggolin with FASTA files, it won't give you any gene product annotations. It just runs prodigal to call the genes, and that's it. So if you require gene product annotation you would have to use annotation files as input. Would you mind sharing a RAST-annotated genome that’s causing the issue? Just one problematic genome file that causes the error would be enough and very helpful for us to debug. It's probably something specific in the RAST output that we haven't considered for yet. |
Archive.zip |
Thank you so much for providing the files! They were very helpful. The issue arises from the In the meantime, if you need to run PPanGGOLiN with these types of files, you can use the git clone --branch fix_genome_metadata_handeling https://github.com/labgem/PPanGGOLiN.git
cd PPanGGOLiN
pip install . |
The fix has been release in version 2.2.0 . |
Hello!
I am getting an error trying to write the pangenome. It is odd because I have two different datasets and it is only one that throws this error. I hesitated writing this issue as it may be due to some configuration of my data, but have thus far been unable to figure what could be different and what could be causing the problem.
I have ppanggolin 2.1.1 installed in the computing cluster installed from source. I am trying to annotate using the --anno option:
ppanggolin annotate --anno ../2024.09.16__run1test2_.txt -o ../ppgg_run1
It starts fine and runs up until writing the genome metadata.
And then throws this error:
Thank you in advance.
The text was updated successfully, but these errors were encountered: