-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error while retriving HOGIDs #46
Comments
Hi Dhanuush, could you let us know how you try to get the sequences for a given HOG-id? are you using the OMA api? also, it could be helpful if you could tell when the requests were made, so I can check in the logs what goes wrong. Regarding the second question: yes, the HOG-ids will change with every OMA realease update. The letter in the HOG-id (currently Best, Adrian |
Yes, using API. I will also share the snippet for clarity `from omadb import Client random.seed(0) #write fasta file (nucleotide sequences) with open(fasta_file, "w") as outfile:
outfile.close()` We ran this in December (around 12th/13th Dec). I ran it again in january this week, it gave me similar errors. Okay, thanks. But can I know what is the current version called? (the E version). Best, |
Dear Dhanuush, it looks like you enforce the Tetrapoda level for retrieving the HOG members, but several of these HOGs don't go up the species tree up to Tetrapoda (e.g. 'HOG:E0732584' goes up to Amniota, or 'HOG:E0718996' goes up to Myomorpha). How comes that you expect the gene to exist at the Tetrapoda level (you didn't try to map it with OMAmer to that level only, right?). If you want to ensure you don't select a member gene from the HOG that is too distant from your species, you could first check if the hog reaches a certain level and fetch the members of that level or the rootlevel otherwise:
Of course, you could also iterate the Tetrapoda over the whole lineage of your species of interest, e.g. first try more recent levels and go up until you find one. The current OMA release (with HOG-letter
We update the OMA browser usually 1-2 per year (nowadays rather one time a year). Cheers Adrian |
Hi Adrain, Right! Now I see where the problem is. Thanks much for the information. I forgot to mention something. I am trying this out with several species (Pleurodeles, axolotl, xenopus, mouse, humans, chicken, etc). I do not have this error in pleurodeles, axolotl and xenopus (in the basal tetrapods) and I see these in mouse humans and other higher level tetrapods. While this makes sense, I am wondering why aren't the basal tetrapods that I am looking at do not have these errors ( As in why are the HOG_IDs which are not in tetrapods not showing up in the missing genes in these species). I will try to check if there is anything wrong with the first step while performing the proteome assessment. Cheers, |
Hi Dhanuush, as a guess, it could be simply because in OMA we don't have that many basel tetrapods species included. so there are fewer level in between the extant species and the tetrapod level, at which we could infer the gene gains (i.e. root level of hog). only 3 out of 117 species are non amniota species. Cheers |
i am successfully able to retrive the list by while trying to use the HOG IDs to get the sequence of a related HOG, I face "500 Server Error: Internal Server Error " for some HOG IDs (not all).
I would like to know if there are any internal errors in the server? It would be great if you could help resolve my problem.
Sincerly,
Dhanuush.
The text was updated successfully, but these errors were encountered: