Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Help understanding the output txt #10

Open
syedzayyan opened this issue Feb 23, 2023 · 6 comments
Open

Help understanding the output txt #10

syedzayyan opened this issue Feb 23, 2023 · 6 comments

Comments

@syedzayyan
Copy link

Hello!

So I ran the model on 130ish compounds to test and then tested 5 compounds. I have an output that is like this:

1 3
2 57

I don't really understand what these numbers really mean. So is it that it selected 1 and 2 from the 5 compounds? Then what do the 3 and 57 mean? The number of conformation that fit?

@meddwl
Copy link
Owner

meddwl commented Feb 23, 2023

Hello!

The first number is stereoisomer id, the second is conformation id.
Knowing this information you can find 3D representations of compounds in your SDF file.

Thanks for the feedback on the difficulties in understanding the results. We are already preparing a new, more comfortable version of this tool.

If you have any more questions, I will gladly answer them.

@syedzayyan
Copy link
Author

Just to be clear, those are the compounds from the screen test in those conformations that got selected as an active compound candidate right?

So when doing docking to confirm binding, those selected conformations would be the conformation to select?

Apologies for the silly question, I am a beginner in this world.

@DrrDom
Copy link
Collaborator

DrrDom commented Mar 1, 2023

It is assumed that these are conformations in which those compounds should bind to the target, but it is not 100% necessary. Doing docking you may use this conformation as a starting one or a random conformation. Nevertheless, the starting conformations will be changed. In some cases output docking conformation will correspond to a conformation retrieved by a pharmacophore model, but again there is no guarantee.

@syedzayyan
Copy link
Author

Thank you so much for the replies. Helped me understand a ton.

@syedzayyan
Copy link
Author

There are many output files in the "screen_results". Initially, I was under the impression that they were all the same since I tried with a small database and had only two duplicate files. But with a large dataset, it has many files, with SMILES id and conformer ID. Why so many files and what do these mean?

@syedzayyan syedzayyan reopened this Mar 22, 2023
@syedzayyan
Copy link
Author

Also, the paper mentions that centroids of clusters are selected for training, and the rest are for testing. But in this search version is there any test set or are all ligands in a cluster selected?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants