Skip to content

Estimate number of UniProtKB entries that will have an associated GO-CAM model? #109

Answered by kltm
dlrice asked this question in Q&A
Discussion options

You must be logged in to vote

This is likely a good guesstimate, but I'd note that the use of gocam-gps.json is definitely going to be stable as it is strictly internal (hidden) and changeable resource.

Another way to estimate (at least for now) would be grabbing the internally intended noctua_uniprotkb.gpad.gz at https://snapshot.geneontology.org/products/upstream_and_raw_data/ and running through it real fast.

As a quick one-off (I did not check):

grep -v '^!' noctua_uniprotkb.gpad | cut -f 12 | grep -E -oh "gomodel\:[[:alnum:]]*_?[[:alnum:]]*_?[[:alnum:]]*" | sort | uniq -c | wc -l
1242

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@dlrice
Comment options

Answer selected by dlrice
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants