-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Provide an admin facility to add articles using biopax file from url #881
Conversation
metincansiper
commented
Oct 14, 2020
•
edited
Loading
edited
- Works with biopax_to_factoid branch of factoid-converters.
You could do a diff on
|
@metincansiper & @jvwong : To test this, it might be simplest to merge this into unstable so that this can be tried out on unstable.factoid.baderlab.org. The unstable instance should use an environment variable to use the latest @metincansiper, do you have a publicly-accessible instance of the latest |
An element may have to be complete ( |
Thank you, the problem has been fixed after calling |
No. Is the factoid-converters deployed automatically to "http://biopax.baderlab.org/convert" from either master or unstable of factoid-converters? |
One other thing is what must be the email address used here (eb1564d#diff-24330a3dafb9473fe683be4b0a77ec948baf915b8a675a9fd6981009332aa77aR91). I did not want to assign it to the pc help email address to avoid the email that would be sent. I wanted to use an address most probably non-existing like The other question is should we avoid sending emails in a way when multiple articles are created through a biopax file url. |
The document creation route could have a new parameter (i.e. `email: true | false`) to control whether emails are sent. We may need a similar flag for avoiding tweets on submission.
It may also be a good idea to have a flag in the document itself to indicate that these documents are generated from PC (something like `source: pc | author`).
… On Oct 15, 2020, at 12:14 AM, metincansiper ***@***.***> wrote:
One other thing is what must be the email address used here (eb1564d#diff-24330a3dafb9473fe683be4b0a77ec948baf915b8a675a9fd6981009332aa77aR91 <eb1564d#diff-24330a3dafb9473fe683be4b0a77ec948baf915b8a675a9fd6981009332aa77aR91>). I did not want to assign it to the pc help email address to avoid the email that would be sent. I wanted to use an address most probably non-existing like ***@***.***
The other question is should we avoid sending emails in a way when multiple articles are created through a biopax file url.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub <#881 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAHRO42W55BEWIZT5CY2Z6LSKZZLHANCNFSM4SQ6P7UA>.
|
The unstable/master instances (running on XXX.XXX.81.174 ) can access a local converters instance exposed on port 9999. Currently, it is accessing the prod converter version, but the |
I think it's this branch here It's got a dockerfile, so that's probably the easiest way to deploy it. Manually setting up all the java stuff on a vm would be a pain. |
We've met this naming problem before. Here's some options:
So any of those work for me. |
I've got pathwaycommons/factoid-converters:biopax_to_factoid running. Turns out the unstable.factoid.baderlab.org instance was always pointing at a local converters via |
I added the controls for email on doc creation and the source. However, I guess that there may not be any need for updating anything to disable tweet on submit. I think so because the tweeting is done when the submit is done through requesting this service endpoint (https://github.com/PathwayCommons/factoid/blob/unstable/src/server/routes/api/document/index.js#L1426). In our case I just call |
Yes. The editor |
I think I figured out that your converter endpoint |
There's no grounding information coming out of that converter: see PathwayCommons/factoid-converters#14 Right now, we could probably fake it, but we need some sort of object that looks like the grounding information: Currently: {
"type": "protein",
"name": "Oct1",
"id": "e35b8c32-165e-446c-be2e-c3a4e946ffaa"
} Desired: {
"namespace": "ncbi",
"type": "ggp",
"dbName": "NCBI Gene",
"dbPrefix": "ncbigene",
"id": "6580",
"organism": "9606",
"organismName": "Homo sapiens",
"name": "SLC22A1",
"synonyms": [
"HOCT1",
"OCT1",
"oct1_cds",
"solute carrier family 22 member 1",
"organic cation transporter 1",
"solute carrier family 22 (organic cation transporter), member 1"
],
...
},
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The grounding should be solved by mapping rather than search.
name | ||
}; | ||
|
||
return searchGrounding( opts ).then( res => { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The BioPAX source will provide an bp:EntityReference
with bp:UnificationXref
, in the case of PhosphoSitePlues, some uniprot knowledgebase
id.
We'll have to map those rather than search.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
e.g.
wget --quiet \
--method POST \
--header 'accept: application/json' \
--header 'content-type: application/x-www-form-urlencoded' \
--body-data 'from=ACC&to=P_ENTREZGENEID&query=P14873&format=tab' \
--output-document \
- https://www.uniprot.org/uploadlists/
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@metincansiper & @jvwong, can this be marked as resolved?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks fine to me. We can always pop in the grounding-search /map
here instead, now or later @metincansiper
@maxkfranz ena entities are in 147 different documents out of 16557 documents. 16557 is the number of documents remaining after excluding the ones including entities with no db xref. Before excluding them there are 16567 documents in total. I think 147 looks pretty small as compare to whole as you mentioned. |
OK, that makes sense, Metin. I think it’s fine to exclude those documents for now. We can reevaluate later if needed. At the end of the conversion, we should have a list of excluded documents that we can review manually. Maybe a CSV that we could import into a spreadsheet (something like id, name, pc url, columns) |
Okay, then I think that the code is ready for review. |
OK. While Jeff and I review, could you put together that spreadsheet I mentioned? It looks like there’s also a conflict that needs to be resolved before merging. Thanks |
Okay, is this row you mentioned (id, name, pc url, columns) all for a document? What do you expect to have in columns field? |
I just meant that it should include these columns:
If you think of other columns that would be useful, feel free to add them. |
Actually paxtools provide me with the pubmed ids of interactions and I make the documents by grouping interactions by their pubmed ids. I can query the document name as well by using the same way as it is done in factoid. However, I guess there may not be a PC URL for these documents/pmid's. What do you think? |
The pubmed id is the main thing, so that’s fine |
A list with just the pubmed ids would be fine |
Okay, where were you meaning to download(factoid client side, factoid server side or java server side)? |
It doesn't matter to me. Wherever it's easiest. You could even just log them to the console and copy to Google Sheets. We just need the list once. The list generation probably won't be a proper, reusable feature of the system, since we don't expect to need it more than once. Does that make sense? |
Sure I created the Google Sheet with pmids: https://docs.google.com/spreadsheets/d/1Aafdwi3f9pmlPCD_zrGIrUo_s3hp9nIEdbux8pAThig/edit?usp=sharing |
Great, Metin. Could you add a column that just links out to pubmed for the ID? We'll need to go over the articles on Pubmed to determine whether any of them are important. |
I updated the document in that way. |
import fs from 'fs'; | ||
import { URLSearchParams } from 'url'; | ||
|
||
import { exportToZip, EXPORT_TYPES } from './export'; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This file needs the conflicts to be resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@maxkfranz I resolved the conflict and merged the current unstable in this branch.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great, thanks Metin
…he service to create a document.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unless there are any red flags, let's merge this in for testing on the master instance.
@metincansiper does this depend on https://github.com/PathwayCommons/factoid-converters/tree/biopax_to_factoid ? |
@jvwong yes |
OK, then let's get that converter code in as well. |
Great. Let's merge in the convertors to the latest, and let's merge this in. Then we can tag and release. The email template has been updated to the latest version of the text from the Google Doc. I verified that the emails are sent by hitting the refresh button on the admin UI, so we're ready to go. Once we've deployed prod, we can send out the emails for one of the already submitted factoids. |
I was not able to merge and push the changes into master directly. I got the following logs:
Therefore, I created a PR instead (PathwayCommons/factoid-converters#18). I am not allowed to merge it in yet. It is pending as below right now, I am not able to click somewhere to visit the travis run: |
@metincansiper, I've made you an admin for the convertors repo, so you should be able to override and merge form now on, if you need. Travis is just really slow lately. It looks like the builds haven't even started. |
Okay I merged both of this one and the PR in factoid-converters. |
@jvwong considering this comment (#881 (comment)) and so PathwayCommons/factoid-converters#14: We are currently searching the grounding by the xrefs coming from converter and using the results as here (https://github.com/PathwayCommons/factoid/blob/unstable/src/server/routes/api/document/index.js#L1675). Here is the example of an xref currently coming from factoid converters (I created a PR for adding the
and this is the grounding that is retrieved in factoid side using that xref:
Are you suggesting to send the grounding as the second json directly from the converters instead of sending the xref from converter (and so not retrieving grounding from the factoid based on the xref as we are doing now but using the input from the converter as is)? |
I think things have improved a lot since that comment and maybe its OK now. I think the original problem was that there wasn't any grounding info at all. |