Store SBML meta information (level, version, packages, provenance) #810

Midnighter · 2019-02-28T10:32:13Z

Hey @opencobra/cobrapy-core,

I'd like to store some meta information about the SBML that was parsed:

- level
- version
- packages used (fbc, annotation, groups)

The big question is where to store that information and I'd like your opinions. Current ideas, either

1. Create a new attribute on the model, `cobra.Model.sbml_info`, that could be a tuple `(level: int, version: int, packages: Tuple[str])`.
2. Create a new `cobra.Model.meta` attribute, which would allow for more general information later. It could be a dictionary, and `model.meta["SBML"]` could contain the above information.

Curious what you think and if you have any other ideas about this 😃
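To make the two options easier to compare, here is a minimal sketch. `SBMLInfo` and `DummyModel` are hypothetical stand-ins invented for illustration, not existing cobrapy classes, and the `sbml_info` and `meta` attributes are only the proposals from this post.

```python
from typing import Any, Dict, NamedTuple, Optional, Tuple


class SBMLInfo(NamedTuple):
    """Option 1: a fixed, typed record (hypothetical name)."""

    level: int
    version: int
    packages: Tuple[str, ...]


class DummyModel:
    """Stand-in for cobra.Model, used only for illustration."""

    def __init__(self) -> None:
        # Option 1: a dedicated attribute, None when not read from SBML.
        self.sbml_info: Optional[SBMLInfo] = None
        # Option 2: a general-purpose dictionary keyed by topic.
        self.meta: Dict[str, Any] = {}


model = DummyModel()
model.sbml_info = SBMLInfo(level=3, version=1, packages=("fbc", "groups"))
# The same record stored under the more general dictionary.
model.meta["SBML"] = model.sbml_info._asdict()
```

The tuple is stricter and self-documenting, while the dictionary leaves room for entries that are unrelated to SBML.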


ChristianLieven · 2019-02-28T11:14:20Z

Since it doesn't really belong to the cobra.Model object, I was thinking something along the lines of returning a tuple if a specific flag was set on the parsing function:

model = read_sbml("path/to/model.xml")
model, tuple = read_sbml("path/to/model.xml", sbml_info=True)

Would that work too?

Midnighter · 2019-02-28T11:18:07Z

Yes, that's another consideration. In general, I think functions having varying return types are a pain in the neck and bad design and I'd like to avoid it in future but this may be an acceptable exception to the rule 😉
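A rough sketch of what the flag-based variant implies for callers and type checkers. `read_sbml` here is a hypothetical stand-in that follows the example above, with real parsing replaced by fixed values, and `DummyModel` stands in for `cobra.Model`. The `typing.overload` declarations are one way to keep static typing usable when a flag changes the return type.

```python
from typing import Dict, Literal, Tuple, Union, overload


class DummyModel:
    """Stand-in for cobra.Model."""


Info = Dict[str, object]


@overload
def read_sbml(path: str, sbml_info: Literal[False] = ...) -> DummyModel: ...
@overload
def read_sbml(path: str, sbml_info: Literal[True]) -> Tuple[DummyModel, Info]: ...


def read_sbml(
    path: str, sbml_info: bool = False
) -> Union[DummyModel, Tuple[DummyModel, Info]]:
    """Hypothetical reader: returns a model, or (model, info) if requested."""
    model = DummyModel()
    # Placeholder values; a real reader would take these from the document.
    info: Info = {"level": 3, "version": 1, "packages": ("fbc", "groups")}
    return (model, info) if sbml_info else model


model = read_sbml("path/to/model.xml")
model_with_info, info = read_sbml("path/to/model.xml", sbml_info=True)
```

Every caller has to know which shape to unpack, which is the cost of a flag-dependent return type.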

kvikshaug · 2019-02-28T13:33:54Z

I agree it doesn't belong to cobra.Model since it might as well have been loaded from JSON. It should also be possible to extract this info from libsbml in cases where it's not able to build a model instance.

gregmedlock · 2019-02-28T14:52:40Z

What about a helper function in io, e.g. cobra.io.read_sbml_info(), that does nothing but return the meta information?
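As a rough illustration of such a helper, the sketch below reads only the header of an SBML document. It deliberately uses the standard library XML parser instead of libsbml, so it is not how cobrapy or libsbml do it. It assumes that package namespaces follow the `.../<package>/version<N>` URI pattern, and the function name simply mirrors the proposal above.

```python
import io
import re
import xml.etree.ElementTree as ET

# Assumed URI shape of package namespaces, e.g. ".../level3/version1/fbc/version2".
_PACKAGE_NS = re.compile(r"/sbml/level\d+/version\d+/(?P<name>\w+)/version(?P<version>\d+)$")


def read_sbml_info(source):
    """Return level, version and packages without building a model."""
    packages = {}
    for event, payload in ET.iterparse(source, events=("start-ns", "start")):
        if event == "start-ns":
            # Namespace declarations are reported before their element starts.
            _prefix, uri = payload
            match = _PACKAGE_NS.search(uri)
            if match:
                packages[match["name"]] = int(match["version"])
        else:
            # The first "start" event is the <sbml> root; stop parsing here.
            return {
                "level": int(payload.get("level")),
                "version": int(payload.get("version")),
                "packages": packages,
            }
    raise ValueError("no root element found")


document = b"""<?xml version="1.0" encoding="UTF-8"?>
<sbml xmlns="http://www.sbml.org/sbml/level3/version1/core"
      xmlns:fbc="http://www.sbml.org/sbml/level3/version1/fbc/version2"
      xmlns:groups="http://www.sbml.org/sbml/level3/version1/groups/version1"
      level="3" version="1" fbc:required="false" groups:required="false">
  <model id="example"/>
</sbml>"""

info = read_sbml_info(io.BytesIO(document))
```

Because parsing stops at the root element, this also works for files from which no model instance can be built.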

Midnighter · 2019-02-28T15:48:48Z

Some further requirements to take into account:

1. Some of the meta information might be desirable when writing a model back to SBML. In that case the model is the only logical place where this information can be stored. @matthiaskoenig can give a better picture on this.
2. Parsing the version information again is cheap because it sits right in the header, but in general I don't think it's desirable to go back and parse information a second time.

ChristianLieven · 2019-03-01T08:30:28Z

> Some of the meta information might be desirable when writing a model back to SBML. In that case the model is the only logical place where this information can be stored. @matthiaskoenig can give a better picture on this.

In that case, I'd prefer the dictionary approach (2.) of your original post.

Would parsing it and then storing it in some sort of global variable that is detached from the model object itself be a solution that is less of a pain in the neck and bad design? Something comparable to a Click context?

Such that:

model = read_sbml("path/to/model.xml")

creates the model and also adds an entry to some sort of MODEL_REGISTRY dictionary that exists for this session:

MODEL_REGISTRY[model<Object 1238452>] = {meta: Information}

When any of the cobra.io functions then encode the model as SBML or JSON, they could default to the information that makes sense for that file type, i.e. writing to SBML would retrieve

{'info': 'SBML L3V1, fbc-v2, groups-v1',
'level': 3, 
'packages': {'fbc': 2, 'groups': 1}, 
'version': 1}

but writing to JSON wouldn't use that information.
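A minimal sketch of the registry idea, assuming a session-wide mapping keyed by the model object. `DummyModel`, `read_sbml` and `sbml_header_for` are hypothetical stand-ins, and the stored values are placeholders. A `weakref.WeakKeyDictionary` is used here so that entries vanish when a model is garbage collected, which is one possible choice rather than part of the proposal.

```python
import weakref
from typing import Any, Dict


class DummyModel:
    """Stand-in for cobra.Model."""


# Session-wide registry; entries are dropped once a model is garbage collected.
MODEL_REGISTRY = weakref.WeakKeyDictionary()


def read_sbml(path: str) -> DummyModel:
    """Hypothetical reader that records meta information on the side."""
    model = DummyModel()
    MODEL_REGISTRY[model] = {
        "SBML": {"level": 3, "version": 1, "packages": {"fbc": 2, "groups": 1}}
    }
    return model


def sbml_header_for(model: DummyModel) -> Dict[str, Any]:
    """What an SBML writer would look up; a JSON writer would not call this."""
    return MODEL_REGISTRY.get(model, {}).get("SBML", {})


model = read_sbml("path/to/model.xml")
header = sbml_header_for(model)
unregistered = sbml_header_for(DummyModel())
```

One consequence to keep in mind: a copy of the model is a different object and would have no registry entry unless the copy step adds one.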

matthiaskoenig · 2019-03-01T08:36:31Z

Just to add to the discussion: Yes, the information is important and would also be very helpful for writing SBML models. Part of the information are the notes and annotations on the SBMLDocument, but also the ModelHistory information, i.e. who created the model. This is also information you would want to set on cobra models before writing: who created the model, when it was created, and what the notes and annotations on the SBMLDocument are. Such provenance is crucial.

You want this information to persist with the model so that it can be written again on export, or at least to have some way to store model/document meta information. Writing a model without information on who created it and when is very bad style. This information should be part of the model.

The information could look like this:

```
meta = {'annotations': {'sbo': 'SBO:0000624'},
        'created': '2016-10-05T13:59:23Z',
        'creators': [{'familyName': 'König',
                      'givenName': 'Matthias',
                      'organisation': 'Humboldt University Berlin',
                      'email': '[email protected]'}],
        'info': 'SBML L3V1, fbc-v2, groups-v1',
        'level': 3,
        'notes': {},
        'packages': {'fbc': 2, 'groups': 1},
        'version': 1}
```

And this information should be stored on the model, e.g. in a `model._sbmlmeta` field.

Best,
M
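As a small aside, the `info` entry in such a dictionary looks derivable from `level`, `version` and `packages`. The sketch below shows one way to build it; `format_sbml_info` is a hypothetical helper name, not an existing cobrapy function.

```python
from typing import Dict


def format_sbml_info(level: int, version: int, packages: Dict[str, int]) -> str:
    """Build a summary string such as 'SBML L3V1, fbc-v2, groups-v1'."""
    parts = [f"SBML L{level}V{version}"]
    # Packages are listed in the insertion order of the dictionary.
    parts.extend(f"{name}-v{pkg_version}" for name, pkg_version in packages.items())
    return ", ".join(parts)


meta = {"level": 3, "version": 1, "packages": {"fbc": 2, "groups": 1}}
meta["info"] = format_sbml_info(meta["level"], meta["version"], meta["packages"])
```

Deriving the string on demand would avoid storing a value that can drift out of sync with the other fields.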


cdiener · 2019-03-01T18:30:25Z

I would argue that information of this kind belongs to a cobra model since it specifies provenance. I agree that it should not be an SBML-specific attribute though. First, cobra.Model already has an annotations dictionary which is not used for much right now and could get a provenance entry. Alternatively, we could add cobra.Model.provenance, which annotates how that model was obtained. For instance, it could indicate the JSON schema version, the reconstruction method, etc. The SBML writer can then pick which of that info it wants to use to write SBML. This is also in line with what many workflow managers and other large projects (for instance Qiime 2) are doing.
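A minimal sketch of that idea, assuming a format-agnostic `provenance` dictionary from which each writer selects what it can use. `DummyModel`, `sbml_fields`, all key names and all values are hypothetical placeholders chosen for illustration.

```python
from typing import Any, Dict


class DummyModel:
    """Stand-in for cobra.Model with the proposed provenance attribute."""

    def __init__(self) -> None:
        self.provenance: Dict[str, Any] = {}


# Keys that a hypothetical SBML writer knows how to serialize.
SBML_KEYS = ("creators", "created", "sbml")


def sbml_fields(provenance: Dict[str, Any]) -> Dict[str, Any]:
    """Select only the provenance entries relevant for SBML export."""
    return {key: provenance[key] for key in SBML_KEYS if key in provenance}


model = DummyModel()
model.provenance.update(
    {
        "source_format": "json",  # placeholder values throughout
        "reconstruction_method": "manual curation",
        "creators": [{"givenName": "Jane", "familyName": "Doe"}],
    }
)
selected = sbml_fields(model.provenance)
```

The model stays free of SBML-specific attributes, and the knowledge of which entries matter lives in each writer.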

Midnighter · 2019-03-01T20:16:08Z

Could you link to an example or documentation that shows this for Qiime 2? I don't know it but your reasoning sounds convincing to me.

cdiener · 2019-03-02T04:44:52Z

There is some argumentation in https://docs.qiime2.org/2019.1/concepts/?highlight=provenance#data-files-qiime-2-artifacts, namely

Artifacts enable QIIME 2 to track, in addition to the data itself, the provenance of how the data came to be. With an artifact’s provenance, you can trace back to all previous analyses that were run to produce the artifact, including the input data used at each step. This automatic, integrated, and decentralized provenance tracking of data enables a researcher to archive artifacts, or for example, send an artifact to a collaborator, with the ability to understand exactly how the artifact was created. This enables replicability and reproducibility of analyses, as well as generation of diagrams and text that can be used in the methods section of a paper. Provenance also supports and encourages the proper attribution to underlying tools (e.g. FastTree to build a phylogenetic tree) used to generate the artifact.

Most of Qiime still works via the command line, but you can look at an example for provenance in the web visualization (clicking on the provenance tab on top)

cdiener · 2022-11-04T19:15:17Z

Now tracked in #1237 and available as part of the history.

Midnighter added the SBML Related to reading and writing SBML models. label Feb 28, 2019

Midnighter added this to the COBRApy 1.0 milestone Feb 28, 2019

Midnighter self-assigned this Feb 28, 2019

matthiaskoenig changed the title ~~Store SBML information~~ Store SBML meta information (level, version, packages, provenance) Mar 5, 2019

matthiaskoenig self-assigned this Mar 18, 2019

matthiaskoenig mentioned this issue May 29, 2019

Error encountered trying to set model history when writing SBML #850

Closed

Hemant27031999 mentioned this issue Aug 15, 2020

Metadata fbc3 group #988

Open

9 tasks

akaviaLab mentioned this issue May 29, 2022

Metadata fbc3 group #1225

Closed

akaviaLab mentioned this issue Jun 19, 2022

Metadata fbc3 #1237

Open

1 task

cdiener closed this as completed Nov 4, 2022
