|
PsimiXMLSpecifications
PSI-MI XML 2.5 format specifications
Featured IntroductionThe PSI-MI XML 2.5 is a community standard for molecular interactions which has been jointly developed by major data providers (BIND, CellZome, DIP, GSK, HPRD, Hybrigenics, IntAct, MINT, MIPS, Serono, U. Bielefeld, U. Bordeaux, U. Cambridge, and others). This format is stable and used for several years now : published in October 2007 (Broadening the Horizon – Level 2.5 of the HUPO-PSI Format for Molecular Interactions; Samuel Kerrien et al. BioMed Central. 2007.), it has been adapted for many different usages. It can be used for storing any kind of molecular interaction data :
Data representation in PSI-MI 2.5 XML relies heavily on the use of controlled vocabularies. They can be accessed easily via the Ontology Lookup Service: PSI-MI, PSI-MOD.
Simplified view of the schema
Detailed documentation of the schemaThis documentation is aimed at describing the various elements that compose the PSI-MI 2.5 XML schema.
XML Compact and Expanded formatsTwo different flavours of this format have been developed : the Compact format and the Expanded format. These two different formats are described in a single XML schema which can be confusing because it is easy to mix the two different flavours. To be able to fully enjoy the flexibility of the PSI-MI XML 2.5 format, it is important to understand what are the two different flavours and what are the limits between them.
Compact formatIn the Compact format, the description of experiments and interactors is done at the beginning of the entry using experimentList and interactorList elements. When describing the interactions (in the next element interactionList), we will use references to the previously described experiments and interactors using their id attributes.
How to build it?1) Describe all the experiments of your file in the experimentList at the entry level. All the id attributes of the experiments must be present and unique in the all file (not only with experiments but also must be different from interaction ids, interactor ids, etc.) 2) Describe all the interactors of your file in the interactorList at the entry level. All the id attributes of the interactors must be present and unique in the all file (not only with interactors but also must be different from interaction ids, experiment ids, etc.) 3) For each interaction, use the element experimentRef in the experimentList. You should refer to the id attribute of one of the experiments described at the entry level. 4) For each participant, use the element interactorRef (or interactionRef if you want to describe a complex). You should refer to the id attribute of one of the interactors/interactions described at the entry level. 5) For each experimental interactor (participant level), use the element interactorRef. You should refer to the id attribute of one of the interactors described at the entry level. 6) For each inferred interaction (interaction level), the participantRef (interacting participant) or featureRef (binding site interacting with other binding sites) should refer to a participant or feature described within the interaction and not to a participant or feature described in another interaction. 7) Several elements can have experiment references which should point to one experiment described at the entry level :
Advantages
Drawbacks
Warnings
Examples
Expanded formatIn the Expanded format, the description of experiments and interactors is done within each interaction. The file doesn't contain any experimentList or interactorList at the entry level, it only contains an interactionList.
How to build it?1) For each interaction, describe fully your experiments. Use the experimentDescription element in the experimentList. 2) For each participant (interaction level), describe fully your interactor. Use the interactor element (or interactionRef element in case of complex. The references point to an id attribute of one interaction described in the interactionList at the entry level). 3) For each experimental interactor (participant level), use the element interactor. You can use the interacorRef element but it should only refer to an interactor described within this interaction. 4) For each inferred interaction (interaction level), the participantRef (interacting participant) or featureRef (binding site interacting with other binding sites) should refer to a participant or feature described within the interaction and not to a participant or feature described in another interaction. 5) Several elements can have experiment references which should point to one experiment described within the interaction :
Advantages
Drawbacks
WarningsIt is possible to use experiment references in the Expanded format only if we refer to an experiment described within the current interaction. In the Expanded format, each interaction is independent from each other and it is not possible to be be within an interaction and use references to other experiments and interactors which have been described in other interactions. Examples
Overwriting elementsThe XML format is very flexible and allow you to overwrite some elements :
In some cases, a specific participant will be identified with different method(s) and you can overwrite the participant identification method of the experiment within the participant element using the participantIdentificationMethodList. If several experiments are attached to the interaction, it is necessary to specify the experimentRef for each participantIdentificationMethod described at the participant level.
In some cases, a specific feature will be detected with different method(s) and you can overwrite the feature detection method of the experiment within the feature element using the featureDetectionMethodList.
LimitationsIncomplete elements
Publication
ComplexesYou can describe complexes in two different ways :
|