Order of elements in phyloXML
2010.09.09 3 Comments
Since the validity of phyloXML documents is enforced by a XSD Schema, the order of elements matters (for more information and discussions, see http://www.w3schools.com/Schema/schema_complex_indicators.asp).
The current (as of 9 September 2010) BioPerl implementation of the phyloXML format unfortunately produces output with incorrect element order. For Archaeopteryx users a temporary “solution” is to turn off XSD-based validation, with the following line in the Archaeopteryx configuration file:
validate_against_phyloxml_xsd_schema: false
Examples of proper order of sub-elements
For <clade> the order of sub-elements is:
- <name>
- <branch_length>
- <confidence>
- <width>
- <color>
- <taxonomy>
- <sequence>
- <events>
- <binary_characters>
- <distribution>
- <date>
- <reference>
- <property>
- <clade>
For <sequence>, the order is:
- <symbol>
- <accession>
- <name>
- <location>
- <mol_seq>
- <uri>
- <annotation>
- <domain_architecture>
For <taxonomy>, the order is:
- <id>
- <code>
- <scientific_name>
- <authority>
- <common_name>
- <synonym>
- <rank>
- <uri>
Needless to say, not all sub-elements have to appear, but if they do, they have to appear in proper order.