Title: Taxon and GO
1Taxon and GO
2Removal of sensu
- Old system
- larval development (sensu Insecta)
- def The process whose specific outcome is the
progression of the larva over time, from its
formation to the mature structure. As in, but not
restricted to Insecta.
3- New system
- instar larval development
- def The process whose specific outcome is the
progression of the larva over time, from its
formation to the mature structure. This begins
with the newly hatched first-instar larva,
through its maturation to the end of the last
larval stage
4GO is species-neutral
- The GO can support annotation across all species.
- This is good!
- Total exclusion of taxon information can
- create problems.
5- We have 23802 terms.
- Time-sink for users.
- Worst for
- peripheral
- species groups.
http//bestblog.mlblogs.com/inside_pitch/images/ne
edle_in_the_haystack_4.jpg
6Typical User Query Hello, I'd like to load
into our database only GO terms that are
relevant for plants. Is there such GO-slim ? If
not, is there a way to filter out GO terms that
are not applicable for plants? Thanks, -Naama
Menda
Plant and Animal Genome Meeting 2005 Speaker
Some people think the size of GO
makes it hard to handle. Audience Raucous
laughter and shouts of no really?
Sanger microbial annotation We need a
prokaryote slim or we will not be able to
annotate fast enough to meet our targets.
(paraphrased)
7GO language can be quite subtle.
- membrane of vacuole
- with cell cycle-independent morphology
- def The lipid bilayer surrounding a vacuole that
retains the same shape regardless of cell cycle
phase. The membrane separates its contents from
the cytoplasm of the cell.
8To an outsider this is like
- (8) If the Secretary of State considers it
necessary or desirable to do so, he may by
regulations prescribe a scale or scales of fees
to have effect, for such period as is specified
in the regulations, in place of any scale or
scales prescribed by the Commission and, if he
does so, references in subsections (3) and (4) to
the appropriate scale are to be read as respects
that period as references to the appropriate
scale prescribed by the Secretary of State.
Audit Commission Act 1998 - 1998 CHAPTER 18 -
http//www.opsi.gov.uk/acts/acts1998/ukpga_1998001
8_en_2
9To an outsider this is like
- (8) If the Secretary of State considers it
necessary or desirable to do so, he may by
regulations prescribe a scale or scales of fees
to have effect, for such period as is specified
in the regulations, in place of any scale or
scales prescribed by the Commission and, if he
does so, references in subsections (3) and (4) to
the appropriate scale are to be read as respects
that period as references to the appropriate
scale prescribed by the Secretary of State.
Audit Commission Act 1998 - 1998 CHAPTER 18 -
http//www.opsi.gov.uk/acts/acts1998/ukpga_1998001
8_en_2
10Seen in context..
- membrane of vacuole
- with cell cycle-independent morphology
- def The lipid bilayer surrounding a vacuole that
retains the same shape regardless of cell cycle
phase. The membrane separates its contents from
the cytoplasm of the cell.
membrane of vacuole with cell cycle-correlated
morphology def The lipid bilayer surrounding a
vacuole, the shape of which correlates with cell
cycle phase. The membrane separates its contents
from the cytoplasm of the cell.
11With some inside information
Plant vacuole membrane
- membrane of vacuole
- with cell cycle-independent morphology
- def The lipid bilayer surrounding a vacuole that
retains the same shape regardless of cell cycle
phase. The membrane separates its contents from
the cytoplasm of the cell. - membrane of vacuole
- with cell cycle-correlated morphology
- def The lipid bilayer surrounding a vacuole, the
shape of which correlates with cell cycle phase.
The membrane separates its contents from the
cytoplasm of the cell.
Fungal vacuole membrane
12Error checking
!?
13The Proposed Solution
- We can apply taxon information in a rigorous and
non-misleading way using relationships - is_relevant_for
- is_only_in
- applies_to_all
14 15Relevance is_relevant_to
- This GO term can be used for annotation of this
taxon. - The GO term may not be appropriate for all
subgroups of this taxon. - The GO term may not be appropriate for all
supergroups of this taxon.
16- membrane of
- vacuole
- with cell cycle
- -independent
- morphology
is_relevant_for
Arabidopsis
membrane of vacuole with cell cycle- correlated
morphology
Saccharomyces
is_relevant_for
17Relevance relationships
- Added by curators.
- Inferred from annotations.
18Relevance - Propagation
- Propagates up the GO.
- Propagates up the taxon hierarchy.
19Relevance
20Relevance
21Relevance
cellular organisms Eukaryota Fungi/Metazoa
group Metazoa Eumetazoa Bilateria
Coelomata Deuterostomia Chordata Craniata
Vertebrata Gnathostomata Teleostomi
Euteleostomi Actinopterygii Actinopteri
Neopterygii Teleostei Elopocephala
Clupeocephala Otocephala Ostariophysi
Otophysi Cypriniphysi Cypriniformes
Cyprinoidea Cyprinidae Rasborinae Danio
22Relevance
cellular organisms Eukaryota Fungi/Metazoa
group Metazoa Eumetazoa Bilateria
Coelomata Deuterostomia Chordata Craniata
Vertebrata Gnathostomata Teleostomi
Euteleostomi Actinopterygii Actinopteri
Neopterygii Teleostei Elopocephala
Clupeocephala Otocephala Ostariophysi
Otophysi Cypriniphysi Cypriniformes
Cyprinoidea Cyprinidae Rasborinae Danio
23 24Specificity is_only_in
- The GO term never applies to species outside of
this taxon. - (The GO terms may apply to only some subtaxa of
the taxon, and not all.)
25Viridiplantae
http//www.seaweed.ie/algae/images/hydrodictyon2.j
pg
http//www.blackburn.gov.uk/server.php?showconMed
iaFile.7355
green plants and algae
26http//www.blackburn.gov.uk/server.php?showconMed
iaFile.7355
Leaf development never occurs outside of
Viridiplantae.
Not all subtaxa of Viridiplantae make leaves.
http//www.seaweed.ie/algae/images/hydrodictyon2.j
pg
27formally
- GO term leaf development
- relationship is_only_in
- taxon Viridiplantae
- Leaf development never occurs outside of
Viridiplantae. - (Not all subtaxa of Viridiplantae make leaves.)
28Specificity - is_only_in
- Added manually by curators as they develop the
graph.
29Specificity - is_only_in
GO Taxon
- Propagates down the GO.
- Propagates up the taxon hierarchy.
30Specificity - is_only_in
31Specificity - is_only_in
32Specificity - is_only_in
Cellular organisms Eukaryota
Viridiplantae
33Specificity - is_only_in
Cellular organisms Eukaryota
Viridiplantae
34What about convergent evolution?
http//www.dcn.davis.ca.us/go/platypus/images/plat
ygeo.jpg
http//homepage.mac.com/wildlifeweb/bird/mallard/m
allard_male_image_11tfk.jpg
35Assumptions- beak development
- These beaks were produced by convergent evolution
- The two beak development processes are the same.
36one beak development term
- is_relevant_for Anas platyrhynchos
- (mallard)
- is_relevant_for Ornithorhynchus anatinus
- (Platypus)
- is_only_in Amniota
- (or vertebrata if that is easier)
37 38Validity applies_to_all
- The GO term applies to all subtaxa of a given
taxon. - (The GO term may also apply to species outside of
the taxon.)
39Validity applies_to_all
Homoiothermy is exhibited by all subtaxa of
the taxon Mammalia.
http//www.thermalconsultancy.com/
http//coolcosmos.ipac.caltech.edu/image_galleries
/ir_zoo/coldwarm.html
40Validity applies_to_all
Homoiothermy is absent in many of the taxonomic
groups outside of Mammalia.
http//coolcosmos.ipac.caltech.edu/image_galleries
/ir_zoo/coldwarm.html
41Validity applies_to_all
with some exceptions
Homoiothermy is also present in birds, tuna fish,
and honey bees.
Bee Hive
http//www.beebehavior.com/
42formally
- GO term homoiothermy
- relationship applies_to_all
- taxon Mammalia
- Homoiothermy is found in all subtaxa of the taxon
Mammalia. - (Homoiothermy is also found outside of the taxon
Mammalia.)
43Validity
- Propagates up the GO.
- Propagates down the taxon hierarchy.
44Validity
45Validity
46Validity
47Validity applies_to_all
- Added manually by curators
48Cave fish
- Not all fish carry out eye development processes.
http//www.petpig.com/fish/fish/blind_cave_fish1.j
pg
49 50eye development applies_to_all
?
51fish
eye development applies_to_all
-
- other
- fish
- cave other
- fish fish
52fish
-
- other
- fish
- cave other
- fish fish
53What does this buy us?
54Users no longer hunting for terms in the GO
haystack.
http//bestblog.mlblogs.com/inside_pitch/images/ne
edle_in_the_haystack_4.jpg
55All species.
56Mammal-specific terms greyed out.
57All species.
58Insect-specific terms greyed out.
59Clues to language
- membrane of vacuole
- with cell cycle-independent morphology
- def The lipid bilayer surrounding a vacuole that
retains the same shape regardless of cell cycle
phase. The membrane separates its contents from
the cytoplasm of the cell.
60Clues to language
- membrane of vacuole
- with cell cycle-independent morphology
- def The lipid bilayer surrounding a vacuole that
retains the same shape regardless of cell cycle
phase. The membrane separates its contents from
the cytoplasm of the cell. - is_relevant_for Viridiplantae
61Automatic Annotation and Ontology Error Checking
Brown Arabidopsis annotations. Blue
Drosophila annotations.
62Brown Arabidopsis annotations. Blue
Drosophila annotations.
63Brown Arabidopsis annotations found. Blue
Drosophila annotations found.
64Error checking
65GO-Taxon relationships
- To be put in an external OBO format file.
- GO file link
file slimmed -
taxonomy
file
66Acknowledgements
- Waclaw Kusnierczyk.
- Taxonomy-based partitioning of the Gene Ontology
- In press Journal of Biomedical Informatics.
- Proposal by
- Jennifer Deegan
- Chris Mungall
- Jane Lomax
With input from Midori Harris Melissa
Haendel Judy Blake John Day-Richter Waclaw
Kusnierczyk David Hill