Title: Development, maintenance and sharing of small-scale databases for genome research
1. Development, maintenance and sharing of small-scale databases for genome research
- Volker Brendel
- Department of Genetics, Development and Cell Biology
- Department of Statistics
- Iowa State University
4. The joys and perils of molecular data mining
5. http://genome-www5.stanford.edu/MicroArray/SMD/
Blader IJ, et al. (2001) J Biol Chem 276(26):24223-31. Microarray Analysis Reveals Previously Unknown Changes in Toxoplasma gondii-infected Human Cells.
7. The Molecular Biology Database Collection: 2003 update
Andreas D. Baxevanis
Database Categories List
- Major Sequence Repositories
- Comparative Genomics
- Gene Expression
- Gene Identification and Structure
- Genetic and Physical Maps
- Genomic Databases
- Intermolecular Interactions
- Metabolic Pathways and Cellular Regulation
- Mutation Databases
- Pathology
- Protein Databases
- Protein Sequence Motifs
- Proteome Resources
- RNA Sequences
- Retrieval Systems and Database Structure
- Structure
- Transgenics
- Varied Biomedical Content
8. Alphabetical Database List
- 16S and 23S Ribosomal RNA Mutation Database
- AAindex: Physicochemical properties of peptides
- ACeDB: C. elegans, S. pombe, and human sequences and genomic information
. . .
- ZmDB: Maize genome database
9. Molecular Databases - Problems
- Content (accuracy and currency)
- Multiplicity (> 1,000 specialized databases!?)
- Lack of standards (e.g., ZmDB ACeDB, MySQL, FileMakerPro)
- Accessibility (web!?)
10. Why are there so many distinct databases in molecular biology?
12. NSF 99-171
PLANT GENOME RESEARCH PROGRAM - COLLABORATIVE RESEARCH ON FUNCTIONAL GENOMICS
Program Announcement
DIRECTORATE FOR BIOLOGICAL SCIENCES
LETTER OF INTENT: NOVEMBER 8, 1999
PROPOSAL DEADLINE: JANUARY 7, 2000
NATIONAL SCIENCE FOUNDATION
13. Informatics: Include a detailed description of
all informatics components of the project. This
section should describe the informatics tools
used for internal data management as well as the
distribution of information to the scientific
community. Technical descriptions must be
sufficiently detailed to allow adequate review by
informatics experts. All data must be released to
the public in an accessible and useable form. If
project includes development of a new database or
expansion of an existing database, a plan for its
long-term maintenance must be described.
14. Case study: ZmDB, a maize genome database
20. End of funding! End of project!? End of database!? End of data integrity!?
22. MaizeGDB
24. Case study: ZmDB, one need leads to many others
26. PlantGDB
28. Data integrity: Problems and possible solutions
29. AtGDB
34. Matt Wilkerson
Shannon Schlueter
NSF Plant Genome Research Project(s)
36. Genome Annotation: The Need for User Contributions!
Examples for Arabidopsis
- At1g28080 (missed 5' UTR)
- At2g40840 (intergenic region? U12 intron!)
User Contributed Annotation: Alan Myers!?
- At1g14370 (overlapping 3'-UTRs ? no!)
User Contributed Annotation: Your Name here!!