Title: UK CropNet databases: a brief guide
1UK CropNet databases: a brief guide
- Keith Bradnam
- Nottingham Arabidopsis Stock Centre
2http://ukcrop.net
3(No Transcript)
4Where are we?
5UK CropNet Databases
MilletGenes
Arabidopsis Genome Resource (AGR)
BarleyDB
FoggDB
BrassicaDB
26 other plant-related databases
ukcrop.net/db.html
6BLAST server available
- Blast databases updated nightly
- Fast!
- ukcrop.net/perl/ace/ncbi_blast
- Blast server available from any database
7(No Transcript)
8UK CropNet database usage
9Hits to UK CropNet website 1996-2000
10Where in the world?
11UK CropNet next year
- More databases? Have approached:
- CompositDB
- LupinDB
- ZmDB
- SnapDragonDB
- PopulusDB
- GFace access
- ARCADE
- ComapDB
12Feedback
- If you like our databases, tell us
- If you don't like our databases, tell us!
- These databases are often easy to customise
- We are your loyal servants!!!
13Arabidopsis Genome Resource
14AGR: a complex ACEDB database
- There is a lot of Arabidopsis information available!
- 130 Mb genome, essentially complete
- Both major ecotypes have been sequenced
- Organising so much data is not easy!
- Why is there so much data?
15Smallest plant genome?
16What data is in AGR?
- Sequences (updated daily)
- AGI genome sequence (1,500 clones)
- EST and GSS sequences
- Organelle sequences
- Other ecotype sequences
- Protein sequences (Swiss-Prot and TrEMBL)
- Insert sequences
- 230,000 sequences in total
17Other AGR data
- Maps and markers
- Physical, genetic, and recombinant inbred (RI) maps
- 1,200 RI markers
- RI scoring information
- Clone, Locus, and Allele info
- Germplasm info with links to order stocks
18Still more AGR data
- Bibliographic data (from EMBL records)
- Images (plants and gels)
- Other species info (protein sequences and associated info from all higher plant species)
- BLAST homologies: millions of hits
- Mostly intra-specific homologies
- Some inter-specific homologies
19AGR: recent improvements
- More data
- e.g. EMBL sequences, protein sequences
- More annotation of existing EMBL data, e.g.
- Tissue type information
- Ecotype info
- CDS evidence
- Inclusion of new (non-EMBL) data, e.g.
- Insert sequences
- SNP data
20Insert lines
- Many Arabidopsis plants now contain random transposon insertions
- Therefore genes of interest may be hit, or modified by inserts
- Genomic location of inserts identified by BLAST analysis
- Can only identify putative location
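A minimal Python sketch of why a BLAST-derived insert location is only putative. It assumes the hits are available in the standard 12-column tab-separated BLAST tabular format (query, subject, % identity, alignment length, mismatches, gap opens, query start, query end, subject start, subject end, e-value, bit score). The function name, the identity threshold, and the example IDs are hypothetical; this is not a description of how AGR itself processes the data.

```python
from collections import defaultdict


def putative_insert_locations(blast_lines, min_identity=95.0):
    """Pick the best genomic hit for each insert flanking sequence.

    Returns {query_id: {"clone", "start", "end", "bitscore", "ambiguous"}}.
    "ambiguous" is True when a second hit ties the best bit score, which is
    one reason a location can only be called putative.
    """
    hits = defaultdict(list)
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        identity = float(fields[2])
        s_start, s_end = int(fields[8]), int(fields[9])
        bitscore = float(fields[11])
        if identity >= min_identity:
            # Minus-strand hits report start > end, so normalise the order.
            hits[query].append(
                (bitscore, subject, min(s_start, s_end), max(s_start, s_end))
            )

    locations = {}
    for query, query_hits in hits.items():
        query_hits.sort(key=lambda h: h[0], reverse=True)
        best = query_hits[0]
        ambiguous = len(query_hits) > 1 and query_hits[1][0] == best[0]
        locations[query] = {
            "clone": best[1],
            "start": best[2],
            "end": best[3],
            "bitscore": best[0],
            "ambiguous": ambiguous,
        }
    return locations


# Hypothetical example data: insA has one clear best hit, insB has a tie.
example = [
    "insA\tcloneX\t99.0\t200\t2\t0\t1\t200\t5000\t5199\t1e-100\t380",
    "insA\tcloneY\t96.0\t150\t6\t0\t1\t150\t900\t751\t1e-60\t250",
    "insB\tcloneX\t98.0\t100\t2\t0\t1\t100\t100\t199\t1e-40\t180",
    "insB\tcloneZ\t98.0\t100\t2\t0\t1\t100\t700\t799\t1e-40\t180",
]
locations = putative_insert_locations(example)
```

In this example `insA` gets a single best location on `cloneX`, while `insB` is flagged as ambiguous because two clones match equally well.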
21It's a knock-out!
22It's a knock-out!
ATCGCTTAAGGACTGGCACCAC
23Insert data in AGR
- SINS (Jones) inserts
- Stocks and sequences
- IMA (Sundaresan) inserts
- Stocks and some location info
- ITS (Pereira) inserts
- Just sequences
- Launchpad (Muskett)
- Just sequences
24Finding an insert in your gene
- AGR and NASC provide many different ways of finding insert information
- ukcrop.net/agr/insert.html
- arabidopsis.org.uk/catalogue.html
- arabidopsis.org.uk/blast.html
- arabidopsis.org.uk/insertwatch
25Webace screenshot
- 1) find gene of interest in AGR
26Search precomputed blast data
2) search pre-computed blast analysis
27NASC blast server
3) blast search against insert database
284) Register your sequences with InsertWatch
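The four steps above all come down to asking whether a putative insert location overlaps a gene of interest. A minimal Python sketch of that overlap check follows. The function name, the tuple layouts, and the `flank` parameter are assumptions made for illustration; the actual AGR, NASC BLAST, and InsertWatch implementations are not described in these slides.

```python
def inserts_in_gene(gene, inserts, flank=0):
    """Return IDs of inserts whose putative location overlaps a gene.

    gene:    (clone, start, end)
    inserts: iterable of (insert_id, clone, start, end)
    flank:   extra bases to include on each side of the gene
    """
    gene_clone, gene_start, gene_end = gene
    low = min(gene_start, gene_end) - flank
    high = max(gene_start, gene_end) + flank

    found = []
    for insert_id, clone, start, end in inserts:
        ins_low, ins_high = min(start, end), max(start, end)
        # Two intervals on the same clone overlap when each one starts
        # before the other one ends.
        if clone == gene_clone and ins_low <= high and ins_high >= low:
            found.append(insert_id)
    return found


# Hypothetical coordinates for illustration only.
gene = ("cloneX", 4800, 5600)
inserts = [
    ("insA", "cloneX", 5000, 5199),
    ("insB", "cloneX", 100, 199),
    ("insC", "cloneY", 5000, 5100),
]
hits_in_gene = inserts_in_gene(gene, inserts)
hits_nearby = inserts_in_gene(gene, inserts, flank=5000)
```

A non-zero `flank` is one way to also catch inserts in nearby upstream or downstream sequence, which may still modify the gene.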
29Spreading the word
- Raise general awareness of AGR
- Many bioinformatics sites link to TAIR but not AGR
- Make people aware of AGR contents
- Sequences, maps, stocks, papers, etc.
- AGR vs TAIR
- AGR is now a lot faster for general queries
- No limits (as yet) on blast searching
- AGR contains the sequences
- AGR (hopefully) keeps things simple
30Future developments
- More data!
- Microarray data
- More insert sequences
- More links between NASC and AGR
- Acknowledgements: Jamie Kincaid, NASC, UK CropNet
- ukcrop.net/agr
- arabidopsis.org.uk/insertwatch