Title: Overview of Bioinformatics
1Overview of Bioinformatics
A Tutorial on Bioinformatics
A/P Shoba Ranganathan Justin Choo
National University of Singapore
2What is Bioinformatics ?
- Bioinformatics is the study of the information
content and information flow in biological
systems and processes. - - Michael Liebman in Bioinformatics An
Editorial Perspective (http//www.netsci.org/Scie
nce/Bioinform/feature01.html) - Annotate -gt store -gt search/retrieve -gt
analyze -gt visualize - Nucleic acid sequence (genes and RNAs),
protein sequence and structural information.
3SARS Its Implication ...
4SARS - Bioinformatics In Action
5Sequencing Of SARS
Photo above shows the sequencing area of the lab.
Taken from http//www.bcgsc.ca/bioinfo/SARS/
6Partial Sequence of SARS
- gtgi30248028gbAY274119.3 SARS coronavirus
TOR2, complete genome ATATTAGGTTTTTACCTACCCAGGAAAA
GCCAACCAACCTCGATCTCTTGTAGATCTGTTCTCTAAACGA
ACTTTAAAATCTGTGTAGCTGTCGCTCGGCTGCATGCCTAGTGCACCTAC
GCAGTATAAACAATAATAAA TTTTACTGTCGTTGACAAGAAACGAGTAA
CTCGTCCCTCTTCTGCAGACTGCTTACGGTTTCGTCCGTGT
TGCAGTCGATCATCAGCATACCTAGGTTTCGTCCGGGTGTGACCGAAAGG
TAAGATGGAGAGCCTTGTTC TTGGTGTCAACGAGAAAACACACGTCCAA
CTCAGTTTGCCTGTCCTTCAGGTTAGAGACGTGCTAGTGCG
TGGCTTCGGGGACTCTGTGGAAGAGGCCCTATCGGAGGCACGTGAACACC
TCAAAAATGGCACTTGTGGT CTAGTAGAGCTGGAAAAAGGCGTACTGCC
CCAGCTTGAACAGCCCTATGTGTTCATTAAACGTTCTGATG
CCTTAAGCACCAATCACGGCCACAAGGTCGTTGAGCTGGTTGCAGAAATG
GACGGCATTCAGTACGGTCG TAGCGGTATAACACTGGGAGTACTCGTGC
CACATGTGGGCGAAACCCCAATTGCATACCGCAATGTTCTT
CTTCGTAAGAACGGTAATAAGGGAGCCGGTGGTCATAGCTATGGCATCGA
TCTAAAGTCTTATGACTTAG GTGACGAGCTTGGCACTGATCCCATTGAA
GATTATGAACAAAACTGGAACACTAAGCATGGCAGTGGTGC
ACTCCGTGAACTCACTCGTGAGCTCAATGGAGGTGCAGTCACTCGCTATG
TCGACAACAATTTCTGTGGC CCAGATGGGTACCCTCTTGATTGCATCAA
AGATTTTCTCGCACGCGCGGGCAAGTCAATGTGCACTCTTT
CCGAACAACTTGATTACATCGAGTCGAAGAGAGGTGTCTACTGCTGCCGT
GACCATGAGCATGAAATTGC CTGGTTCACTGAGCGCTCTGATAAGAGCT
ACGAGCACCAGACACCCTTCGAAATTAAGAGTGCCAAGAAA
TTTGACACTTTCAAAGGGGAATGCCCAAAGTTTGTGTTTCCTCTTAACTC
AAAAGTCAAAGTCATTCAAC CACGTGTTGAAAAGAAAAAGACTGAGGGT
TTCATGGGGCGTATACGCTCTGTGTACCCTGTTGCATCTCC
ACAGGAGTGTAACAATATGCACTTGTCTACCTTGATGAAATGTAATCATT
GCGATGAAGTTTCATGGCAG ACGTGCGACTTTCTGAAAGCCACTTGTGA
ACATTGTGGCACTGAAAATTTAGTTATTGAAGGACCTACTA - . . . . . . . . . . . . . . .
The complete genome of SARS, obtained from
http//www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?302
48028NCBI4812069
7Bioinformatics -Timeline
- Single Structures
- Modeling Geometry
- Forces Simulation
- Docking
- Sequences, Sequence-Structure Relationships
- Alignment
- Structure Prediction
- Fold recognition
- Genomics
- Dealing with many sequences
- Gene finding Genome Annotation
- Databases
- Integrative Analysis
- Expression Proteomics Data
- Data mining
- Simulation again(whole cells?).
8Biological Databases
- Collect, organise and classify data
- Query the dataset
- Retrieve entries based on keyword search
PDB
EMBL
Genbank
9Sequence Analysis Software
- What is the information contained in a biological
sequence? - How can we analyse it to gain knowledge?
- Does it contain any functional clues?
10Sequence Comparison
- How can we compare a given sequence to the
millions in the database? - Which ones are truly related by evolution?
- What can the study of related sequences tell us?
11Sequence Alignment
- After collecting a set of related sequences, how
can we compare them as a set? - How should we line up the sequences so that the
most similar portions are together? - What do we do with sequences of different
lengths?
12Protein Structure
- The function of a protein is a consequence of its
folded state Anfinsen, 1961 - The 3D fold of a protein is called its structure
- In 3D, the business end of the protein has
contributions from different regions of its
sequence
Picture taken from http//www.strgen.org/
13Visualization
- Using graphic tools to view structures
- Simple commands to analyse structures and active
sites - Different graphic representations and colouring
schemes
Picture taken from http//www.nature.com/
14Careers in Bioinformatics
- Genomics
- Genome sequencing of
- Bacteria, viruses
- Animals
- Plants
- Comparative genomics
- Annotation and Mapping
- Gene Discovery
15Careers in Bioinformatics
- Functional Genomics (Gene Expression and
Regulation) - Control Regions
- Switches
- Circuits
- Bypass
- Feedback loops
- Environmental Effects
- Diseased States
- Chemical Consequences
16Careers in Bioinformatics
- Pharmacogenomics
- SNPs
- Regional, ethnic variations
- Inheritance patterns
- Radiological/ecological modifications
- Therapeutic target recognition
- Correlation of drug and expression effects
- Pathway Effects
17Careers in Bioinformatics
- Proteomics
- Protein Profiling
- Alternate splice variants
- Orphan genes
- Cryptic introns
- Gene Therapy
18Careers in Bioinformatics
- Structural Genomics
- Experimental Protein structures
- Apo state
- Holo state
- Structural modifications
- Membrane Proteins
- Homology Modelling
- Comparative Modelling
19Careers in Bioinformatics
- Drug and Vaccine Design
- Screening Natural Products
- Plants
- Fungi
- Bacteria
- Chemicals
- In silico modifications of ligands
- Vaccine design and delivery
20Job Sectors
- Academia
- Research Institutes
- Biotechnology
- Bioinformatics
- Pharmaceutical
- Agriculture
- Biodiversity
21The End