Title: Practical Applications of Sequence Analysis
1Practical Applications of Sequence Analysis
- MIC 753 - The Theory and Application of Sequence
Analysis
2Instructor (Me)
- Elliot Lefkowitz
- Office BBRB 277A
- Phone 4-1946
- E-mail ElliotL_at_uab.edu
3Time and Place
- April 3 to June 5 2001
- no class May 17
- no class May 29
- Tuesdays 300 430
- Thursdays 100 230
- BBRB 170
4World-Wide-Web Site
- http//genome.microbio.uab.edu/MIC753
- General Information
- Handouts
- Lecture Notes
- Discussion List
- Links
5Discussion List
- Talk with Elliot
- Ask questions
- Get answers
- Express your frustration
- Public!
6Email
- ElliotL_at_uab.edu
- Talk with Elliot
- Ask questions
- Get answers
- Express your frustration
- Private!
- Send me your Email Address!!
7Sequence analysis at UAB
- http//genome.microbio.uab.edu
- Many local sites mentioned in this course have
links from this page
8Course Content
- Tools for Sequence Analysis
- GCG
- NCBI
- Databases
- Other Web-based tools
- Specialized Software
- Publication
- Understanding their use
9GCGGenetics Computer Group
- The Wisconsin Package
- Complete package of programs for sequence
analysis - Nucleic acid and Protein databases
- Supporting databases
10GCG Costs
- Program and database costs are subsidized by
- UAB Health Services Foundation
- UAB AIDS Center
11GCG Interfaces
- Command line
- Text based commands
- UNIX
- Graphics display
- SeqLab
- Xwindows
- Windows environment
- SeqWeb
- Web-based sequence analysis
12 Other Useful Programs
- Available for
- Macintosh
- Windows PCs
- UNIX Workstations
- Web-based tools
13Microcomputer Sequence Analysis Packages
14Pros and Cons
- Initially expensive
- Databases supplied on CD-ROM
- Available on-line from Genbank
- Packages vary in completeness
- Generally easy to use
15Reference Material
16GCG Documentation
- Web-based
- http//genome.microbio.uab.edu/GCG/GCG10/gcgmanual
.html - Username gcg
- Password uab
- Order from GCG
- http//www.gcg.com/ordering/orderdocform.html
- UNIX documentation
17GCG Users Guide
- GCG Manual Version 10.1
- Basic guide to UNIX
- Running GCG programs
- Program descriptions
- Working with data
- Generating graphics
18GCG Program Manual
- Complete Program Guide
- Description
- Input
- Output
- Parameters
- Algorithms
- Example
19SeqLab Users Guide and Tutorial
- GCG XWindows Interface
- Windows-like control
- menus
- dialog boxes
- mouse control
20SeqWeb Interface
- http//gcg.genome.uab.edu
- Flat-file version
- http//gcg.genome.uab.edu90
- Relational database version
- Web-based access to GCG programs
- Online help
21Sequence Analysis Texts
22A Practical Guide to the Analysis of Genes and
Proteins
- Â Strongly Recommended
- Editors
- Andreas Baxevanis
- B.F.Francis Ouellette
- John Wiley Sons, 1998
23Sequence, Structure, and Databanks A Practical
Approach
- Editors
- Des Higgins
- Willie Taylor
- Oxford Univ Press, 2000
24Sequence Analysis Primer
- Editors
- Michael Gribskov
- John Devereux
- Groves Press, 1991
25Sequence Analysis in Molecular Biology
- Gunnar von Heijne, Editor
- Academic Press, 1987
26Phylogenetic Analysis Texts
27Molecular Systematics
- David M. Hillis and Craig Moritz (eds.)
- Sinauer Associates, 1990
28MacClade
- Analysis of Phylogeny and Character Evolution
- Wayne P. Maddison and David R. Maddison
- Program and Manual
- Sinauer Associates, 1992
29PAUP 4.0
- Phylogenetic Analysis Using Parsimony
- David L. Swofford
- Program and Manual
- Mac
- PC (DOS)
- UNIX
- Sinauer Associates, 1998
- Available later this year
30Molecular Evolution
- Computer Analysis of Protein and Nucleic Acid
Sequences - Methods in Enzymology, Vol. 183, Doolittle, R.F.
(ed.) - Academic Press, 1990
31Lecture Schedule
32http//genome.microbio.uab.edu/MIC753/schedule.htm
33 Getting Started
- Introduction Communications
- Using Computers
- UNIX
- Running GCG Programs
- Text
- Graphics
- SeqLab
- SeqWeb
34Data
- Sequence and Other Data
- SeqStore
- Genbank Services
- Sequence Editing
- Fragment Assembly
- Genbank submission
35Sequence Patterns
- Mapping
- Pattern Recognition
- Protein Analysis
36Sequence Similarities
- Sequence Comparison
- Database Searching
- Multiple Sequence Analysis
- Prediction of functional motifs
- Profile Analysis
37RNA
- Secondary Structure Prediction
38Phylogenetic Analysis
39MicroArray Analysis
- Aubrey Hill
- Director, Bioinformatics
- UAB Comprehensive Cancer Center
40Genome Analysis
- Mapping
- Sequencing
- Analysis
41Getting some real work done
- Graphics and Publication
- Working with your results
42General Lecture Outline
- What the program does
- What the program wants
- How the program works
- How can I manipulate the program?
- What do I get in the end?
43Internet-based Tools
- What is available
- How do I find them?
- How do I use them?
44Grading
45Class Projects
- Project 1 - Preparation of Web-based
Bioinformatic Solutions - Project 2 - Analysis of a gene family
- Understanding
- Practice
A
46Preparation of a Web-based Bioinformatic Solution
- Each student will design a web site that
describes online, web-based solutions that are
available to help solve a particular
bioinformatic problem. - Due Date Monday, May 21
47Possible Projects
- Protein motif identification
- Protein structure prediction
- Protein Families
- RNA Secondary structure prediction
- Human genome analysis
- Mouse genome analysis
- Microbial genome analysis
- Database access and searching
- Evolutionary analysis
- Sequence assembly
- Microarray analysis
48Project 1 Guidelines
- You should have your topic approved by Elliot
prior to beginning this project. The list below
provides examples of bioinformatic problems that
may be appropriate, or choose your own. - Your web site should include links to the
appropriate web pages along with descriptions of
what is available and a how-to guide to provide
on-line help in using the bioinformatic tools. - You should also provide a review of the described
sites that focuses on their ability to solve the
problem at hand, and ease-of-use. - You can utilize whatever web-creation software
you wish. Recommendations of easy-to-use web
design programs are Microsoft FrontPage for the
PC and Adobe PageMill for the Macintosh. - Up to TWO students may work on this project
together and submit a common set of files for
their web site.
49Analysis of a gene family
- Each student will be required to identify a
sequence family that they will analyze in
different ways throughout the course. Each
student must be working on a unique set of genes.
Elliot will approve each choice on a first-come,
first-serve basis. So it is to your advantage to
accomplish the following assignment as soon as
possible. - Due Date Monday, June 11
50Project 2 Guidleines
- Pick an interesting gene family that contains at
least six members either in multiple species, or
in multiple alleles. - Email a description of the gene family to Elliot
(ElliotL_at_uab.edu) and why you think it will be
worthwhile to study. - Elliot will let you know if your choice is
satisfactory. If not, you will have to choose
another set of genes. - Throughout the course, you should apply the
various analytical programs demonstrated in class
to your gene family. It will be your decision to
choose which programs to use in your analysis.
Your analysis should be compressive and relevant
to the chosen gene family. - Prepare a short journal-type paper describing
your work.
51Computers and Communications
52Analyze the Sun UNIX Server
- Sun E450 Server
- analyze.genome.uab.edu
- gcg.genome.uab.edu
53Analyze Accounts
- Provided to all registered students
- Feel free to experiment
- Do Not provide access to other individuals
- Username first initial last name
- elefkowitz
- Password
- Social security number
54Analyze Accounts
- Others may register for access
- http//genome.microbio.uab.edu/GCG/GCGatUAB.htm
- Username gcg
- Password uab
55Change Your Password!
- On analyze
- passwd
- SeqWeb
- Preferences
- Change password
56Run The Programs!!
- Only way to learn
- Your grade depends on it!
57Required Hardware
- Computer with a modem or ethernet connection to
the Internet - Macintosh
- Windows PC
58Required Software
- Web Browser
- Internet Explorer v5.0
- Netscape v4.6
- Communications Program
- Email account and client
- Send me your Email Address!
59Communications Software
- VT100 or VT220 Terminal Emulation
- Text
- Tektronix 4105 or 4014
- Graphics
- XWindows
60Macintosh
- NCSA Telnet 0
- vt100/graphics
- freeware
- VersaTerm-Pro 195
- vt100/graphics
- Synergey Software
- eXodus 149
- XWindows
- White Pine Software
61PC
- Tera Term Pro 0
- vt100/graphics
- Freeware
- SmarTerm Office 316
- vt100/graphics
- Esker Software
- WinX-32 149 (60 student license)
- XWindows
- eXodus 149
- XWindows
- White Pine Software
62Connection to analyze
- Internet Connection
- Ethernet
- Direct campus connection
- Modem
- Through ISP
- Domain name
- gcg.genome.uab.edu
63For more information
- See the web site
- http//genome.microbio.uab.edu/MIC753/
64Next Up