Computational methods for genetic mapping of quantitative traits - PowerPoint PPT Presentation

1 / 30

About This Presentation

Title:

Computational methods for genetic mapping of quantitative traits

Description:

Lessons learned in industry and academia. Heritable traits, examples ... Examples: Blood pressure, cholesterol levels, growth rate in e.g. farm animals. ... – PowerPoint PPT presentation

Number of Views:105

Avg rating:3.0/5.0

Slides: 31

Provided by: joha121

Category:

more less

Transcript and Presenter's Notes

Title: Computational methods for genetic mapping of quantitative traits

1
Computational methods for genetic mapping of
quantitative traits

Kajsa Ljungberg
February 17, 2006

2
Kajsa Ljungberg

M.Sc. Biotechnology Engineering (civilingenjör
Molekylär bioteknik), Uppsala.
Ph.D. Scientific Computing, Uppsala.
Various computational projects at e.g.
AstraZeneca and UC San Francisco

3
Overview

Background of my Ph.D. project Quantitative
traits, genetics, experiments.
My work Mathematical problem, methods
(principles only), results.
Two related problems in computational biology.
Lessons learned in industry and academia.

4
Heritable traits, examples

Earlobes Loose or attached
Blood group A, B, AB or 0

5
Qualitative traits

Can be divided into classes in a natural way.
Often governed by a single gene, and in such
cases the genetic basis is relatively easy to
study
Examples Earlobes, blood group, Huntingtons
disease.

6
Heritable traits, more examples

Height
Hair colour
Blood pressure

7
Quantitative traits

Vary continuously on some scale.
Often governed by multiple interacting genes and
the environment.
Examples Blood pressure, cholesterol levels,
growth rate in e.g. farm animals.
Most traits of economic or medical importance are
quantitative.

8
QTL

Abbreviation of Quantitative Trait Locus
Region in the genome where one or several genes
influencing a quantitative trait are located.
Can be found using QTL mapping, and is an
important step in the process of finding the
individual genes.

9
HUGO and QTL mapping

HUGO project
Sequence ? Gene ? Function/Trait
QTL mapping
Function/Trait ? QTL ( ? Gene)

10
Experiments in QTL mapping
0000000000000000000000000000000000
1111111111111111111111111111111111
0000000000000000000000000000000000
0000000000000000011111111111111111
0000000000000000011110000011111111
0000000000000000000011111111111110
0000000000000000000000001111111111
0000000000000000000000000001111111
0000000000000000011111111100000111
11
Fit model A(x)by (bb0 b1). b0 denotes the
mean weight and b1 the effect of the genotype.
Experiment? model
1111000001111111111 0001111111111111000 1111111111
110000000 0000000111111111111 0000000000111111111
1111111111111000111 ...
11 11 11 10 10 11 .. A(x)
1672g 945g 213g 1212g 418g 744g .. y
x
12
If 1 QTL only

Residual norm
x
The genome, all chromosomes lined up
Model
Experiment data A(x), y
Least-squares problem
Residual norm
13
If 1 QTL only, contd.

Residual norm

x
The genome, all chromsomes lined up
The genome position with the smallest residual
norm (best model fit) indicates the most likely
position of the QTL.
14
If 2 QTL
Searching for two QTL corresponds to a
two-dimensional optimization problem.
15
Computational problem
Searching for n QTL corresponds to
an n-dimensional optimization problem, where the
objective function is the residual norm of a
least-squares problem.
16
My project

Minimize f(x), i.e. find the genome positions
giving the best model fit, faster.
Two main strategies
1) Speed up the computation of the residual norm.
Linear algebra, updated QR factorizations...
2) Speed up the multidimensional search for the
optimum. DIRECT algorithm

17
Results

2-3 orders of magnitude speed-up for typical
problems, even more for complicated models.
Computational times measured in minutes instead
of days.
This allows for more thourough data exploration
and model testing.

18
(No Transcript)
19

PSEUDOMARKER 2.02
Hao Wu1, Saunak Sen2, Kajsa Ljungberg3, Karl W.
Broman4, Gary A. Churchill1
1The Jackson Laboratory, Bar Harbor, ME
2Department of Epidemiology and Biostatistics,
University of California San Francisco, San
Francisco, CA
3Department of Information Technology, Division
of Scientific Computing, Uppsala University,
Uppsala, Sweden.
4Department of Biostatistics, Johns Hopkins
University, Baltimore, MD
http//www.jax.org/staff/churchill/labsite/softwar
e/pseudomarker/

20
Related problems, 1

Gene expression levels from microarray
experiments are also quantitative traits.
QTL analysis on microarray data can help
distinguish between covariation and causation.
Hot topic!

21
Related problems, 2

QTL analysis in human populations (different
statistical issues).
Examples at AstraZeneca
1) Procardis risk factors heart disease.
2) Identify QTL related to adverse side
effects, e.g liver problems (Exanta).

22
Lessons learned

(as a computational expert among
experimentalists)
The importance of visualization
Differences in language
Real data is a pain
People use what they know
? Future projects?

End of presentation.

24
Thanks

Sverker Holmgren
Örjan Carlborg
Kateryna Mishchenko
Mahen Jayawardena
Leif Andersson
Martina Hägglund
Forskarskolan i Matematik och Beräkningsvetenskap
Linné-centrum

25
Paper 1
Standard method
Genetic info
Experiment data
Computation
Constant info
Prediction
Genetic info
Computa
-tion
Constant info
Prediction
26
Paper 1 (Ljungberg, Holmgren, Carlborg)

Model
Experiment data
Computation
Prediction Real value Difference
27
Paper 2 (Ljungberg, Holmgren, Carlborg)
Difference between prediction real value

The genome, all chromosomes lined up
Search carefully in regions with good values, but
only sparsely in regions with bad values.
28
Paper 3 (Ljungberg, Mishchenko, Holmgren)
Difference between prediction real value

The genome
When a region with good values has been found,
use a special algorithm which is efficient in
finding the bottom of the closest valley.
29
Paper 4 (Ljungberg)
100, 010, 000, 111 11 00 01 11
2
Model
Experiment data
Computation
Prediction
30
Paper 5 (Jayawardena, Holmgren, Ljungberg)

Difference between prediction real value

Model
Model
Model
Data
Computation
Computation
Data
Computation
Data
Perform several computations simultaneously in an
organized and efficient manner.

Write a Comment

User Comments (0)