Title: Data in Biology
1Data in Biology
2(No Transcript)
3Lousy data
- Biological data, perhaps more than most other
scientific data, has the following
characteristics
- It is complex
- It is incomplete
- It is error prone
- It contains the answers to questions of
biological function and indeed, dysfunction - It is in high demand by a community with
difficult questions
4What is data?
- Any quantifiable information (e.g., wing length,
number of bristles, gene expression, frequency of
albinos, etc.). - A single datum is not amenable to statistical
analysis - Hence data needs to be obtained from a number of
individuals or groups, not merely from a single
individual, or at multiple times from the same
individual.
5Examples of biological data
- Ecological data
- Behavioral data
- Molecular data (DNA, protein)
- Physiological data
- Population genetic data
- Quantitative genetic data
- Etc.
6Why collect data?
- To describe a behavior, a phenomenon, a
structure, etc. - To test a hypothesis
7Data and hypotheses
- Data are very important to the advancement of
knowledge, but they are only half of the science.
- Biological hypotheses (or research questions) and
ideas are the other half.
8Data and hypothesescontd.
- Think of it in this way
- Hypotheses ( research questions) without data
are not very useful, and - Data collected without hypotheses (or research
questions or haphazardly and without purpose)
are wasted.
9Lets explore the nature of data
- When you collect biological data, what is the
perhaps the most obvious and odd thing about it?
10Variation
Sometimes it is obvious
11Variation
Sometimes it is not
12Why is there variation?
13Variables under direct human control
- If a bank offers an interest of 5, then the
relationship between the interest accumulated and
the period of investment is linear - (y 5x)
- E.g., for 1000.00 invested
Year Int. accum. ( of cap.) 1 5 2 10 3 15 4 2
0 5 25
An important feature of such data is that there
is only one value of the dependant variable for a
given value of the explanatory variable.
14Variables not under direct human control
- Growth rate (of fish, etc.)
- Response of living organisms to treatment (drugs,
hormones, feed, etc.) - Expression levels of a gene in different tissues
- The DNA sequence of a given gene in different
individuals - Daily rainfall in a given city
- The number of mycorrhizal spores at different
depths of a plant root - Etc.
15(No Transcript)
16The changing face of biology
Fig 1.1 from Biometry by Sokal and Rohlf (3rd
editiion), 1995
17Why??
- Biology has gone beyond the descriptive phase.
- Need answers to how and why and not merely
what. - Processes involved in answering these questions
are not deterministic in their effects. - Stochastic or random effects present that
cannot be individually identified.
18Boring vs Interesting Data
- If the response can be determined exactly (no
variation), this is deterministic not in the
realm of statistics - If response varies randomly, then such data is
amenable to statistical analysis
19The free-kick, friends and growth
20What to do with data?
- OK, let us assume you have collected the required
data (which is a whole chapter in itself). - The data refers to the standard length of 1000
tilapia (Oreochromis niloticus) fry - What do you do with it?
21Frequency Distributions
Figure 2.1 from Biometry by Sokal and Rohlf, 3rd
edition, 1995
22Distribution of a variable
- Shape of frequency distribution
- Most shapes can be approximated mathematicallly
- Of considerable biological interest
23Histogram
24Bar diagrams
This bar diagram shows the photosynthesis of
three aquatic plants during experiments when they
were exposed to UV light. The first group of bars
shows that all three plants were healthy at the
start of the experiment. The second group shows
that after 5 hours of UV light one plant is still
healthy but the other two are under stress. The
third group shows photosynthesis after the plants
were left for 20 hours with the UV lights turned
off one plant did not completely recover.
25Stem and Leaf Plots
- Total length of 17 aphids (mm x 10)
- 8.8, 8.4, 9.0, 9.2, 10.1, 6.3,
- 7.8, 10.6, 10.9, 6.5, 7.7, 7.2,
- 8.6, 9.6, 8.5, 9.7, 8.7