Data Analysis Workshop - PowerPoint PPT Presentation

1 / 25
About This Presentation
Title:

Data Analysis Workshop

Description:

Data Analysis Workshop ... Showing evidence of differences Estimate Population Parameters Demonstrate equivalence Demonstrate associations Prediction Reliability ... – PowerPoint PPT presentation

Number of Views:120
Avg rating:3.0/5.0
Slides: 26
Provided by: charle760
Category:

less

Transcript and Presenter's Notes

Title: Data Analysis Workshop


1
Data Analysis Workshop
  • Chuck Spiekerman (cspieker_at_u)
  • Karl Kaiyala (kkaiyala_at_u)

2
Course Outline
  • February 20
  • How to describe your study
  • Choosing an Analysis method
  • March 13
  • Student presentations of study designs and
    data-analysis plans
  • March 20
  • Student presentations of data analyses

3
Describing your study
  • Next session (3/13) we are asking you to present
    a description of your planned study
  • The next few slides give an outline of suggested
    components of this description
  • Attention to all these components should help you
    (and/or a consultant) decide on appropriate
    methods of statistical analysis

4
Study Design Description
  • Specific Aims (what?)
  • Background (why?)
  • Previous work (who?)
  • Study methods (how?)
  • several components
  • optional for student presentations

5
Specific Aims
  • Describe the scientific question(s)
  • Be specific and precise
  • Stick to the study at hand

6
Background and Motivation
  • Relevance of this research
  • Existing knowledge
  • Identify gap this research will fill
  • Relate to specific aims
  • If part of a larger study, where does this study
    fit?

7
Study Methods Components
  • Primary outcomes
  • Study population
  • Methods and procedures
  • Data analysis plan
  • optional for student presentations

8
Primary Outcomes
  • Precise definition of key measurement (individual
    data item) of interest
  • Justify why this outcome and not something else.
  • Relate to specific aim
  • Details of collection can be left to methods and
    procedures section

9
Study population
  • How were the subjects selected?
  • Exclusion and inclusion criteria
  • Group classification?
  • Matching?
  • Randomization?

10
Data analysis plan
  • Outline data analysis for each specific aim
  • Make clear which procedures are being used toward
    which aim
  • Usually some simple tables and plots should be
    sufficient
  • Keep it simple

11
Forming an analysis plan
  • Two important questions
  • What do you want to do/show?
  • What kind of data
  • will answer your question best?
  • can you get?
  • do you have?

12
Types of data
  • Continuous
  • Differences between values have meaning, and are
    interpretable independent of the values
    themselves
  • E.g. difference between 8 and 9 basically the
    same as difference between 1 and 2.
  • Ordinal
  • Values have an order, but differences are not
    easily interpretable (e.g. good, fair, poor)

13
Types of data (cont.)
  • Categorical
  • Values are descriptive but do not have any
    obvious ordering. E.g. tx A, tx B, tx C.
  • Binary, Dichotomous
  • Fancy names for categorical variables with only
    two possible values.

14
Types of data (sampling)
  • one-sample
  • Refers to situation when values of interest all
    come from one group and will be compared to a
    known quantity (e.g. change greater than zero)
  • two-sample
  • When data are divided/sampled in two groups and
    observed values compared between groups.

15
What do you want to do?
  • Show evidence of differences
  • Estimate population parameters
  • Demonstrate equivalence
  • Show evidence of association
  • Create/validate a predictive model
  • Assess agreement or reliability
  • Other?

16
Showing evidence of differences
  • Standard hypothesis testing procedures, usually
    comparing means or proportions
  • Which test will depend on type of data. Usual
    suspects (YMMV)
  • T-test or ANOVA for Continuous data
  • Chi-square test for Categorical data
  • Rank-based tests (e.g. Wilcoxon) for Ordinal data
  • Use Rosner flowchart for guidance
  • Supplement p-value with estimate of difference
    (with confidence interval)

17
Estimate Population Parameters
  • P-values and hypothesis tests arent always
    necessary
  • Sometimes you dont really want to compare things
    but only estimate values
  • Estimate parameters of interest and supplement
    with confidence intervals (IMPORTANT!) .

18
Demonstrate equivalence
  • In some instances the goal is to show equivalence
    of, say, two treatments.
  • Failing to show a difference using a standard
    hypothesis test is usually not sufficient
    evidence of equivalence
  • Two strategies
  • Estimate difference and show worst cases with
    confidence interval
  • Compute a standard hypothesis test with very good
    power (gt 95)

19
Demonstrate associations
Independent variable outcome variable outcome variable
Independent variable dichotomous continuous
categorical Chi-square Logistic regression T-test/ANOVA Linear regression
continuous Logistic regression T-test/ANOVA (backwards) Correlation Linear regression Scatterplots
20
Prediction
  • Dichotomous outcome
  • Logistic regression
  • Sensitivities, specificities
  • ROC curves (continuous predictor)
  • Continuous outcome
  • Linear regression
  • Leave one out statistics or cross validation
  • Predictive model building
  • assessing predictive model

21
Reliability/Agreement
  • Kappa statistic is commonly used for categorical
    data and two raters.
  • Intra-class correlation coefficient for multiple
    raters
  • If you have a gold standard it makes the most
    sense to tabulate percent correct or average
    distance from correct.

22
more Reliability/Agreement
  • If trying to demonstrate agreement between two
    continuous measures the correlation coefficient
    is tangential at best
  • Better to tabulate statistics related to mean
    pairwise differences between judges
  • See
  • Bland JM, Altman DG. (1986). Statistical methods
    for assessing agreement between two methods of
    clinical measurement. Lancet, i, 307-310.
  • Available at http//www-users.york.ac.uk/mb55/mea
    s//ba.htm

23
Other?
  • Time-to-event data
  • Kaplan-Meier survival estimate
  • Cox regression
  • Other other?

24
Correlated Data Issues
  • Data consist of clusters of correlated
    observations. This is common in dental studies
    (many teeth from same mouth)
  • Common Solutions?
  • Collapse data to independent units (patient-level
    averages)
  • Adjust for correlation using generalized
    estimating equations (GEE) or mixed model
    regression approaches

25
Homework for Feb. 29
  • Following the guidelines presented in class
    today, present a concise description of your
    study and planned data analysis to the class.
  • Plan to keep your talk under ____ minutes
  • Limited office hours will be available with
    myself and Dr. Kaiyala to help. Call or email us
    for appointments.
Write a Comment
User Comments (0)
About PowerShow.com