Postprocessing Calibration - PowerPoint PPT Presentation

1
Postprocessing Calibration
The icing on the cake for high-utility stochastic
weather prediction
2
Presentation Overview
  • Why Calibrate?
    - The need
    - The goal
  • Aspects of Calibration
    - 1st Moment Calibration: Model Bias Correction
    - 2nd Moment Calibration: Ensemble Spread Correction
    - Reality Calibration: Downscaling
  • Ground Truth Issues
    - Choices
    - Length of Training
  • What to Know
    - Checklist
  • Sample Ensemble System Evaluations

3
The Need for Calibration
  • Ensemble estimates the true (long-term verification) Probability Density Function (PDF)
  • In a large, ideal ensemble, Forecast Probability (FP) = Observed Relative Frequency (ORF)

(Figure: evolution of the ensemble PDF from the initial state through the 24-hr and 48-hr forecast states, compared against the true PDF; vertical axis: probability density or frequency)
4
Calibration Goal
  • Increase ensemble skill and utility by maximizing:
  • 1) Reliability
    - Forecast Probability = Observed Relative Frequency
    - Statistical Consistency:
      Mean Square Error of EF Mean = Ensemble Variance
      or, over time, the verifying obs. are indistinguishable from the ensemble members
  • 2) Sharpness
    - Ability to distinguish between events and non-events
    - A narrow (sharp) ensemble PDF is better
      -- Results in probabilities closer to 0% or 100%
      -- BUT, must maximize sharpness while ensuring reliability
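The statistical-consistency condition above (MSE of the ensemble-mean forecast matching the average ensemble variance) can be checked numerically over a verification sample. A minimal sketch with synthetic data (all names and numbers here are illustrative, not from the presentation):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic, statistically consistent ensemble: the verifying obs and
# every member are drawn from the same distribution about each case.
n_cases, n_members = 5000, 10
signal = rng.normal(0.0, 3.0, n_cases)            # case-to-case signal
truth = signal + rng.normal(0.0, 1.0, n_cases)    # verifying observations
members = signal[:, None] + rng.normal(0.0, 1.0, (n_cases, n_members))

mse_of_mean = np.mean((members.mean(axis=1) - truth) ** 2)
avg_variance = np.mean(members.var(axis=1, ddof=1))
print(mse_of_mean, avg_variance)  # close for a consistent ensemble
```

For a finite n-member ensemble the two quantities differ by roughly a factor of (n+1)/n even when the ensemble is perfectly consistent, so the check is on approximate equality, not identity.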

5
1st Moment Calibration (Model Bias Correction)
6
1st Moment Calibration (Model Bias Correction)
FOCUS: Increase reliability by recentering the forecast PDF
7
Model Bias
  • Statistically significant value of the forecast mean error
  • But the forecast-analysis relationship may be linear or nonlinear
  • Major influences on model biases:
    - Model design (core, physics, parameterizations, etc.)
    - Analysis (model initial condition)
    - Weather regime (location, season, weather pattern, etc.)
    - Time of day and forecast lead time

(Figure: NOGAPS 24-h MSLP forecast vs. analysis (mb) in eastern WA)
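First-moment calibration amounts to removing the training-period mean error from the forecast. A minimal additive sketch (not the actual scheme of any system shown here; the slide notes the relationship may be nonlinear, in which case a regression fit would replace this constant shift):

```python
import numpy as np

def bias_correct(fcst_train, obs_train, fcst_new):
    """Subtract the mean forecast error (bias) estimated over a
    training period from a new forecast."""
    bias = np.mean(np.asarray(fcst_train) - np.asarray(obs_train))
    return np.asarray(fcst_new) - bias

# Toy training sample with a +2.0 mb systematic error:
fcst_train = [1002.0, 1008.0, 1012.0]
obs_train  = [1000.0, 1006.0, 1010.0]
print(bias_correct(fcst_train, obs_train, [1005.0]))  # -> [1003.]
```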
8
Late Afternoon Temperature Bias
9
Early Morning Temperature Bias
10
  • Univ. of WA Mesoscale Ensemble
  • http://www.atmos.washington.edu/ens/uwme.cgi
  • Eight 12-km MM5 members over the Pacific NW
  • Initial condition and model perturbations

(Figure: average RMSE (°C) and average bias (shaded) of raw, uncorrected 2-m temperature for each of the 8 members and the mean, at 12, 24, 36, and 48 h; from Eckel and Mass, WAF, 2003)
11
  • Univ. of WA Mesoscale Ensemble (as above)

(Figure: average RMSE (°C) and average bias (shaded) of bias-corrected 2-m temperature for each of the 8 members and the mean, at 12, 24, 36, and 48 h; from Eckel and Mass, WAF, 2003)
12
  • Univ. of WA Mesoscale Ensemble

(Figure: reliability diagram for P(36-h MSLP < 1001 mb), comparing the raw ensemble, the bias-corrected ensemble, and sample climatology)
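A reliability diagram like this one is built by binning the issued forecast probabilities and tallying the observed relative frequency in each bin. A sketch (the function name and the synthetic data are mine, for illustration):

```python
import numpy as np

def reliability_curve(fcst_prob, outcomes, n_bins=10):
    """Bin forecast probabilities and compute the observed relative
    frequency in each bin; a reliable ensemble plots on the FP = ORF
    diagonal of the reliability diagram."""
    fcst_prob = np.asarray(fcst_prob, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    idx = np.clip(np.digitize(fcst_prob, edges) - 1, 0, n_bins - 1)
    fp, orf = [], []
    for b in range(n_bins):
        mask = idx == b
        if mask.any():
            fp.append(fcst_prob[mask].mean())     # mean forecast prob in bin
            orf.append(outcomes[mask].mean())     # observed rel. frequency
    return np.array(fp), np.array(orf)

# Synthetic perfectly reliable forecasts: outcomes occur with exactly
# the issued probability, so the curve should hug the diagonal.
rng = np.random.default_rng(0)
p = rng.uniform(0.0, 1.0, 20000)
hits = rng.uniform(0.0, 1.0, 20000) < p
fp, orf = reliability_curve(p, hits)
```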
13
2nd Moment Calibration (Ensemble Spread
Correction)
14
2nd Moment Calibration (Ensemble Spread Correction)
FOCUS: Increase reliability by adjusting the width of the forecast PDF. Most commonly, this means increasing the PDF width to account for insufficient dispersion (spread) in the raw ensemble. Tricky part: the tails of the PDF.
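The simplest form of spread correction is linear inflation of each member's departure from the ensemble mean. This is a sketch of that one technique only; in practice the factor would be tuned so that the average ensemble variance matches the MSE of the ensemble mean over a training period:

```python
import numpy as np

def inflate_spread(members, factor):
    """Widen (factor > 1) or narrow (factor < 1) the ensemble PDF by
    scaling each member's departure from the ensemble mean. Note the
    slide's caveat: linear inflation does nothing clever about the
    PDF tails, which remain the hard part."""
    members = np.asarray(members, dtype=float)
    mean = members.mean(axis=-1, keepdims=True)
    return mean + factor * (members - mean)

# Mean (10.0) is preserved; departures are scaled by 1.5:
print(inflate_spread([9.0, 10.0, 11.0], 1.5))
```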
15
Calibration using Rank Histograms
24-h precip forecast (10 members, mm): 7.1, 7.7, 8.7, 9.0, 10.2, 11.0, 11.2, 14.9, 16.2, 19.2
Event threshold: 12.70 mm (0.5 inch)
Event probability, three ways:
  • Democratic voting (uncalibrated): 3 of the 10 members exceed the threshold, so P = 3/10 = 30%
  • Uniform ranks (uncalibrated): the verification is equally likely to fall in any of the 11 rank positions defined by the sorted members, giving P = 32.6%
  • Weighted ranks (calibrated): rank probabilities taken from a training-period rank histogram give P = 42.2%
References: Hamill and Colucci (MWR, 1997, 1998); Eckel and Walters (WAF, 1998; used at UKMO)
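The rank-weighted probability above can be sketched as follows. With uniform weights this closely reproduces the slide's 32.6% uncalibrated value; the calibrated 42.2% would need the training-derived rank weights, which the slide does not give. The tail handling here is a crude assumption of my own, echoing the "tricky part: tails" caveat:

```python
def rank_prob(members, threshold, rank_weights=None):
    """P(verification > threshold) from rank-histogram calibration.

    rank_weights[i] is the (training-derived) probability that the
    verification falls in rank position i among the n+1 positions
    defined by the sorted members; uniform weights reproduce the
    uncalibrated uniform-ranks case. Interior bins are linearly
    interpolated; an open-ended tail bin crudely contributes its
    whole weight."""
    x = sorted(members)
    n = len(x)
    w = rank_weights or [1.0 / (n + 1)] * (n + 1)
    j = sum(1 for v in x if v <= threshold)   # threshold sits in bin j
    p = sum(w[j + 1:])                        # bins entirely above threshold
    if j == 0 or j == n:                      # open-ended tail bin
        p += w[j]
    else:                                     # interpolate within bin j
        p += w[j] * (x[j] - threshold) / (x[j] - x[j - 1])
    return p

# The slide's 10-member precip example (mm), threshold 12.70 mm:
members = [7.1, 7.7, 8.7, 9.0, 10.2, 11.0, 11.2, 14.9, 16.2, 19.2]
print(rank_prob(members, 12.70))  # ~0.33, vs. 3/10 = 0.30 democratic
```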
16
Calibration using Rank Histograms
P(24-h precip > 0.25 inch)
(from Eckel and Walters, WAF, 1998)
  • Disadvantages
    - Odd PDFs, especially when two ensemble members are close in value
    - Requires a large training period to build robust histograms
    - May reduce resolution
  • Advantages
    - Demonstrated gain in skill from improved reliability
    - Applies readily to different variables, regardless of PDF shape

17
Reality Calibration (Downscaling)
18
Reality Calibration (Downscaling)
FOCUS: Increase ensemble sharpness by incorporating reality:
  - Refine spatial and/or temporal resolution from coarse model output
  - Attempt to predict phenomena outside the model's attractor (e.g., wind in complex terrain, tornadoes, snow on roads, etc.)
(Figure: reality vs. model data)
19
Gridded MOS Concept - Step 2
Example Downscaling: Gridded GFS MOS
  • Blend the first guess and high-density station forecasts
  • Add further detail with high-resolution geophysical data and smart interpolation
  • Use this fine-scale analysis as ground truth to train and downscale gridded forecasts
(Figure panels: model output; model + obs; model + obs + terrain)
Mark S. Antolik, Meteorological Development Laboratory, Statistical Modeling Branch, NOAA/National Weather Service, Silver Spring, MD
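As one concrete illustration of how high-resolution geophysical data adds detail, consider a standard-lapse-rate terrain adjustment of a coarse temperature field. This is a deliberately minimal sketch of the general idea, not the MDL Gridded MOS algorithm, which uses far more sophisticated regression and blending steps:

```python
# Standard-atmosphere lapse rate: temperature falls ~6.5 deg C per km
# of elevation gain (an assumption; real lapse rates vary by regime).
LAPSE_RATE = 6.5 / 1000.0  # deg C per meter

def downscale_temp(coarse_temp_c, coarse_elev_m, fine_elev_m):
    """Shift a coarse-grid temperature to a fine-grid terrain height."""
    return coarse_temp_c - LAPSE_RATE * (fine_elev_m - coarse_elev_m)

# A coarse cell at 500 m says 10 deg C; a valley point at 200 m in
# the same cell comes out warmer:
print(downscale_temp(10.0, 500.0, 200.0))  # -> 11.95
```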
20
Example Downscaling: Gridded GFS MOS
Dew Point Verification
(December 2004 - September 2005)
21
Ground Truth Issues
22
Ground Truth Issues
Calibration is only as good as what is used for truth!
Ideal ground truth is:
  - Accurate and precise
  - Unbiased
  - Independent
  - Thorough
    -- Represents all scales of interest
    -- Encompasses the full range of climatological possibilities
    -- Encompasses the bulk of systematic forecast errors
23
Need for Reforecast Dataset
Example: Attempting to calibrate precip with just a short sample of recent events can be very problematic. You'd like enough training data to have some similar events, at a similar time of year, to this one.
Hamill, T. M., J. S. Whitaker, and X. Wei, 2004: Ensemble re-forecasting: improving medium-range forecast skill using retrospective forecasts. Mon. Wea. Rev., 132, 1434-1447. http://www.cdc.noaa.gov/people/tom.hamill/reforecast_mwr.pdf
Hamill, T. M., J. S. Whitaker, and S. L. Mullen, 2006: Reforecasts, an important dataset for improving weather predictions. Bull. Amer. Meteor. Soc., 87, 33-46. http://www.cdc.noaa.gov/people/tom.hamill/refcst_bams.pdf
24
Example Application: Calibration by Analogs
(Hamill et al., MWR, 2004; BAMS, 2006)
Match today's 24-h cumulative precip. forecast against old forecasts, then build calibrated probability forecasts from the verifying observations of the matches.
(Figure panels: 24-h cumulative precip. forecast; old forecast matches; verifying observations; calibrated probability forecasts P(precip > 25 mm), P(precip > 10 mm), P(precip > 3 mm); verifying 24-h precip.)
(Actually run with 10 to 75 analogs)
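The analog idea can be sketched in a few lines: find the past forecasts closest to today's forecast and form the event probability from their verifying observations. This is a toy in the spirit of Hamill et al. (2004), not their implementation; the distance metric and all data below are illustrative assumptions:

```python
import numpy as np

def analog_probability(todays_fcst, past_fcsts, past_obs, event_thresh,
                       n_analogs=25):
    """Event probability from the verifying obs of the n_analogs past
    forecasts closest (here by RMS difference) to today's forecast."""
    d = np.sqrt(np.mean((past_fcsts - todays_fcst) ** 2, axis=1))
    nearest = np.argsort(d)[:n_analogs]
    return np.mean(past_obs[nearest] > event_thresh)

# Deterministic toy: past forecast patterns (i, i, i, i) verify as i.
past_fcsts = np.array([[i] * 4 for i in range(100)], dtype=float)
past_obs = np.arange(100, dtype=float)
today = np.full(4, 80.0)
p = analog_probability(today, past_fcsts, past_obs, 80.0, n_analogs=5)
# 5 nearest analogs verify as {78, 79, 80, 81, 82}; 2 of 5 exceed 80
print(p)
```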
25
Effect of Training Sample Size
(from Hamill et al., BAMS, 2006)
Colors of dots indicate which size of analog ensemble provided the largest amount of skill.
Q: Do the benefits exceed the cost of building the reforecast dataset?
26
What to Know
27
  • Before using an ensemble, seek to answer:
  • 1) How available are the data products?
  • 2) What is the model resolution and general skill?
  • 3) How many members?
  • 4) What technique accounts for analysis uncertainty?
  • 5) What technique accounts for model uncertainty?
  • 6) How well is the ensemble calibrated?
    a) Basics of technique(s)
      -- Correction for bias, spread, and reality
    b) Extent of calibration
      -- Variables, lead times, etc.
    c) Ground truth quality
      -- Accuracy, scale, length of period, etc.
  • 7) What strengths and weaknesses does V&V (verification and validation) show?

(Figure: cake and icing, recalling the title slide)
28
  • Fleet Numerical Meteorology and Oceanography Center (FNMOC)
  • NOGAPS Global Ensemble
  • Availability: 1 run per day (00Z), 10-day forecast
  • Resolution: T119/L24 (120 km)
  • Members: 18
  • Analysis Perturbation: Bred Mode
  • Model Perturbation: None
  • Calibration: None
  • V&V: ???

https://www.fnmoc.navy.mil/PUBLIC/EFS/efs.html
What is the utility of this product?
29
  • National Centers for Environmental Prediction (NCEP)
  • GFS Global Ensemble
  • Availability: 4 runs per day (09Z, 21Z), 15-day forecast
  • Resolution: T126/L28 (110 km)
  • Members: 15
  • Analysis Perturbation: Ensemble Transform Breeding
  • Model Perturbation: None
  • Calibration: investigating and developing
    - Bias correction
    - 2nd moment
  • V&V: only 500-hPa GPH posted

http://wwwt.emc.ncep.noaa.gov/gmb/ens/
(Using the GFS high-res control)
30
  • National Centers for Environmental Prediction (NCEP)
  • CONUS Short Range Ensemble Forecast (SREF)
  • Availability: 09Z and 21Z, 87-hour forecast
  • Resolution: 32 km
  • Members: 21
  • Analysis Perturbation: Regional Breeding
  • Model Perturbation: 4 models, some multi-physics
  • Calibration: investigating and developing
    - Bias correction coming soon
      -- Summer 2007: basic variables using a "decaying average", with the regional reanalysis as ground truth
      -- Winter 2007: precipitation using a neural network and the Stage-IV precip analysis
    - Spread correction by BMA (Bayesian Model Averaging) in design

http://wwwt.emc.ncep.noaa.gov/mmb/SREF/SREF.html
(Figure: P(winds > 15 kt))
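The "decaying average" bias correction mentioned for SREF can be sketched as an exponentially weighted running mean of the forecast error, updated as each new forecast-observation pair arrives. The weight below is an assumed tuning value for illustration (operational implementations have used weights on the order of a few percent):

```python
def decaying_average_bias(forecasts, observations, weight=0.05):
    """Exponentially weighted (decaying-average) running estimate of
    the forecast bias, updated one forecast-observation pair at a
    time; recent errors dominate, old errors decay away."""
    bias = 0.0
    for f, o in zip(forecasts, observations):
        bias = (1.0 - weight) * bias + weight * (f - o)
    return bias

# A persistent +1.0 error drives the estimate toward +1.0:
b = decaying_average_bias([11.0] * 200, [10.0] * 200)
print(b)
```

The estimated bias would then be subtracted from new forecasts, as in the first-moment correction earlier in the deck.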
31
?