Pulse Brain · Growing Health Evidence Index
Tier 3 — Observational / field trialPeer-reviewed

On the choice of calibration metrics for “high-flow” estimation using hydrologic models

Naoki Mizukami, Oldřich Rakovec, Andrew J. Newman, Martyn Clark, Andrew W. Wood, Hoshin V. Gupta, Rohini Kumar

Hydrology and earth system sciences · 2019

Read source ↗ All evidence

Summary

This study challenges the conventional practice of using squared-error metrics like NSE for calibrating hydrologic models, demonstrating that such calibrations substantially underestimate annual peak flows critical for flood frequency estimation. Across 492 United States basins, the authors show that Kling–Gupta efficiency (KGE) produces superior high-flow estimates compared to NSE, with an alternative application-specific metric (APFB) performing best for annual peaks but at the cost of poorer performance on other high-flow metrics. The findings suggest that metric selection during model calibration fundamentally shapes hydrologic predictions relevant to flood risk management.

UK applicability

The methodological findings regarding calibration metric selection are internationally applicable to UK hydrologic modelling practice, particularly for flood forecasting and water resource management. However, the study's calibration was conducted on United States basins with distinct hydroclimatic characteristics, so direct quantitative conclusions may require validation using UK streamflow data.

Key measures

Nash–Sutcliffe efficiency (NSE), Kling–Gupta efficiency (KGE) and variants, annual peak flow bias (APFB), annual peak flow estimates, flow time series metrics (mean and variance)

Outcomes reported

The study evaluated how different calibration metrics (NSE, KGE, and APFB) affect hydrologic model performance in simulating high-flow events across 492 basins in the contiguous United States. Results showed that NSE-based calibrations produced annual peak flow estimates more than 20% worse than KGE-based calibrations, primarily due to NSE's tendency to underestimate flow variability.

Theme
Measurement & metrics
Subject
Measurement methods & nutrient profiling
Study type
Research
Study design
Field trial
Source type
Peer-reviewed study
Status
Published
Geography
United States
System type
Other
DOI
10.5194/hess-23-2601-2019
Catalogue ID
BFmor3gf2d-bwll2v

Topic tags

Pulse AI · ask about this record

Dig deeper with Pulse AI.

Pulse AI has read the whole catalogue. Ask about this record, its theme, or how the findings apply to UK farming and policy — every answer cites the underlying studies.