Calibration#

R.A. Collenteur, University of Graz

After a model is constructed, the model parameters can be estimated using the ml.solve method. It can (and will) happen that the model fit after solving is not as good as expected. This may be the result of the settings that are used to solve the model or the way the model was constructed. In this notebook common pitfalls and various tips and tricks that may help to improve the calibration of Pastas models are shared.

In general, the following strategy is advised to solve problems with the parameter estimation:

  1. Check the input time series and solve settings

  2. Change the initial parameters,

  3. Change the model structure,

  4. Change the solve method.

import pandas as pd

import pastas as ps

ps.show_versions()
ps.set_log_level("ERROR")
Pastas     : 2.0.0
Python     : 3.14.6
Numpy      : 2.4.6
Pandas     : 3.0.3
Scipy      : 1.18.0
Matplotlib : 3.11.0
Numba      : 0.65.1

Loading the data#

In the following code-block some example data is loaded. It is good practice to visualize all time series before creating the time series model.

head = (
    pd.read_csv("data/B32C0639001.csv", parse_dates=["date"], index_col="date")
    .squeeze()
    .loc["1985":]
)

# Make this millimeters per day
evap = (
    pd.read_csv("data/evap_260.csv", index_col=0, parse_dates=[0])
    .squeeze()
    .loc["1985":"2003"]
)
rain = (
    pd.read_csv("data/rain_260.csv", index_col=0, parse_dates=[0])
    .squeeze()
    .loc["1985":"2003"]
)

ps.plots.series(head, [evap, rain], table=True);
../_images/af5137dd2658d813b7dd8b5dd18a44f0904b7951ffbf1cb4b20005406122eecd.png

Make a model#

Given the data above we create a Pastas model with a non-linear recharge model (ps.FlexModel) and a constant to simulate the groundwater level. We’ll use this model to show how we may analyse different types of problems and how to solve them.

ml = ps.Model(head)
ml.add_noisemodel(ps.ArNoiseModel())
rch = ps.rch.FlexModel()
rm = ps.RechargeModel(rain, evap, recharge=rch, rfunc=ps.Gamma(), name="rch")
ml.add_stressmodel(rm)

Calibrating a model#

In the above code-block a Pastas model was created, but not yet solved. To solve the model we call ml.solve(). This method has quite a few options (see also the docstring of the method) that influence the model calibration, for example:

  • tmin/tmax: select the time period used for calibration

  • noise: use a noise model to model the residuals or not

  • fit_constant: fit the constant as a parameter or not

  • warmup: length of the warmup period

  • solver: the solver that is used to estimate parameters

  • freq_obs: frequency of the observations to use during calibration

We start without providing any arguments to the solve method.

# ml.solve?  ## Run this to see other solve options
ml.solve()
ml.plots.results(figsize=(10, 6));
Fit report solver                   Fit Statistics
==================================================
nfev     47                     EVP          75.12
nobs     463                    R2            0.75
noise    True                   RMSE          0.10
tmin     1985-01-15 00:00:00    AICc      -2549.03
tmax     2005-10-14 00:00:00    BIC       -2512.19
freq     D                      Obj            nan
freq_obs None                   ___               
warmup   3650 days 00:00:00     Interp.         No

Parameters (9 optimized)
==================================================
                optimal     initial   vary
rch_A          0.340380    0.612817   True
rch_n          0.655559    1.000000   True
rch_a        214.265376   10.000000   True
rch_srmax     56.772369  250.000000   True
rch_lp         0.250000    0.250000  False
rch_ks        19.639437  100.000000   True
rch_gamma      3.798627    2.000000   True
rch_kv         0.857894    1.000000   True
rch_simax      2.000000    2.000000  False
constant_d     0.923783    1.356415   True
noise_alpha   68.718022   15.000000   True
../_images/c4238ff7e9ff2e5f43055b05d31f7d0d14facbcf13513e9921b00af330eace42.png

The fit report and the Figure above show that the model is not that great. The parameters have large standard errors, the goodness-of-fit metrics are not that high, and the simulated time series shows a very different behavior to the observed groundwater level.

Checking the explanatory time series and solve settings#

A common pitfall is that there is a problem with the explanatory time series (e.g., precipitation, pumping discharge). This should be the first thing to check when the model fit is not as good as expected.

  • Length of Time Series: The time series should in principle be available for the entire period of calibration,

  • Warmup Period: For some models it is necessary that the time series are also available before the calibration period, during the warmup period. This is for example the case with the non-linear recharge models (e.g., FlexModel, Berendrecht).

  • Units of Time Series (1): While Pastas is in principle unitless, the units of the time series can impact the model calibration. For example, a pumping discharge provided in m\(^3\)/day may lead to very small parameter values (‘Gamma_A’) that are harder to estimate. If you end up with very small parameters for the gain parameter, it may help to rescale the input time series.

  • Units of Time Series (2): The initial parameters and bounds for the non-linear recharge models are set for precipitation and evaporaton time series provided in mm/day. Using these models with time series in m/day will give bad results.

  • Normalization of Time Series: Sometimes it can help to normalize the expanatory time series. For example, when using a river level that is high above a certain datum (e.g. tens of meters), it may help to subtract the mean water level from the time series first.

In the example model, many of these things are happening. First, the precipitation time series are not available for the entire calibration period. Secondly, because a non-linear model is applied, we need to to have precipitation and evaporation data before the calibration period starts (typically about one year is enough). We should therefore shorten the calibration period by using to 1986-2003. Note that we use 3650 days for the warmup period (warmup=3650 is the default), the last 365 days of which now has real precipitation and evaporation data . For the other 9 years the mean flux is used. Finally, the non-linear model requires the evaporation and precipitation in mm/day (unless we want to manually set all parameter bounds).

ml = ps.Model(head)
ml.add_noisemodel(ps.ArNoiseModel())
rch = ps.rch.FlexModel()
rm = ps.RechargeModel(
    rain * 1e3, evap * 1e3, recharge=rch, rfunc=ps.Gamma(), name="rch"
)
ml.add_stressmodel(rm)

ml.solve(tmin="1986", tmax="2003", report=False)
axes = ml.plots.results(
    tmin="1975", figsize=(10, 6)
)  # Use tmin=1975 to show warmup period
axes[0].axvline(pd.Timestamp("1986"), c="k", linestyle="--");  # Start of calibration
../_images/856fdb8e4461004cf8906a6fd529ba6a4ad6a131d2f2a0a62ec66c47caee8b00.png

Changing the explanatory time series and using the correct calibration period definitely improve the model fit in this example. Changing the explanatory time series a bit generally helps to resolve many issues with the calibration. If this does not work, we may try to help the solver a bit.

Improving initial parameters#

Although Pastas tries to set sensible initial parameters when constructing a model, it occurs that the initial parameters set by Pastas are not a great place to start the search for the optimal parameters. In this case, it may be tried to manually adapt the initial parameters using the ml.set_parameter as follows:

ml.set_parameter(
    "rch_n", initial=15.0, pmax=100.0
)  # Clearly wrong, just for educational purposes
ml.solve(tmin="1986", tmax="2003", report=True)
Fit report solver                   Fit Statistics
==================================================
nfev     30                     EVP          35.85
nobs     376                    R2            0.36
noise    True                   RMSE          0.17
tmin     1986-01-01 00:00:00    AICc      -1638.01
tmax     2003-01-01 00:00:00    BIC       -1603.13
freq     D                      Obj            nan
freq_obs None                   ___               
warmup   3650 days 00:00:00     Interp.         No

Parameters (9 optimized)
==================================================
                optimal     initial   vary
rch_A          0.015675    0.020641   True
rch_n          6.701674   15.000000   True
rch_a         15.628691   10.000000   True
rch_srmax    691.848262  250.000000   True
rch_lp         0.250000    0.250000  False
rch_ks        54.690895  100.000000   True
rch_gamma     15.427015    2.000000   True
rch_kv         1.259670    1.000000   True
rch_simax      2.000000    2.000000  False
constant_d     1.062056    1.356415   True
noise_alpha   59.479490   15.000000   True

Often we do not know what good initial parameters are, but we do get a bad fit, like with this initial value for rch_n above. While solving the model with a noise model is recommended, it does make the parameter estimation more difficult and more sensitive to the initial parameter values. One solution that often helps is to first solve the model without a noise model, and then solve the model with a noise model but without re-initializing the parameters.

By default the parameters are initialized upon each solve, such that each time we call solve we obtain the same result. By setting initial=False we prevent the re-initialisation and use the optimal parameters as initial parameters. This can be done as follows:

# First solve without noise model
ml.del_noisemodel()
ml.solve(report=False, tmin="1986", tmax="2003")
# Then solve with noise model, but do not initialize the parameters
ml.add_noisemodel(ps.ArNoiseModel())
ml.solve(initial=False, tmin="1986", tmax="2003", report=True)
axes = ml.plots.results(figsize=(10, 6))
---------------------------------------------------------------------------
KeyboardInterrupt                         Traceback (most recent call last)
Cell In[7], line 3
      1 # First solve without noise model
      2 ml.del_noisemodel()
----> 3 ml.solve(report=False, tmin="1986", tmax="2003")
      4 # Then solve with noise model, but do not initialize the parameters
      5 ml.add_noisemodel(ps.ArNoiseModel())
      6 ml.solve(initial=False, tmin="1986", tmax="2003", report=True)

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/model.py:991, in Model.solve(self, tmin, tmax, freq, warmup, solver, report, initial, weights, fit_constant, freq_obs, initialize, reset_settings, noise, **kwargs)
    988     self.add_solver(solver=LeastSquares())
    990 # Solve model
--> 991 solve_success, result = self.solver.solve(weights=weights, **kwargs)
    992 # Update the parameters with the results from the optimization
    993 for column in result.columns:

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/solver/least_squares.py:826, in LeastSquares.solve(self, weights, **kwargs)
    812     bounds = Bounds(
    813         lb=pmin,
    814         ub=pmax,
    815         keep_feasible=True,
    816     )
    818 objfunction = partial(
    819     self.objfunction,
    820     noise=noise,
   (...)    823     vary=vary,
    824 )
--> 826 self.result = least_squares(
    827     fun=objfunction,
    828     x0=initial[vary],
    829     jac=self.jac,
    830     bounds=bounds,
    831     method=self.method,
    832     ftol=self.ftol,
    833     xtol=self.xtol,
    834     gtol=self.gtol,
    835     x_scale=self.x_scale,
    836     loss=self.loss,
    837     f_scale=self.f_scale,
    838     max_nfev=self.max_nfev,
    839     diff_step=self.diff_step,
    840     tr_solver=self.tr_solver,
    841     tr_options=self.tr_options,
    842     callback=self.callback,
    843     **kwargs,
    844 )
    846 self.pcov = DataFrame(
    847     self.get_covariances(
    848         self.result.jac,
   (...)    854     columns=parameters.index,
    855 )
    857 # Prepare return values

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/_lib/_util.py:660, in _workers_wrapper.<locals>.inner(*args, **kwds)
    658 with MapWrapper(_workers) as mf:
    659     kwargs['workers'] = mf
--> 660     return func(*args, **kwargs)

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_lsq/least_squares.py:1019, in least_squares(fun, x0, jac, bounds, method, ftol, xtol, gtol, x_scale, loss, f_scale, diff_step, tr_solver, tr_options, jac_sparsity, max_nfev, verbose, args, kwargs, callback, workers)
   1015     result = call_minpack(vector_fun.fun, x0, vector_fun.jac, ftol, xtol, gtol,
   1016                           max_nfev, x_scale, jac_method=jac)
   1018 elif method == 'trf':
-> 1019     result = trf(vector_fun.fun, vector_fun.jac, x0, f0, J0, lb, ub, ftol, xtol,
   1020                  gtol, max_nfev, x_scale, loss_function, tr_solver,
   1021                  tr_options.copy(), verbose, callback=callback_wrapped)
   1023 elif method == 'dogbox':
   1024     if tr_solver == 'lsmr' and 'regularize' in tr_options:

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_lsq/trf.py:124, in trf(fun, jac, x0, f0, J0, lb, ub, ftol, xtol, gtol, max_nfev, x_scale, loss_function, tr_solver, tr_options, verbose, callback)
    120     return trf_no_bounds(
    121         fun, jac, x0, f0, J0, ftol, xtol, gtol, max_nfev, x_scale,
    122         loss_function, tr_solver, tr_options, verbose, callback=callback)
    123 else:
--> 124     return trf_bounds(
    125         fun, jac, x0, f0, J0, lb, ub, ftol, xtol, gtol, max_nfev, x_scale,
    126         loss_function, tr_solver, tr_options, verbose, callback=callback)

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_lsq/trf.py:376, in trf_bounds(fun, jac, x0, f0, J0, lb, ub, ftol, xtol, gtol, max_nfev, x_scale, loss_function, tr_solver, tr_options, verbose, callback)
    372 f_true = f.copy()
    374 cost = cost_new
--> 376 J = jac(x)
    377 njev += 1
    379 if loss_function is not None:

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_differentiable_functions.py:750, in VectorFunction.jac(self, x)
    748 def jac(self, x):
    749     self._update_x(x)
--> 750     self._update_jac()
    751     if hasattr(self.J, "astype"):
    752         # returns a copy so that downstream can't overwrite the
    753         # internal attribute. But one can't copy a LinearOperator
    754         return self.J.astype(self.J.dtype)

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_differentiable_functions.py:719, in VectorFunction._update_jac(self)
    716 else:
    717     self._njev += 1
--> 719 self.J = self.jac_wrapped(xp_copy(self.x), f0=self.f)
    720 self.J_updated = True

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_differentiable_functions.py:454, in _VectorJacWrapper.__call__(self, x, f0, **kwds)
    452     self.njev += 1
    453 elif self.jac in FD_METHODS:
--> 454     J, dct = approx_derivative(
    455         self.fun,
    456         x,
    457         f0=f0,
    458         **self.finite_diff_options,
    459     )
    460     self.nfev += dct['nfev']
    462 if self.sparse_jacobian:

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_numdiff.py:611, in approx_derivative(fun, x0, method, rel_step, abs_step, f0, bounds, sparsity, as_linear_operator, args, kwargs, full_output, workers)
    609 with MapWrapper(workers) as mf:
    610     if sparsity is None:
--> 611         J, _nfev = _dense_difference(fun_wrapped, x0, f0, h,
    612                                  use_one_sided, method,
    613                                  mf)
    614     else:
    615         if not issparse(sparsity) and len(sparsity) == 2:

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_numdiff.py:710, in _dense_difference(fun, x0, f0, h, use_one_sided, method, workers)
    708 f_evals = workers(fun, x_generator2(x0, h))
    709 dx = [(x0[i] + h[i]) - x0[i] for i in range(n)]
--> 710 df = [f_eval - f0 for f_eval in f_evals]
    711 df_dx = [delf / delx for delf, delx in zip(df, dx)]
    712 nfev += len(df_dx)

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_numdiff.py:912, in _Fun_Wrapper.__call__(self, x)
    909 if xp.isdtype(x.dtype, "real floating"):
    910     x = xp.astype(x, self.x0.dtype)
--> 912 f = np.atleast_1d(self.fun(x, *self.args, **self.kwargs))
    913 if f.ndim > 1:
    914     raise RuntimeError("`fun` return value has "
    915                        "more than 1 dimension.")

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_differentiable_functions.py:424, in _VectorFunWrapper.__call__(self, x)
    422 def __call__(self, x):
    423     self.nfev += 1
--> 424     return np.atleast_1d(self.fun(x))

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/scipy/optimize/_lsq/least_squares.py:263, in _WrapArgsKwargs.__call__(self, x)
    262 def __call__(self, x):
--> 263     return self.f(x, *self.args, **self.kwargs)

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/solver/least_squares.py:769, in LeastSquares.objfunction(self, p, noise, weights, initial, vary)
    767 par = initial
    768 par[vary] = p
--> 769 return misfit(
    770     ml=self.ml, p=par, noise=noise, weights=weights, callback=self.callback
    771 )

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/solver/objective_function.py:45, in misfit(ml, p, noise, weights, callback, returnseparate)
     43     rv = ml.noise(p) * ml._noise_weights(p)
     44 else:
---> 45     rv = ml.residuals(p)
     47 # Apply weights if provided
     48 if weights is not None:

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/model.py:598, in Model.residuals(self, p, tmin, tmax, freq, warmup)
    593 sim = self.simulate(
    594     p=p, tmin=tmin, tmax=tmax, freq=freq, warmup=warmup, return_warmup=False
    595 )
    597 # Get the oseries calibration series
--> 598 obs = self.observations(tmin=tmin, tmax=tmax, freq=freq_obs)
    599 # Get simulation at the correct indices
    600 if self._interpolate_simulation is None:

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/model.py:780, in Model.observations(self, tmin, tmax, freq, update_observations)
    768 if not update_observations and (
    769     tmin != self.settings["tmin"]
    770     or tmax != self.settings["tmax"]
    771     or freq != self.settings["freq"]
    772 ):
    773     # create a copy, so we do not alter the original self.oseries
    774     oseries = oseries.copy()
    776 oseries.update_series(
    777     tmin=tmin,
    778     tmax=tmax,
    779     freq=freq,
--> 780     time_offset=self.time_offset,
    781     force_update=update_observations,
    782 )
    784 return oseries.series

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/model.py:1328, in Model.time_offset(self)
   1326 time_offsets = set()
   1327 for stressmodel in self.stressmodels.values():
-> 1328     for st in stressmodel.stresses:
   1329         if st.freq_original:
   1330             # calculate the offset from the default frequency
   1331             t = st.series_original.index

File ~/checkouts/readthedocs.org/user_builds/pastas/envs/latest/lib/python3.14/site-packages/pastas/stressmodels.py:1824, in RechargeModel.stresses(self)
   1822 """All the stress time series in the stressmodel as a tuple."""
   1823 if self.temp is None:
-> 1824     nt = namedtuple("StressesTuple", ["prec", "evap"])
   1825     return nt(prec=self.prec, evap=self.evap)
   1826 else:

File ~/.asdf/installs/python/3.14.6/lib/python3.14/collections/__init__.py:515, in namedtuple(typename, field_names, rename, defaults, module)
    512     doc = _sys.intern(f'Alias for field number {index}')
    513     class_namespace[name] = _tuplegetter(index, doc)
--> 515 result = type(typename, (tuple,), class_namespace)
    517 # For pickling to work, the __module__ variable needs to be set to the frame
    518 # where the named tuple is created.  Bypass this step in environments where
    519 # sys._getframe is not defined (Jython for example) or sys._getframe is not
    520 # defined for arguments greater than 0 (IronPython), or where the user has
    521 # specified a particular module.
    522 if module is None:

KeyboardInterrupt: 

After solving the model without a noise model (providing the solver an easier problem), we solve again with the parameter estimated from the solve without a noise model. This generally works well. We may also choose to fix parameters that are hard to estimate, perhaps because they are correlated to other parameters, to certain values.

Changing the model structure#

At this point, one might start to think that the bad fit has something to do with the model structure. This could off course be an explanatory time series that is missing, but let’s assume that is not the case. One thing that might help is to change the response function. This can either be from a complicated function to a simpler function (e.g., Gamma to Exponential) or the other way around (e.g., Gamma to FourParam). Another option could be to change other parts of the model structure, for example by applying a non-linear recharge model instead of a linear model.

## Example to be added

More advanced solve options#

If all of the above does not work, and we still think we have the right model structure and explanatory time series, we can for example:

  • Don’t fit the constant. By default the constant (constant_d) is estimated as a parameter in Pastas. In specific cases it may help to turn this option off (ml.solve(fit_constant=False)).

  • Switch the solver. ps.solver.LeastSquares() is used by default, but ps.solver.Lmfit() provides a lot of different methods for the parameter estimation, from simple least_squares to the use of MCMC.

  • Remove observations from the groundwater level time series . The use of high frequency measurements is known to cause issues when trying to solve a model when using a noise model. See also the example notebook “Reducing Autocorrelation”.

Summary of Tips & Tricks#

In this notebook a variety of methods to improve the calibration result and model fit for Pastas models were shown. Although a specific type of model was used here to demonstrate these methods, the strategy can be applied to other types of time series and model structures as well.

A summary of all tips and tricks that may help to improve the model calibration given below:

  • Change units of input time series

  • Normalize the input time series

  • Change calibration period

  • Lengthen the warmup period

  • Solve first without, then with a noise model

  • Manually change initial parameters

  • Fix parameters

  • Change response functions

  • Fit constant or not

  • Try a different solve method

  • Remove observations