Comparing models visually#

Martin Vonk and Davíd Brakenhoff, Artesia 2022

In this notebook we introduce the CompareModels class in Pastas that can be used to compare models (visually), and construct custom model comparison plots.

import pandas as pd

import pastas as ps

ps.set_log_level("ERROR")
ps.show_versions()

Pastas     : 2.0.0
Python     : 3.14.6
Numpy      : 2.4.6
Pandas     : 3.0.5
Scipy      : 1.18.0
Matplotlib : 3.11.1
Numba      : 0.66.0

Load Time Series#

First load some data to construct models that we can compare with one another.

rain = pd.read_csv("./data/rain_nb1.csv", index_col=0, parse_dates=True).squeeze()
evap = pd.read_csv("./data/evap_nb1.csv", index_col=0, parse_dates=True).squeeze()
obs1 = pd.read_csv("./data/head_nb1.csv", index_col=0, parse_dates=True).squeeze()
obs2 = pd.read_csv("./data/nb18_head.csv", index_col=0, parse_dates=True).squeeze()

Create models#

Model1a: observations series 1 with linear RechargeModel and Exponential response function
Model1b: observations series 1 with linear RechargeModel and Gamma response function
Model1c: observation series 1 with precipitation and evaporation as separate stresses
Model2: has observation series 2 with linear RechargeModel and Exponential response function

ml1a = ps.Model(obs1, name="1a_exp")
ps.ArNoiseModel(model=ml1a)
sm1a = ps.RechargeModel(
    model=ml1a, prec=rain, evap=evap, rfunc=ps.Exponential(), name="recharge"
)
ml1a.solve(report=False)

ml1b = ps.Model(obs1, name="1b_gamma")
ps.ArNoiseModel(model=ml1b)
sm1b = ps.RechargeModel(
    model=ml1b, prec=rain, evap=evap, rfunc=ps.Gamma(), name="recharge"
)
ml1b.solve(report=False)

ml1c = ps.Model(obs1, name="1c_separate")
ps.ArNoiseModel(model=ml1c)
sm2_1 = ps.StressModel(
    model=ml1c, stress=rain, rfunc=ps.Gamma(), name="prec", settings="prec"
)
sm2_2 = ps.StressModel(
    model=ml1c, stress=evap, rfunc=ps.Gamma(), name="evap", settings="evap", up=False
)
ml1c.solve(report=False)

ml2 = ps.Model(obs2, name="model_2")
ps.ArNoiseModel(model=ml2)
sm2 = ps.RechargeModel(
    model=ml2, prec=rain, evap=evap, rfunc=ps.Exponential(), name="recharge"
)
ml2.solve(report=False)

CompareModels#

To compare models, just pass a list of models to ps.CompareModels. To plot the default comparison plot use the plot() method.

The class itself is linked to a figure and a set of axes, so for each comparison a new CompareModels class should be created.

mc = ps.CompareModels(models=[ml1b, ml1a])
mc.plot(figsize=(10, 6), layout="tight")

../_images/b07ad4eb72a1ef55fb8310146ac431f25945af8e96b76c5b890091c3e3af5f29.png

The layout of the plot is controlled by a so-called mosaic, which is essentially a 2D array with labels that define the positions of the axes. The mosaic for the plot above can be accessed through the mc.mosaic attribute. The oseries and model simulations are plotted in the “sim” axes which covers a 2x2 region at the top left of the figure.

mc.mosaic

[['sim', 'sim', 'met'],
 ['sim', 'sim', 'tab'],
 ['res', 'res', 'tab'],
 ['con0', 'con0', 'rf0']]

Access to the axes or the figure is available through mc.axes dictionary (e.g. for modifying axes labels, limits, or ticks) or mc.figure (e.g. for saving the figure).

# access the axes dictionary
mc.axes

{'sim': <Axes: label='sim'>,
 'met': <Axes: label='met'>,
 'tab': <Axes: label='tab'>,
 'res': <Axes: label='res'>,
 'con0': <Axes: label='con0'>,
 'rf0': <Axes: label='rf0'>}

Customizing the comparison#

Perhaps you want to view all contributions on the same subplot (and the step responses as well). For this we need to customize the default plot layout and tell the plotting method we want several stresses to be plotted on the same axis.

Customizing the layout (mosaic) can either be done manually, by providing a list of lists with axes labels, or we can modify the default mosaic slightly with mc.get_default_mosaic. By setting the number of stressmodels to 1 in this method there will be only one row for the contributions and response functions.

We are now comparing models 1a and 1c (which had “prec” and “evap” as separate stresses).

# initialize the comparison
mc = ps.CompareModels(models=[ml1a, ml1c])

# get a custom mosaic by modifying the default mosaic slightly
mosaic = mc.get_default_mosaic(n_stressmodels=1)
mosaic

[['sim', 'sim', 'met'],
 ['sim', 'sim', 'tab'],
 ['res', 'res', 'tab'],
 ['con0', 'con0', 'rf0']]

The default behavior (when no custom mosaic is provided) is shown below. Note the difference, with 3 rows showing up for plotting stress models.

# default mosaic when no customization is applied
mc.get_default_mosaic()

[['sim', 'sim', 'met'],
 ['sim', 'sim', 'tab'],
 ['res', 'res', 'tab'],
 ['con0', 'con0', 'rf0'],
 ['con1', 'con1', 'rf1'],
 ['con2', 'con2', 'rf2']]

In order to force the plot() method to plot all stressmodels on the same axes we have to pass it some extra information. This extra information is given as the smdict and is a dictionary that contains an integer index as a key (i.e, 0, 1, …) and a list of stress model names as its value. The following dictionary tells CompareModels to combine any stress models with names “recharge”, “Prec” or “Evap” from any model in the comparison list on the first row (with index 0).

smdict = {0: ["recharge", "prec", "evap"]}

# initialize the figure with our custom mosaic
mc.initialize_figure(mosaic=mosaic, figsize=(12, 8), layout="tight")

# now plot the model comparison
mc.plot(smdict=smdict)

../_images/9882862426cd68f280177d55b0f56a340d31e89583d1cbfefa05e9521bd17f7a.png

Using individual plotting methods#

Each component (i.e. time series or table) in the plots above is controlled by a separate method, making it easy to plot certain components separately. Check out all the methods starting with plot_* to see which options are available. When one of these methods is called separately after creating a CompareModels object, a single axis object is created on which the time series for each model are shown.

# compare model simulations
mc = ps.CompareModels(models=[ml1b, ml1a])
ax = mc.plot_simulation()
_ = ax.legend(loc=(0, 1), frameon=False, ncol=2)

../_images/4d4eb8e1869cf41b00dfcb367750b06d4f216b5e1b794a2a9f9d2fcc01578e83.png

# compare model optimal parameters
mc = ps.CompareModels(models=[ml1a, ml1b, ml1c])
ax = mc.plot_table_params()

../_images/121820d76bece118bd0bc0bf0c40f0620fd8e50c646b1a28873bf6762824d78b.png

# compare ACF plots
mc = ps.CompareModels(models=[ml1a, ml1c])
ax = mc.plot_acf()
ax.grid(True)

../_images/647e64c834ce39f8381a04ff7b2a0603cd531ff0cb2b534de291d9074fc50157.png

Some helper functions#

The ps.CompareModels class contains some helper methods to obtain information from the models passed to the class. Using these can be especially useful to customize the tables you wish to show on your comparison figure.

# get minimum tmin and maximum tmax
mc.get_tmin_tmax()

	tmin	tmax
1a_exp	1985-11-14	2015-06-28
1c_separate	1985-11-14	2015-06-28

# get table with all parameters
mc.get_parameters()

	1a_exp	1c_separate
recharge_A	685.680375	NaN
recharge_a	159.429753	NaN
recharge_f	-1.307338	NaN
constant_d	27.922326	28.411486
noise_alpha	49.715104	46.458858
prec_A	NaN	569.395138
prec_n	NaN	1.035145
prec_a	NaN	113.001347
evap_A	NaN	-1051.817385
evap_n	NaN	1.022711
evap_a	NaN	183.199199

# get table with parameters selected by substring
mc.get_parameters(param_selection=["_A"])

	1a_exp	1c_separate
evap_A	NaN	-1051.817385
prec_A	NaN	569.395138
recharge_A	685.680375	NaN

# get table with all p-values of statistical tests
mc.get_diagnostics()

	1a_exp	1c_separate
Shapiroo	0.00	0.00
D'Agostino	0.00	0.00
Runs test	0.43	0.08
Stoffer-Toloi	0.08	0.04

# get table with fit metrics
mc.get_metrics()

	1a_exp	1c_separate
rmse	0.114378	0.108860
rmsn	0.079589	0.078663
sse	8.424993	7.631687
mae	0.089996	0.083519
me	0.000275	0.001283
nse	0.929187	0.935855
evp	92.918780	93.586409
rsq	0.929187	0.935855
kge	0.963232	0.958353
bic	-3235.185967	-3234.924339
aic	-3257.524460	-3270.665929
aicc	-3257.430416	-3270.439157

Equal vertical scaling between subplots#

It is possible set the vertical scale equal for all the subplots. Just initialize the figure with initialize_adjust_height_figure() instead of initialize_figure(). Note that this does require the default naming convention for the mosaic to be used (i.e. axes labels must include "sim", "res" and "con*").

Note: the scaling is not perfect, probably because space taken up by the xticklabels, the legend and perhaps some other unknown quantities are not taken into consideration in the calculations, causing some small differences in the y-scales per subplot.

mc = ps.CompareModels(models=[ml1a, ml1c])
mc.plot(adjust_height=True, layout="tight")

../_images/e3a5d330d41abc2d94210c893b377166f1bba61d14ae9649d222fbfe6c5fe537.png

If you want to customize the figure yourself and use the adjusted height functionality, make sure that you provide the smdict to the initialize_adjust_height_figure() method. Keep in mind that only the first column of the mosaic is used for scaling.

mosaic = [
    ["sim", "sim", "met"],
    ["sim", "sim", "tab"],
    ["res", "res", "tab"],
    ["con0", "con0", "rf0"],
    ["con1", "con1", "rf1"],
]

smdict = {0: ["prec"], 1: ["recharge", "evap"]}

mc = ps.CompareModels([ml1a, ml1c])
mc.initialize_adjust_height_figure(
    mosaic=mosaic, smdict=smdict, figsize=(12, 8), layout="tight"
)
mc.plot(legend=True)

../_images/75cdaaabffb67e6ccf1b0993693a66137d44cc9a3ac6f5652bb38fcc7a86cd72.png

Going a bit overboard#

Just to show you what is possible, here is an extreme example in which we do the following:

compare 2 models that are related (ml1a and ml1c with the same oseries), and one that isn’t (ml2)
create a custom mosaic by manually providing one
plot just about every comparison we can think of
combine all the contributions of the different stresses on the same subplot
manually share the x-axes between certain plots
choose a different qualitative colormap

Note that this comparison doesn’t make all that much sense, but it does show you how easy it is to create custom comparison plots.

mosaic = [
    ["ose", "ose", "met"],
    ["sim", "sim", "tab"],
    ["res", "res", "tab"],
    ["con0", "con0", "dia"],
    ["acf", "acf", "dia"],
]

mc = ps.CompareModels(models=[ml1a, ml1c, ml2])
mc.initialize_figure(mosaic, figsize=(12, 8), cmap="Dark2")

# plot oseries on "ose" axis
mc.plot_oseries(axn="ose")

# plot simulation on "sim" axis
mc.plot_simulation()

# plot metrics
mc.plot_table_metrics()

# table of optimal parameters but only those containing the gain "_A"
mc.plot_table_params(param_selection=["_A"])

# plot residuals
mc.plot_residuals()

# plot all contributions on the same axis
mc.plot_contribution(smdict={0: ["Prec", "Evap", "Rech", "recharge"]}, axn="con{i}")

# plot p-value for diagnostic tests
mc.plot_table_diagnostics(axn="dia", diag_col=r"Reject H0 ($\alpha$=0.05)")

# plot ACF
mc.plot_acf(axn="acf")

# turn grid on
for axlbl in mc.axes:
    mc.axes[axlbl].grid(True)

# share x-axes between plots
mc.share_xaxes([mc.axes["ose"], mc.axes["sim"], mc.axes["res"], mc.axes["con0"]])

# set tight layout
mc.figure.tight_layout(pad=0.0)

../_images/c26528856f10d7bde8b2d27183c65955f27927ba72a224b056a9eef8aed81156.png

mc = ps.CompareModels(models=[ml1a, ml1c, ml2])

mosaic = [
    ["ose", "ose", "met"],
    ["sim", "sim", "tab"],
    ["res", "res", "tab"],
    ["con0", "con0", "tab"],
    ["con1", "con1", "tab"],
    ["stress", "stress", "dia"],
    ["acf", "acf", "dia"],
]

smdict = {0: ["recharge", "prec"], 1: ["evap"]}

mc.initialize_adjust_height_figure(
    mosaic, figsize=(12, 10), cmap="Dark2", smdict=smdict, layout="tight"
)
mc.plot_oseries(axn="ose")
mc.plot_simulation()
mc.plot_table_metrics(metric_selection=["evp", "rsq"])
mc.plot_table_params(param_selection=["_A"], param_col="stderr")
mc.plot_residuals()
mc.plot_stress()
mc.plot_contribution(axn="con{i}")
mc.plot_table_diagnostics(axn="dia", diag_col="Statistic")
mc.plot_acf(axn="acf")
for axlbl in mc.axes:
    mc.axes[axlbl].grid(True)
ps.plots.share_xaxes(
    [
        mc.axes["ose"],
        mc.axes["sim"],
        mc.axes["res"],
        mc.axes["con0"],
        mc.axes["con1"],
        mc.axes["stress"],
    ]
)

../_images/7f115eb3e252fed9554583b99ae5d5fed991382d9e0a2c3774683dba47dd1599.png