Note

Go to the end to download the full example code or to run this example in your browser via JupyterLite or Binder.

Online Covariance Hyperparameter Tuning#

This tutorial shows how to tune covariance estimator hyperparameters in an online setting using OnlineGridSearch and OnlineRandomizedSearch.

The online approach is equivalent to combining scikit-learn’s GridSearchCV (or RandomizedSearchCV) with WalkForward using expand_train=True, but instead of refitting every candidate from scratch at each split, it calls partial_fit to incrementally update the estimator. This is significantly faster for estimators that support this method.

Data#

We load the S&P 500 dataset composed of the daily prices of 20 assets from the S&P 500 Index composition starting from 2010-01-04 up to 2022-12-28.

import pandas as pd
import plotly.express as px
import plotly.graph_objects as go
from plotly.io import show
from scipy.stats import uniform

from skfolio.datasets import load_sp500_dataset
from skfolio.metrics import (
    diagonal_calibration_loss,
    make_scorer,
    portfolio_variance_qlike_loss,
)
from skfolio.model_selection import (
    OnlineGridSearch,
    OnlineRandomizedSearch,
    online_score,
)
from skfolio.moments import RegimeAdjustedEWCovariance
from skfolio.preprocessing import prices_to_returns

prices = load_sp500_dataset()
X = prices_to_returns(prices)
X = X["2010":]

Build Scorers#

We build scorers with make_scorer. We set response_method=None because a covariance estimator is a non-predictor estimator (it does not implement predict), and greater_is_better=False because both losses are minimized.

qlike_scorer = make_scorer(
    portfolio_variance_qlike_loss,
    greater_is_better=False,
    response_method=None,
)

calibration_scorer = make_scorer(
    diagonal_calibration_loss,
    greater_is_better=False,
    response_method=None,
)

OnlineGridSearch#

We now tune RegimeAdjustedEWCovariance with OnlineGridSearch.

We search over half_life, corr_half_life, and regime_half_life. corr_half_life controls the correlation smoothing separately from the variance half-life, while regime_half_life controls how quickly the regime adjustment adapts to market changes.

Each candidate is evaluated with a full online walk-forward pass. Here, warmup_size=252 uses the first year for initialization and test_size=5 evaluates windows of 5 consecutive daily observations (one trading week).

grid_search = OnlineGridSearch(
    estimator=RegimeAdjustedEWCovariance(),
    param_grid={
        "half_life": [20, 40, 60],
        "corr_half_life": [40, 80],
        "regime_half_life": [10, 20],
    },
    scoring=qlike_scorer,
    warmup_size=252,
    test_size=5,
    n_jobs=-1,
)
grid_search.fit(X)

OnlineGridSearch(estimator=RegimeAdjustedEWCovariance(), n_jobs=-1,
                 param_grid={'corr_half_life': [40, 80],
                             'half_life': [20, 40, 60],
                             'regime_half_life': [10, 20]},
                 scoring=make_scorer(portfolio_variance_qlike_loss, greater_is_better=False, response_method=None),
                 test_size=5)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

OnlineGridSearch

iFitted

Parameters

	estimator	RegimeAdjustedEWCovariance()
	param_grid	{'corr_half_life': [40, 80], 'half_life': [20, 40, ...], 'regime_half_life': [10, 20]}
	scoring	make_scorer(p...e_method=None)
	warmup_size	252
	test_size	5
	freq	None
	freq_offset	None
	previous	False
	purged_size	0
	reduce_test	False
	refit	True
	error_score	nan
	return_predictions	False
	portfolio_params	None
	n_jobs	-1
	verbose	0

estimator: RegimeAdjustedEWCovariance

RegimeAdjustedEWCovariance()

RegimeAdjustedEWCovariance

Parameters

	half_life	40
	corr_half_life	None
	hac_lags	None
	regime_half_life	None
	regime_target	PORTFOLIO
	regime_method	FIRST_MOMENT
	regime_portfolio_weights	None
	regime_multiplier_clip	(0.7, ...)
	regime_min_observations	None
	min_observations	None
	assume_centered	True
	nearest	True
	higham	False
	higham_max_iteration	100

Let’s display the best grid-search hyperparameters and score:

print(f"Grid best params: {grid_search.best_params_}")
print(f"Grid best score: {grid_search.best_score_:.6f}")

Grid best params: {'corr_half_life': 80, 'half_life': 60, 'regime_half_life': 10}
Grid best score: 6.964519

OnlineRandomizedSearch with Multi-Metric Scoring#

We can also use OnlineRandomizedSearch, which samples from continuous distributions instead of evaluating a full grid. Here, we search over the same three hyperparameters with 100 random combinations.

We track both QLIKE and calibration loss. Since multi-metric search requires an explicit selection rule, we set refit="neg_qlike" so that the best estimator is selected according to the QLIKE scorer.

random_search = OnlineRandomizedSearch(
    estimator=RegimeAdjustedEWCovariance(),
    param_distributions={
        "half_life": uniform(loc=20, scale=40),
        "corr_half_life": uniform(loc=40, scale=40),
        "regime_half_life": uniform(loc=10, scale=10),
    },
    n_iter=100,
    scoring={
        "neg_qlike": qlike_scorer,
        "neg_calibration_loss": calibration_scorer,
    },
    refit="neg_qlike",
    warmup_size=252,
    test_size=5,
    n_jobs=-1,
    random_state=1,
)
random_search.fit(X)

OnlineRandomizedSearch(estimator=RegimeAdjustedEWCovariance(), n_iter=100,
                       n_jobs=-1,
                       param_distributions={'corr_half_life': <scipy.stats._distn_infrastructure.rv_continuous_frozen object at 0x7fa3e2b54830>,
                                            'half_life': <scipy.stats._distn_infrastructure.rv_continuous_frozen object at 0x7fa42c3910f0>,
                                            'regime_half_life': <scipy.stats._distn_infrastructure.rv_continuous_frozen object at 0x7fa4265a5f20>},
                       random_state=1, refit='neg_qlike',
                       scoring={'neg_calibration_loss': make_scorer(diagonal_calibration_loss, greater_is_better=False, response_method=None),
                                'neg_qlike': make_scorer(portfolio_variance_qlike_loss, greater_is_better=False, response_method=None)},
                       test_size=5)

OnlineRandomizedSearch

iFitted

Parameters

	estimator	RegimeAdjustedEWCovariance()
	param_distributions	{'corr_half_life': <scipy.stats....x7fa3e2b54830>, 'half_life': <scipy.stats....x7fa42c3910f0>, 'regime_half_life': <scipy.stats....x7fa4265a5f20>}
	n_iter	100
	scoring	{'neg_calibration_loss': make_scorer(d...e_method=None), 'neg_qlike': make_scorer(p...e_method=None)}
	warmup_size	252
	test_size	5
	freq	None
	freq_offset	None
	previous	False
	purged_size	0
	reduce_test	False
	refit	'neg_qlike'
	random_state	1
	error_score	nan
	return_predictions	False
	portfolio_params	None
	n_jobs	-1
	verbose	0

estimator: RegimeAdjustedEWCovariance

RegimeAdjustedEWCovariance()

RegimeAdjustedEWCovariance

Parameters

	half_life	40
	corr_half_life	None
	hac_lags	None
	regime_half_life	None
	regime_target	PORTFOLIO
	regime_method	FIRST_MOMENT
	regime_portfolio_weights	None
	regime_multiplier_clip	(0.7, ...)
	regime_min_observations	None
	min_observations	None
	assume_centered	True
	nearest	True
	higham	False
	higham_max_iteration	100

Let’s display the best randomized-search hyperparameters and score:

print(f"Random best params: {random_search.best_params_}")
print(f"Random best score (neg_qlike): {random_search.best_score_:.6f}")

Random best params: {'corr_half_life': np.float64(76.13518082249016), 'half_life': np.float64(42.947179466891434), 'regime_half_life': np.float64(10.02870327031159)}
Random best score (neg_qlike): 6.963007

Online Score#

OnlineRandomizedSearch already stores the aggregate online scores of all sampled candidates in cv_results_. We use online_score below only to evaluate a baseline specification that was not part of the search. This also illustrates the standalone scoring API.

baseline_cov = RegimeAdjustedEWCovariance(
    half_life=40,
    corr_half_life=80,
    regime_half_life=20,
)
cv_results = random_search.cv_results_
best_idx = random_search.best_index_

baseline_scores = online_score(
    baseline_cov,
    X,
    warmup_size=252,
    test_size=5,
    scoring={
        "neg_qlike": qlike_scorer,
        "neg_calibration_loss": calibration_scorer,
    },
)
tuned_scores = {
    "neg_qlike": cv_results["mean_score_neg_qlike"][best_idx],
    "neg_calibration_loss": cv_results["mean_score_neg_calibration_loss"][best_idx],
}

Let’s compare the baseline and tuned scores. The tuned scores are retrieved directly from the search results, while the baseline is scored separately with online_score. Since the scorers negate the losses, higher values indicate better performance:

print("Baseline scores:")
print(baseline_scores)
print("Tuned scores:")
print(tuned_scores)

Baseline scores:
{'neg_qlike': 6.92239803023065, 'neg_calibration_loss': -0.7392869425387292}
Tuned scores:
{'neg_qlike': np.float64(6.96300726860151), 'neg_calibration_loss': np.float64(-0.7197863505305562)}

Search Trade-Off Plot#

The scatter plot below summarizes the aggregate online losses of the 100 sampled parameter combinations, with the selected candidate highlighted and the baseline shown for reference.

results = pd.DataFrame(cv_results["params"])
results["qlike_loss"] = -cv_results["mean_score_neg_qlike"]
results["calibration_loss"] = -cv_results["mean_score_neg_calibration_loss"]

fig = px.scatter(
    results,
    x="qlike_loss",
    y="calibration_loss",
    color="regime_half_life",
    hover_data=["half_life", "corr_half_life", "regime_half_life"],
    color_continuous_scale="Viridis",
    labels={
        "qlike_loss": "QLIKE loss",
        "calibration_loss": "Diagonal calibration loss",
        "regime_half_life": "Regime half-life",
    },
    title="Online Random Search: QLIKE vs Calibration Loss",
)
fig.update_traces(marker=dict(size=9, opacity=0.75, line=dict(width=0)))

fig.add_trace(
    go.Scatter(
        x=[results.loc[best_idx, "qlike_loss"]],
        y=[results.loc[best_idx, "calibration_loss"]],
        mode="markers+text",
        name="Selected candidate",
        text=["Selected"],
        textposition="top right",
        marker=dict(symbol="star", size=16, color="green", line=dict(width=1)),
        showlegend=False,
    )
)
fig.add_trace(
    go.Scatter(
        x=[-baseline_scores["neg_qlike"]],
        y=[-baseline_scores["neg_calibration_loss"]],
        mode="markers+text",
        name="Baseline",
        text=["Baseline"],
        textposition="bottom right",
        marker=dict(symbol="x", size=13, color="black", line=dict(width=2)),
        showlegend=False,
    )
)
show(fig)

Conclusion#

This tutorial demonstrated how to tune online covariance estimator hyperparameters.

Build scorers with make_scorer and response_method=None for non-predictor estimators.
Use OnlineGridSearch for small, structured search spaces.
Use OnlineRandomizedSearch for larger continuous search spaces, specifying refit when scoring is multi-metric.
Evaluate and compare estimators numerically with online_score.

In the next tutorial, we move from covariance tuning to end-to-end online portfolio optimization evaluation with MeanRisk.

Total running time of the script: (1 minutes 3.255 seconds)

Gallery generated by Sphinx-Gallery