
Conversation

cnhwl
Contributor

@cnhwl cnhwl commented Apr 23, 2025

Checklist before merging this PR:

  • Mentioned all issues that this PR fixes or addresses.
  • Summarized the updates of this PR under Summary.
  • Added an entry under Unreleased in the Changelog.

Fixes #2773

Summary

When a forecasting model uses `output_chunk_shift > 0` and `RegressionEnsembleModel` is given `regression_train_n_points == -1` (or some large value), the forecasting model's prediction fails with the following error:

        if self.output_chunk_shift and is_autoregression:
            raise_log(
                ValueError(
                    "Cannot perform auto-regression `(n > output_chunk_length)` with a model that uses a "
                    "shifted output chunk `(output_chunk_shift > 0)`."
                ),
                logger=logger,
            )

Therefore, this PR limits `regression_train_n_points` of `RegressionEnsembleModel` so that it is no larger than the forecasting model's `output_chunk_length`, and more precisely no larger than the forecasting model's `output_chunk_length` minus its maximum lag.
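The clamping idea can be sketched as follows (a minimal illustration; the function name and the `max_lag` parameter are assumptions for this sketch, not the actual Darts internals):

```python
def clamp_train_n_points(regression_train_n_points, output_chunk_length, max_lag):
    """Illustrative clamp for `regression_train_n_points` (hypothetical helper).

    The training forecasts for the ensemble must fit within a single output
    chunk of the base model; otherwise predict() would need auto-regression
    (n > output_chunk_length), which is forbidden when output_chunk_shift > 0.
    """
    upper = output_chunk_length - max_lag
    if regression_train_n_points == -1 or regression_train_n_points > upper:
        return upper
    return regression_train_n_points
```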


codecov bot commented Apr 23, 2025

Codecov Report

❌ Patch coverage is 68.42105% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 95.11%. Comparing base (033fafe) to head (4e82626).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...ts/models/forecasting/regression_ensemble_model.py | 68.42% | 6 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##           master    #2789      +/-   ##
==========================================
- Coverage   95.22%   95.11%   -0.11%     
==========================================
  Files         146      146              
  Lines       15573    15583      +10     
==========================================
- Hits        14829    14822       -7     
- Misses        744      761      +17     


Collaborator

@dennisbader dennisbader left a comment


Thanks for giving this a go @cnhwl. However, I think we need to adapt the proposed solution.

Here are some points:

  • It should be possible to train an ensemble model on base forecasting models that use an output_chunk_shift. Requirements:
    • All models must use the same output_chunk_shift value.
    • All models must use the same output_chunk_length value.
    • In case of base models using output_chunk_shift, the actual regression_model (the ensemble model) must also use the same output_chunk_shift. In that case we need to check that the future covariates lags for regression_model are {"future": [output_chunk_shift]} (see here)
  • After that: the first predict() call in RegressionEnsembleModel.fit() (see here) should probably not be performed when we use historical forecasts to fit the model. This predict call is anyway only used to validate that all series have the expected time index. Can we find another way to validate that all models have the required time frames? Maybe we can perform a check on the generated historical forecasts.
  • Given all of the above, the model should be able to generate the desired forecasts.
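The requirements above could be validated along these lines (a hedged sketch; the function name is hypothetical and attribute names mirror common Darts conventions, but this is not the actual patch):

```python
from types import SimpleNamespace  # stand-in for real model objects in this sketch


def validate_ensemble_models(forecasting_models, regression_model):
    """Illustrative validation of the ensemble requirements (sketch only)."""
    shifts = {m.output_chunk_shift for m in forecasting_models}
    lengths = {m.output_chunk_length for m in forecasting_models}
    if len(shifts) > 1:
        raise ValueError("All base models must use the same `output_chunk_shift`.")
    if len(lengths) > 1:
        raise ValueError("All base models must use the same `output_chunk_length`.")
    shift = shifts.pop()
    if shift > 0:
        # the ensemble's regression model must use the same shift, and its
        # future covariates lags must be exactly {"future": [shift]}
        if regression_model.output_chunk_shift != shift:
            raise ValueError(
                "`regression_model` must use the same `output_chunk_shift`."
            )
        if regression_model.lags.get("future") != [shift]:
            raise ValueError(
                "`regression_model` future covariates lags must be "
                f'{{"future": [{shift}]}}.'
            )


# demo with stand-in objects: same shift and length on all base models passes
base = [SimpleNamespace(output_chunk_shift=2, output_chunk_length=12)] * 2
reg = SimpleNamespace(output_chunk_shift=2, lags={"future": [2]})
validate_ensemble_models(base, reg)
```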

@cnhwl
Contributor Author

cnhwl commented Apr 30, 2025

> Thanks for giving this a go @cnhwl. However, I think we need to adapt the proposed solution. […requirements quoted in full above…]

Hi @dennisbader! I have implemented the three requirements by checking `output_chunk_shift` and `output_chunk_length`.
I still keep the code that handles the case `self.regression_model.output_chunk_length > self.forecasting_models[0].output_chunk_length - input_shift` to avoid auto-regression. If you have better ideas on series length assignment (`forecast_training` and `regression_target`), please let me know. 🤝
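The condition being kept can be illustrated with a small predicate (names are illustrative stand-ins; the real check lives inside `RegressionEnsembleModel`):

```python
def regression_needs_autoregression(regression_ocl, forecasting_ocl, input_shift):
    """True when the ensemble's regression model would require auto-regression:
    its output chunk is longer than what one (shifted) chunk of the base
    forecasting models can provide. Illustrative sketch only."""
    return regression_ocl > forecasting_ocl - input_shift
```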

Successfully merging this pull request may close these issues.

[BUG] RegressionEnsembleModel fails with base estimators that use output_chunk_shift > 0