Inside endeavr.ai's Four-Model Deep-Learning Ensemble
2026-03-09 · endeavr.ai Research
There is no single best neural architecture for financial time series. Each captures different structure and fails in different ways. That is exactly why we use four of them and stack their predictions rather than betting the platform on one.
The four base models
An LSTM with attention captures momentum and mean-reversion over weeks. A GRU with skip connections offers a lighter recurrent view that trains faster and resists overfitting. A Temporal CNN in the WaveNet style reads multi-scale patterns through dilated convolutions. A Transformer encoder relates any two days in the lookback window directly through self-attention.
Why stacking beats any single model
Because the four models make uncorrelated errors, a stacking meta-learner that combines them cancels idiosyncratic mistakes and produces a more robust forecast than the best individual model. It is the same reason a panel of independent forecasters beats any single pundit.
Calibrated, not just confident
Raw model probabilities are usually poorly calibrated. We calibrate ensemble confidence on held-out data so that a 70%-confidence prediction is genuinely right about 70% of the time — which is what makes conviction-weighted position sizing meaningful.